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AmPAL22XP10 


24 


TIL 


20, 30, 40 


90, 180,210 


XOR gate 


5-271 


PAL20S10 


24 


TIL 


35 


240 


Product term stooring 


5-103 


AmPAL22P10 


24 


TTL 


15,25 


105,210 


24-pin superset 


5-291 


Am/PAL20L10 


24 


TIL 


15,20,25,30 


105,165,210 


10 outputs 


5-113,5-306 


PAL20L8 


24 


TTL, E CMOS 


15,25,35,45 


0.1, 105,210 


Standard 


5-122 


PAL6L16 


24 


TTL 


25 


90 


Wide output 


5-141 


PAL8L14 


24 


TTL 


25 


90 


Wide output 


5-141 


PAL12L10 


24 


TTL 


40 


100 


Simple combinatorial 


5-147 


PAL14L8 


24 


TTL 


40 


100 


Simple combinatorial 


5-147 


PAL16L6 


24 


TTL 


40 


100 


Simple combinatorial 


5-147 


PAL18L4 


24 


TTL 


40 


100 


Simple combinatorial 


5-147 


PAL20L2 


24 


TTL 


40 


100 


Simple combinatorial 


5-147 


PAL20C1 


24 


TTL 


40 


100 


Simple combinatorial 
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120 


Advanced Async. Macro 
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24 


EE CMOS 
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120 


Advanced Macrocell 


5-231 
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24 


TTL 


25,22 


180 


J-K, varied terms 


5-70 


Am/PAL22V10 


24 


TTL, E CMOS 


40,33,28.5,20,18 


90, 180 


Versatile 


5-79, 5-249 
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24 


TTL 


28.5 


210 


J-K flip-flops 


5-87 


PAL20RA10 


24 


TTL 


20 


200 


Asynchronous 


5-95 


AmPAL20XRP10 


24 


TTL 


30,22, 14 


105,180,210 


XOR gate & polarity 


5-271 


AmPAL20XRP8 


24 


TTL 


30,22, 14 


105,180,210 


XOR gate & polarity 


5-271 


AmPAL20XRP6 


24 


TTL 


30,22, 14 


105,180,210 


XOR gate a polarity 


5-271 
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24 


TTL 


30,22, 14 


105, 180,210 
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5-271 
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24 


TTL 


20 


240 


Product term steering 


5-103 
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24 


TTL 


20 


240 


Product term steering 


5-103 


PAL20RS4 


24 


TTL 


20 


240 


Product term steering 


5-103 
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24 


TTL 


22 


180 


XOR gate 


5-113 
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24 


TTL 


22 


180 


XOR gate 


5-113 
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24 


TTL 


22 


180 


XOR gate 


5-113 
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24 


TTL 


37,25 


105,210 


Programmable polarity 


5-291 
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24 


TTL 


37,25 


105,210 


Programmable polarity 


5-291 
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24 


TTL 


37,25 


105,210 


Programmable polarity 


5-291 
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24 


TTL 


37,25 


105,210 


Programmable polarity 


5-291 
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24 


TTL, E CMOS 


37,25,20, 15 


0.1, 105,210 


Standard 


5-122 
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24 


TTL, E CMOS 


37.25,20,15 


0.1, 105,210 


Standard 


5-122 
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24 


TTL, E CMOS 


37, 25, 20, 15 


0.1, 105,210 


Standard 


5-122 
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40 


TTL 


16 


280 


MegaPAL™ device 
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Advanced Micro Devices reserves the right to make changes 

in its products without notice in order to improve design or performance characteristics. 

The performance characteristics listed in this document are guaranteed by specific tests, 

correlated testing, guard banding, design and other practices common to the industry. 

For specific testing details contact your local AMD sales representative. 
The company assumes no responsibility for the use of any circuits described herein. 

901 Thompson Place, P.O. Box 3453, Sunnyvale, California 94088 
(408) 732-2400 TWX: 910-339-9280 TELEX: 34-6306 



In late 1 987, the two programmable logic market leaders, Monolithic Memories and Advanced Micro 
Devices, merged into one great company. We combined our strong traditions of customer service 
and innovation to offer you the best line of programmable logic devices. The 1988 Data Book 
presents the technical specifications on the entire product line. The separate 1988 Handbook 
presents all of the necessary support material, whether you are accustomed to using MMI or 
AMD products. 

AMD/MMl has the products, manufacturing capacity and technology to perpetuate the leadership 
in the programmable logic field which we pioneered. In the 1988 Data Bookyou will notice that we 
not only have the broadest programmable logic line, but we also have the industry's most 
comprehensive CMOS programmable logic line. We think that you will find the Data Book and 
Handbook informative and useful. If you have any comments or questions on the books or the 
product line, please contact us. 




Cyrus Tsui 

Vice President 
Programmable Logic Division 
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Description 



This 1988 PAL Device Handbook/Data Book is your complete guide to all programmable logic 
devices (PLDs) from Monolithic Memories and Advanced Micro Devices. The merger of the two 
companies provides a greater wealth of products and services for you. Note that all PLDs which 
were in production before the merger are still being produced. 

The PALDevice Handbook/Data Book is organized into two volumes and six easy-to-use sections: 



PAL Device Handbook 

Section 1: Introduction 

Includes an overview of the PLD product family. 

Section 2: Applications 

Includes detailed application examples. The first few chapters provide tutorials in PLD design. 
The application notes are grouped by application area. 



PAL Device Data Book 

Section 3: Programming and Quality 

Includes information on PLD software programs, programming information, PLD technology and 
quality discussions, and package information. 

Section 4: PALASM 2 Software User Documentation 

Includes complete documentation for PALASM 2 software. 

Section 5: Data Sheets 

Includes specifications for all PLDs from the combined company. PAL devices formerly from MMI 
are under "PAL Devices," while PAL devices formerly from AMD are under "AmPAL Devices." 

Section 6: Appendices 

Includes quick reference information. 

If you have any questions or comments on PLDs or any other products, please contact your most 
convenient Af\^D/MMI sales office, listed at the end of each book. 
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The Programmable Array Logic device, commonly known as the 
PAL® device, was invented at Monolithic Memories over 1 2 years 
ago. The concept forthis revolutionary type of device sprang forth 
as a simple solution to the shortcomings of discrete TTL logic. 



each of a given set of input combinations (addresses), it gener- 
ates a value which has been programmed into the device. 



The successfully proven PROM technology which allowed the 
end user to "write on silicon" provided the technological basis 
which made this kind of device not only possible, but very popular 
as well. 

The availability of design software made it much easierto design 
with programmable logic. As designers were freed from the 
drudgery of low-level implementation issues, new complex de- 
signs were easier to implement, and could be completed more 
quickly. 

This chapter outlines some basic information essential to those 
who are unfamiliar with Programmable Logic devices (PLDs). 
The information may also be useful to those who are current users 
of programmable logic. The specific issues which need to be 
addressed are: 

• WhatisaPLD? 

• What other implementations are possible? 

• What advantages do PLDs have over other 
implementations? 

What is a PLD? 

In general, a programmable logic device is a circuit which can be 
configured by the user to perform a logic function. Most "stan- 
dard" PLDs consist of an AND array followed by an OR array, 
either (or both) of which is programmable. Inputs are fed into the 
AND array, which performs the desired AND functions and 
generates product terms. The product terms are then fed into the 
OR array. In the OR array, tfie outputs of the various product 
terms are combined to produce the desired outputs. There are 
three fundamental types of standard PLD: the PROM, the PAL 
device, and the PLS device. 

PROMs 

PROMs are usually thought of as memory elements. However, 
the PROM has a fixed AND array (which decodes the memory 
address) followed by a programmable OR array (Figure 1). For 
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PAL Devices 

The PAL device has a programmable AND array followed by a 
fixed OR array (Figure 2). The fact that the AND array is 
programmable makes it possible for the devices to have many 
inputs. The fact that the OR array is fixed makes the devices small 
(which means less expensive) and fast. 

PLS Devices 

The PLS (Programmable Logic-based sequencer) devices are 
based on the standard PLA architecture (Figure 3), where both 
the AND and the OR arrays are programmable. This arrange- 
ment allows for greater overall flexibility. The architecture is a bit 
more costly in terms of die size and speed, so for simple logic 
functions, a PAL device is usually more cost effective. However, 
this architecture is very effective for sequencers, where the 
flexibility allows larger state machines than might fit in a PAL 
device. 

Other PLDs 

In addition to the basic sum-of-products PLDs, some more 
complex PLDs dedicated to sequencing are available, most 
notably the PROSE™ device and the Am29PL1 41 . Their archi- 
tectures are described elsewhere in the handbook, but their fun- 
damental benefits are the same as those of the more traditional 
PLDs. 



In practice, the distinctions between architectures are not as 
significant as the differences between the types of functions to be 
performed. For this reason, this handbook is organized primarily 
into discussions about PAL devices and discussions about se- 
quencers, whether those sequencers be implemented in PAL 
devices, PLS devices, or one of the dedicated sequencers. 

What Other Implementations Are 
Possible? 

There are essentially four alternatives to programmable logic: 

• Discrete Logic 

• Gate Arrays 

• Standard Cell Circuits 

• Full Custom Circuits 

Discrete Logic 

Discrete logic, or conventional TTL logic, has the advantage of 
familiarity; hence its popularity. It is also quite inexpensive when 
only unit cost is considered. The drawback is that the implemen- 
tation of even a simple portion of a system may require many units 
of discrete logic. There are "hidden" costs associated with each 
unit that goes into a system, which can render the overall system 
more expensive. 
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Designing with discrete chips can also be very tedious. Each 
design decision directly affects the layout of the board. Changes 
are difficultto make. Thedesign is also moredifficult to document, 
making it harder to debug and maintain later. These items ail 
contribute to a long design cycle when discrete chips are used 
extensively. 

Gate Arrays 

Gate arrays have been increasing in popularity. The attractive- 
ness of this solution lies in the device's flexibility. By packing the 
functions into the device, a great majority of the available silicon 
is actually used. Since such a device is customized for an 
application, it would seem to be the optimum device for that 
application. 

However, one also pays substantial development costs, espe- 
cially in the case of a design which needs changes after silicon 
has already been processed. Even though the unit costs are 
generally quite low for gate arrays, the volumes required to make 
their use worthwhile excludes them as a solution for many 
designers. This fact, added to the long design cycle and high risk 
involved, make this solution practical for only a limited number of 
designers. 

Standard Cell Circuits 



then assembled and collected to perform the desired functions. 
This can ideally lead to reduced turnaround from conception to 
implementation, and a much more efficient circuit. 

The drawback is that even though the individual components of 
the circuit have been laid out, a complete layout must still be per- 
formed to arrange the cells. Instead of just customizing the metal 
interconnections, as is done in a gate array, the circuit must be 
developed from the bottom up. Development costs can be even 
higherthan forgate arrays, and despite the standard cell concept, 
turnaround time often tends to be longerthan planned. Again, the 
volume must be sufficiently high to warrant the development 
costs. 

Full Custom Circuits 

Full custom designs require that a specific chip be designed from 
scratch to perform the needed functions. The intent is to provide 
a solution which gives the designer exactly what is needed for the 
application in question; no more and no less. Ideally, not a square 
micron of silicon is wasted. This normally results in the smallest 
piece of silicon possible to fit the needs of the design, which in turn 
reduces the system cost. Understandably, though, development 
costs and risks for such adesign are extremely high, and volumes 
must be commensurately high in order for such a solution to be 
of value. 



Standard cell circuits are quite similar to gate arrays, their main 
advantage being that they consist of a collection of different parts 
of circuits which have already been debugged. These circuits are 
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What Advantages do PLDs Have Over 
Other Implementations? 

As user-programmable semicustom circuits, PLDs provide a 
valuable compromise which combines many of the benefits of 
discrete logic with many of the benefits of other semicustom 
circuits. The overall advantages can be found in several areas: 

• Ease of design 

• Performance 

• Reliability 

• Cost savings 

Ease of Design 

The support tools available for use in designing with PLDs greatly 
simplify the design process by making the lower-level implemen- 
tation details transparent. In a matter of one cr two hours, a first 
time PLD user can learn to design with a PAL device, program it, 
and implement the design in a system. 

Tlie design support tools consist of design software and a 
programmer. The design software is used in generating the 
design; the programmer is used to configure the device. The 
software provides the link between the higher-level design and 
the low-level programming details. 

All of the available design software packages (of which fvlonolithic 
Memories' PALASM® software is the most widely used) perform 
essentially the same tasks. The design is specified with relatively 
high-level constructs; the software takes the design and converts 
it into a form which the programmer uses to configure the PLD. 
Most software packages provide logic simulation, which allows 
one to debug the design before actually programming a device. 
The high-level design file also serves as documentation of the 
design. This documentation can be even easier to understand 
than traditional schematics. 

Depending on the capabilities desired, a device programmer can 
cost anywhere from under $1 ,000.00 to around $1 5,000.00 for a 
high-volume production programmer. Many PLD users do not 
find it necessary to purchase a programmer; it is often quite cost 
effective and convenient to have either the manufacturer or an 
outside distributordo the programming forthem. Fordesignand 
prototyping, though, it is very helpful to have a programmer; this 
allows one to implement designs immediately. 

The convenience of programmable logic lies in the ability to 
customize a standard, off-the-shelf product. PLDs can be found 
in stock to suit a wide range of speed and power requirements. 
The variety of architectures available also allows a choice of the 
properf unctionality for the application at hand. Thus adesign can 
be implemented using a standard device, with the end result 
essentially being a custom device. If a design change is needed, 
it is a simple matter to edit the original design and then program 
a new device, or, in the case of reprogrammable CMOS devices, 
erase and reprogram the old device. 

Board layout is vastly simplified with the use of programmable 
logic. PLDs offer great flexibility in the location of inputs and 
outputs on the device. Since larger functions are implemented 
inside the PLD, board layout can begin once the inputs and 



outputs are known. The details of what will actually be inside the 
PLD can be worked out independently of the layout. In many 
cases, any needed design changes can be taken care of entirely 
within the PLD, and will not affect the PC board. 

Performance 

Speed is one ofthe main reasonsthat designers use PALdevices. 
The TTL PAL devices presently on the market can provide equal 
or better performance than the fastest discrete logic available. 
ECL PAL devices extend the benefits of programmable logic to 
the even higher-speed realm of ECL logic. Today's fastest PAL 
devices are being developed on the newest technologies to gain 
every extra nanosecond of performance. 

Performance cannot come strictly at the expense of power 
consumption. Since PLDs can be used to replace several 
discrete circuits, the power consumption of a PLD may well be 
less than that of the combined discrete devices. As more PLDs 
are developed in CMOS technology, the option for even lower 
power becomes available, including zero standby power devices 
for systems which can tolerate only minute standby power con- 
sumption. 

Reliability 

Reliability is an area of increasing concern. As systems get larger 
and more complex, the increase in the amount of circuitry tends 
to reduce the reliability of the system; there are "more things to go 
wrong". Thus a solution which inherently reduces the number of 
chips in the system will contribute to higher reliability. Aprogram- 
mable logic approach can provide device quality levels up to 50 
parts per million (ppm), while also providing a more reliable 
solution due to the smaller number of devices required. 

With the reduction in units and board space, PC boards can be 
laid out less densely, which greatly improves the reliability of the 
board itself. This also reduces crosstalk and other potential 
sources of noise, making the operation of the system cleaner and 
more reliable. 

Cost 

For any design approach to be practical, it must be cost effective. 
Cost is almost always a factor in considering a new design or a 
design change. But the calculation of total system cost can be 
misleading if not all aspects are considered. Many of the costs 
can be elusive or difficult to measure. For example, it is difficult 
to quantify the cost of market share lost due to late product 
introduction. 

The greatest savings over a discrete design are derived from the 
fact that a single PLD can replace several discrete chips. Board 
space requirements can drop by 25% or more when PLDs are 
used. Figure 4 illustrates some of the costs of the various 
solutions discussed so far, with many ofthe factors that may not 
always be considered included for comparison. These involve 
such items as inventory costs, inspection costs, test costs, board 
materials costs, and of course the very costly time spent design- 
ing and debugging such systems, and isolating and replacing 
units which fail. With each design change, the cost of a custom 
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Unit 
Cost 
Range 


NREs/ 
Development Costs 


Purchase 

Volume 

For Best 

Economics 


Design 
Flexibility 


Gate Count 
per Device 


Engineering 

Design 

Time 


Design 

Turnaround 

Time 


Cost 

of Each 

Design Change 


PLDs: BLANK 


Low-Med 


None 


5K-200K 


Med 


200-2K 


1/2Weel< 


Short 
(1-10 Days) 


Very Low 


PLDs: FACTORY 
PROGRAIVIIVIED 


Low-Med 


Low 


10K-200K 


Med 


200-2K 


3-10 
Weeks 


Moderate 
(8-10 Wks.) 


Med 


DISCRETE LOGIC 


Low-Med 


None 


1K-10K 


Low 


10-30 


1 Week 


Short 
(1-10 Days) 


Med 


GATE ARRAYS 


Low 


Med-High 


10K-200K 


Med-High 


1K-10K 


12-40 
Weeks 


Long 
(3-9 Mos.) 


High 


STANDARD CELLS 


Low 


High 


100K-300K 


Med-High 


1K-10K 


26-52 
Weeks 


Long 
(6-12 Mos.) 


High 


FULL CUSTOM 


Very Low 


High 


200K and Up 


High 


1K-60K 


1-2 Yrs. 


Long 
(6-12 Mos.) 


High 



o 
a 
c 
o 

5' 

3 



Figure 4. Cost and Design-time Comparisons 
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solution rises dramatically, while that of a user-customizable 
approach is minimal. The relationship between the various 
alternatives is summarized in Figure 5. 

Another economic benefit of the use of PLDs is that when one PAL 
device is used in several different designs, as is often the case, 
the user has not com mitted that device to any one of the particular 
designs until the device has been programmed. This means that 
inventory can be stocked for several different designs in the form 
of one device. As requirements change, the parts can be 
programmed to fit the need. And in the case of reprogrammable 
CMOS devices, one is not committed even after programming. 

There is also a cost-effective PLD solution for high-volume 
production. Just as a ROM is a PROM which has been hard-wired 
for mass production, HAL® (Hard Array Logic) devices can be 
produced in high volumes for extra cost savings. This masl<- 
programmed version can be produced in high volume with 
unparalleled quality. In addition, in the event that production 
quantities for your system show an unexpected increase, the 
equivalent PAL or ProPAL™ devices (in-factory programmed 
PAL devices) can be quickly obtained and programmed in your 
factory, by Monolithic Memories, or through distribution to cover 
the temporary shortage. More details on HAL and ProPAL 
devices are provided on page 3-104 of the Data Book. 

One final subtle cost issue is derived from the ease with which a 
competitorcan copy a design. PLDs have a unique feature called 
a security fuse, whose purpose is to protect a design from being 
copied. By using secured PLDs extensively in a system, one can 



safely avoid having one's system easily deciphered. The added 
design security provided by this feature can buy extra market 
time, forcing competitors to do their own original design work 
rather than copying the designs of others. 

Summary 

Programmable logic provides the means of creating semi-custom 
designs with readily available standard components. There is a 
wide variety of PLDs; PAL devices are most widely used, and 
perform well for basic logic and some sequencing functions. 
Other dedicated sequencers provide the circuitry required to 
implement more complex designs. 

By assuming some of the attributes of gate arrays, programmable 
logic provides the cost savings of any other semicustom device, 
without the extra engineering costs, risks, and design delays. 
Reliability is also enhanced as quality increases and board 
complexity decreases. 

The design tasks are greatly simplified due to the design tools 
which are now available. Design software and device program- 
mers allow top-down high-level designs, with a minimum of time 
spent on actual implementation issues. Simulation allows some 
design debug before a device is programmed. 

For all of these reasons, programmable logic has become, and 
will continue to be, the design methodology of choice among 
digital systems designers. 



COST 





WEEKS MONTHS 

DEVELOPMENT TIME 



Figure 5. Development Cost vs Time for Alternative Logic 
Implementations 
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Introduction 

Monolithic Memories and Advanced Micro Devices offer a wide 
variety of PLDs, implemented in a variety of technologies. In this 
section, we will briefly discuss the device families, and look at the 
various architecture, speed, and power options. More specific 
device information can be found in the individual data sheets in 
Section 5. Discussions on some of the special architectural fea- 
tures of many of the devices can also be found in their respective 
datasheets. 

There are three basic PLD areas addressed by Monolithic 
Memories' devices: 

• PAL devices — general purpose 

• Programmable sequencers — state machines 

• LCA™ devices — high density 

The largest application area is that covered by the PAL devices. 
There is a wide variety of PAL devices, ranging from simple 
devices that address general logic design problems to more 
sophisticated devices that deal with more complex problems. 

There is also a series of sequencers which are not PAL devices, 
featuring architectures particularly well suited to sequencing 
operations. While there are PAL devices that work well as state 
machines in addition to their other applications, these dedicated 
sequencers have given up some of their generality to provide 
optimal state machine solutions. 

The final area covered is that of high-density design, addressed 
by the LCA devices. The LCA device takes the approach of a 
programmable gate array to provide a PLD with many usable 
gates. 

Design software radically simplifies the design of any circuit in a 
PLD. PALASM software is used for all of the PLDs except the LCA 
devices. TheLCAdevicesmakeuseofXACT™ software for logic 
configuration. The PALASM software documentation is in 
Section 4; XACT software is discussed more fully on 
page 3-1 7 of the Data Book. 

PAL Devices 

PAL devices are available in three different technologies: 

• TTL 

• CMOS 

• ECL 

The CMOS devices often provide the same functions as the TTL 
devices, and can be used in the same sockets, with the added 
benefit of lower power. Thus the TTL and CMOS devices will be 
discussed together, followed by a summary of specific CMOS 
issues. ECLdevices require completely different design consid- 
erations, and cannot be interchanged with TTL or CMOS devices. 
They will therefore be discussed separately. 

PAL devices generally have a mnemonic naming scheme, which 
provides some basic information about the devices' capabilities. 



Because Monolithic Memories and Advanced Micro Devices 
recently merged, there are presently two slightly different naming 
conventions in use. Some devices were originally Monolithic 
Memories' devices; some were Advanced Micro Devices'; some 
devices were produced by both companies before the merger. In 
order to minimize the confusion after the merger, the old device 
names are being maintained. The two naming conventions will be 
discussed separately. Throughout this handbook, "AmPAL" 
designates products originally from Advanced Micro Devices. 

Note that all products that Monolithic Memories and Advanced 
Micro Devices were producing before the merger are still in 
production. 

Monolithic Memories Nomenclature 

Monolithic Memories originally set the PAL device naming stan- 
dard. An example of an older device name is shown in Figure 1 . 

As devices have become faster, and as CMOS has become 
available, it has become necessary to change the way we 
describe the performance of newer devices. While the conven- 
tion is slightly different, it does make it easier to see exactly what 
the speed grade and power level of a device are. An example of 
such a new device is shown in Figure 2. 

Advanced Micro Devices 
Nomenclature 

The Advanced Micro Devices nomenclature follows the Mono- 
lithic Memories nomenclature with a few modifications. Atypical 
older device name is shown in Figure 3. 

Advanced Micro Devices also found the need to modify the 
nomenclature with the advent of faster devices and CMOS 
devices. An example of a newer device is shown in Figure 4. 

The relationship between a device and its name will become 
apparent in the Tables following. 

The package designators also differ. A cross reference for com- 
mercial packages is shown in Table 1 . 





MONOLITHIC 


ADVANCED 


PACKAGE 


MEMORIES 


MICRO DEVICES 


Plastic DIP 


N 


P 


Plastic SKINNYDIP® 


NS 


P 


Ceramic DIP 


J 


D 


Ceramic SKINNYDIP 


JS 


D 


PLCC-20-pin 


NL 


J 


PLCC from 24-pin DIP 


FN 


J 


JEDEC 






PLCC from 24-pin DIP 


NL 


- 


PLCC from 28-pin DIP 


FN 


J 


LCC 


L 


L 



Table 1. Package Designators 
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PAL 16 



8 A -2 



NUMBER OF INPUTS 



ARCHITECTURE 



Notes: 

ECL devices also have either '10H' or '100' following 'PAL' to 
indicate the ECL family compatibility. 

The number of inputs includes feedback, if feedback exists. 

The architecture code varies. It is usually mnemonic, and the 
meanings of each code will be listed in each of the following 
sections. 



I 



POWER LEVEL 



SPEED GRADE 



NUMBER OF OUTPUTS 
(OR FLIP-FLOPS) 



The speed grade indicates the propagation delay. In general, the 
speeds are as follows: 

blank: 35 ns (Note that -2 and -4 devices are slower) 
A: 25 ns 
B: 15 ns 
D: 10 ns 

The power level can be one of three designators: 



blank 
-2 
-4 



'full power', 1 80-240 mA 
'half power', 90-105 mA 
'quarter power', 45-55 mA 



Figure 1. Monolithic Memories Nomenclature for Older Products 



PAL C 16 



TECHNOLOGY 



NUMBER OF INPUTS . 
ARCHITECTURE 



I 



SPEED GRADE 



POWER LEVEL 



NUMBER OF OUTPUTS 
(OR FLIP-FLOPS) 



Notes: 

The technology designator can be one of four codes: 

blank = TTL 
C = CMOS 
10H =10KHECL 
100 =100KECL 

The number of inputs includes feedback, if feedback exists. 



The architecture code varies. It is usually mnemonic, and the 
meanings of each code will be listed in each of the following 
sections. 

The power level can be one of four designators: 

blank: 'full power', 1 80-240 mA 
H: 'half power', 90-105 mA 
Q: 'quarter power', 45-55 mA 
Z: 'zero power', <0.1 mA standby current 



Figure 2. New Monolithic Memories Nomenclature 
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AmPAL 16 L 



Am PAL • 



NUMBER OF INPUTS - 



ARCHITECTURE 



Notes: 

The number of inputs includes feedbacl<, if feedbacl< exists. 

The architecture code varies. It is usually mnemonic, and the 
meanings of each code will be listed in each of the following 
sections. 



L 



POWER LEVEL 



SPEED GRADE 



— NUMBER OF OUTPUTS 
(OR FLIP-FLOPS) 

The speed grade indicates the propagation delay. In general, the 
speeds are as follows: 

blank: 35 ns 
A: 25 ns 
B: 15 ns 
D: 10 ns 

The power level can be one of three designators: 

blank: 'full power', 1 80-240 mA 
L: 'half power', 90-105 mA 
Q: 'quarter power', 45-55 mA 



Figure 3. Advanced Micro Devices Nomenclature for Older Products 



TECHNOLOGY 



22 XP 10 -30 L 



NUMBER OF INPUTS ■ 
ARCHITECTURE 



L 



POWER LEVEL 



SPEED GRADE 



NUMBER OF OUTPUTS 
(OR FLIP-FLOPS) 



Notes: 

The technology designator can be one of two codes: 

blank = TTL 
C = CMOS 

The number of inputs includes feedback, if feedback exists. 

The architecture code varies. It is usually mnemonic, and the 
meanings of each code will be listed in each of the following 
sections. 



The speed grade indicates the propagation delay in nano- 
seconds. 

The power level can be one of three designators: 

blank: full power', 180-240 mA 
L: 'half power', 90-105 mA 



Figure 4. New Advanced iVilcro Devices Nomenclature 
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AMD/MMI Programmable Logic Product 
Cross Reference Guide 

As a merged company, the combined PLD offerings of AMD/MM! 
provide the PLD userthe broadest iineof PLDs available from one 
supplier. The combined product offering also contains a number 
of devices for which each of the original separate companies had 
equivalent devices. 

This is a guide to the devices that are equivalent and can be 
directly substituted (with exceptions as noted in the comments 
section). 

These are the only products which are direct substitutes for each 
other, or which we will be encouraging customers to use as 
alternate parts. 



TTL and CMOS PAL Devices 

TTL PAL devices are used more widely by far than devices made 
with any other technology. They provide very high speed without 
the design requirements of ECL. CMOS devices provide the 
benefits of reduced power consumption and erasability. Each 
CMOS device is either TTL compatible, or has aTTL-compatible 
version. 

The available devices can be generally placed into four functional 
groups: 

• Simple combinatorial 

• Simple registered 

• Sequencer 

• Asynchronous 



AMD 


MMI 


COMMENTS 


Part Number 


Speed/Power 


Part Number 


Speed/Power 


AmPAL16R8D 
AmPAL16R6D 
AmPAL16R4D 
AmPAL16L8D 


10/180 
10/180 
10/180 
10/180 


PAL16R8D 
PAL16R6D 
PAL16R4D 
PAL16L8D 


10/180 
10/180 
10/180 
10/180 




AmPAL16R8B 
AmPAL16R6B 
AmPAL16R4B 
AmPAL16L8B 


15/180 
15/180 
15/180 
15/180 


PAL16R8B 
PAL16R6B 
PAL16R4B 
PAL16L8B 


15/180 
15/180 
15/180 
15/180 




AmPAL16R8AL 
AmPALl6R6AL 
AmPAL16R4AL 
AmPAL16L8AL 


25/90 
25/90 
25/90 
25/90 


PAL16R8B-2 
PAL16R6B-2 
PAL16R4B-2 
PAL16L8B-2 


25/90 
25/90 
25/90 
25/90 




AmPAL16R8A 
AmPAL16R6A 
AmPAL16R4A 
AmPAL16L8A 


25/180 
25/180 
25/180 
25/155 


PAL16R8A 
PAL16R6A 
PAL16R4A 
PAL16L8A 


25/180 
25/180 
25/180 
25/180 


AMD— 20-ns X^^ 
AMD— 20-ns X^^ 
AMD— 20-ns t^^ 


AmPAL16R8Q 
AmPALl6R6Q 
AmPAL16R4Q 
AmPAL16L8Q 


35/45 
35/45 
35/45 
35/45 


PAL16R8B-4 
PAL16R6B-4 
PAL16R4B-4 
PAL16L8B-4 


35/55 
35/55 
35/55 
35/55 




AmPAL16R8L 
AmPALl6R6L 
AmPAL16R4L 
AmPAL16L8L 


35/90 
35/90 
35/90 
35/80 


PAL16R8A-2 
PAL16R6A-2 
PAL16R4A-2 
PAL16L8A-2 


35/90 
35/90 
35/90 
35/90 




AmPAL16R8 
AmPAL16R6 
AmPAL16R4 
AmPALl6L8 


35/180 
35/180 
35/180 
35/180 


PAL16R8A-2 
PAL16R6A-2 
PAL16R4A-2 
PAL16L8A-2 


35/90 
35/90 
35/90 
35/90 


Note that the MMI part requires half the power 
(90 mA \^) of the AMD part (1 80 mA) 


AmPAL18P8L 
AmPAL18P8L 
AmPAL18P8L 
AmPAL18P8L 
AmPAL18P8L 
AmPAL18P8L 
AmPAL18P8L 
AmPAL18P8L 


35/90 
35/90 
35/90 
35/90 
35/90 
35/90 
35/90 
35/90 


PAL10H8 
PAL12H6 
PAL14H4 
PAL16H2 
PAL10L8 
PAL12L6 
PAL14L4 
PAL16L2 


35/90 
35/90 
35/90 
35/90 
35/90 
35/90 
35/90 
35/90 


The 18P8L can replace any of the MMI parts, 
but the reverse is not necessarily true. 
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Simple Combinatorial Devices 

These PAL devices provide simple combinatorial functions 
quickly and efficiently. . There is a wide selection of devices, 
allowing the user to choose the right device for the given 
application. 

There are five basic characteristics that differentiate these 
devices: 

• The number of inputs 

• The number of outputs 

• The number of product terms per output 

• The speed: propagation delay 

• Power consumption 

Several of the devices have internal feedback, which routes 
output pins back as inputs. This is generally coupled with 
programmable three-state capability, so that the output pins can 
be used as outputs, inputs, or bidirectional pins. 

Most of the devices have fixed output polarity. The AmPALI 8P8, 
AmPAL22P10, PAL16P8. PAL20S10, and AmPAL22XP10 all 
have programmable output polarity. 

The PAL20S10 has a feature called "product term steering", 
which allows product terms to be allocated between outputs. This 
is described more fully on page 5-103 of the Data Book. 



The AmPAL22XP10 has built-in exclusive-OR gates on each 
output. 

In addition to the devices that are purely combinatorial, there are 
also registered devices with flip-flops that can be bypassed to 
give a combinatorial device. Although these devices are not 
listed in Table 2, they can be used as combinatorial devices. The 
best example is the PAL22\/10, which is widely used as a com- 
binatorial device. 

A summary of the architecture codes for these types of devices 
is shown in Table 2. 

The combinatorial devices are listed in Table 3, with their logic, 
speed, and power characteristics. 



CODE 


MEANING 


EXAMPLE 


H 
L 
P 
C 
XP 

S 


Active-HIGH outputs 

Active-LOW outputs 

Programmable output polarity 

Complementary outputs 

Exclusive-OR gate, prog. 

polarity 

Product term steering 


PAL10H8 
PAL16L8 
PAL16P8 
PAL16C1 
AmPAL22XP10 

PAL20S10 



Table 2. Combinatorial Architectures 













STANDBY 










PRODUCT 


SPEED 


(m&) 


DATA SHEET 


DEVICE NAME 


INPUTS 


OUTPUTS 


TERMS/OUTPUT 


(tpB in ns) 


PAGE NO. 


PAL8L14A 


8 


14 


1 


25 


90 


5-141 


PAL6L16A 


6 


16 


1 


25 


90 


5-141 


PAL10H8 


10 


8 


2 


35 


90 


5-56 


PAL12H6 


12 


6 


2,4 


35 


90 


5-56 


PAL14H4 


14 


4 


4 


35 


90 


5-56 


PAL16H2 


16 


2 


8 


35 


90 


5-56 


PAL10L8 


10 


8 


2 


35 


90 


5-56 


PAL12L6 


12 


6 


2,4 


35 


90 


5-56 


PAL14L4 


14 


4 


4 


35 


90 


5-56 


PAL16L2 


16 


2 


8 


35 


90 


5-56 


PAL16C1 


16 


2 


16 


40 


90 


5-56 


PAL16L8D 


16* 


8 


7 


10 


180 


5-29 


AmPALI 6L8D 








10 


180 


5-183 


PAL16L8B 








15 


180 


5-31 


AmPALI 6L8B 








15 


180 


5-197 


PALC16L8Z-25 








25 


0.1 


5-50 


PALC16L8Q-25 








25 


45 


5-33 


PAL16L8B-2 








25 


90 


5-35 


AmPALI 6L8AL 








25 


90 


5-197 


PAL16L8A 








25 


180 


5-37 


AmPALI 6L8A 








25 


155 


5-197 


PAL16L8B-4 








35 


55 


5-39 


AmPALI 6L8Q 








35 


45 


5-197 


PAL16L8A-2 








35 


90 


5-41 


AmPALI 6L8L 








35 


80 


5-197 


AmPALI 6L8 








35 


155 


5-197 


PAL16L8A-4 








55 


50 


5-43 



Table 3. Simple Combinatorial PAL Devices 
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DEVICE NAME 


INPUTS 


OUTPUTS 


PRODUCT 
TERMS/OUTPUT 


SPEED 
(tpo In ns) 


STANDBY 
(mA) 


DATA SHEET 
PAGE NO. 


PAL16P8A 


16* 


8 


7 


25/30** 


180 


5-17 


AmPAL18P8B 

AmPAL18P8AL 

AmPAL18P8A 

AmPAL18P8Q 

AmPAL18P8L 


18* 


8 


8 


15 
25 
25 
35 
35 


180 
90 
180 
55 
90 


5-202 
5-202 
5-202 
5-202 
5-202 


PAL12L10 

PAL14L8 

PAL16L6 

PAL18L4 

PAL20L2 

PAL20C1 


12 
14 
16 
18 
20 
20 


10 
8 
6 
4 
2 
2 


2 

2,4 
2,4 
4,6 
8 
16 


40 
40 
40 
40 
40 
40 


100 
100 
100 
100 
100 
100 


5-147 
5-147 
5-147 
5-147 
5-147 
5-147 


PAL20L8B 

PAL20L8B-2 

PAL20L8A 

PALC20L8Z-35 

PAL20L8A-2 

PALC20L8Z-45 


20* 


8 


7 


15 
25 
25 
35 
35 
45 


210 
105 
210 
0.1 
105 
0.1 


5-125 
5-126 
5-128 
5-133 
5-130 
5-133 


AmPAL20L10B 
AmPAL20L10-20 
AmPAL20L10AL 
PAL20L10A 


20* 


10 


3 


15 
20 
25 
30 


210 
165 
105 
165 


5-306 
5-306 
5-306 
5-113 


PAL20S10 


20* 


10 


0-1 6tt 


35 


240 


5-103 


AmPAL22P10B 

AmPAL22P10AL 

AmPAL22P10A 


22* 


10 


8 


15 
25 
25 


210 
105 
210 


5-306 
5-306 
5-306 


Am PAL22XP 10-20 
AmPAL22XP10-30L 
Am PAL22XP 10-30 
AmPAL22XP10-40L 


22* 


10 


2/6t 


20 
30 
30 
40 


210 

105 
180 
105 


5-286 
5-286 
5-286 
5-286 



Includes feedback 
Depending on polarity 



t Has an exclusive-GR gate 
tt Product term steering 

Table 3. Simple Combinatorial PAL Devices (Continued) 
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Simple Registered Devices 

A basic registered device is generally equivalent to a combinato- 
rial device with registers on the outputs for signal synchronization. 
All such devices have feedback from the flip-flops back into the 
array as inputs, so that they can also be used for very fast small 
stale machines. The differentiating characteristics of these de- 
vices are: 

• The number of inputs 

• The number of outputs 

• The number of flip-flops 

• The number of product terms per output 

• The speed; maximum clock frequency 

• Power consumption 

Most of these devices have dedicated clock and output enable 
pins. On devices such as the PAL16R4, which have some 
dedicated combinatorial outputs, only the registered outputs are 
controlled by the enable pin; the combinatorial outputs have 
programmable three-state functions. 

The PAL22V10, AmPAL22RP10 family, and AmPAL22XRP10 
family have programmable three-state outputs. With these fami- 
lies, the clock pin may also be used as a logic input if desired. 

The PALI 6X4 has extra bit-pair decoding circuitry which makes 
it especially well suited to arithmeticoperations. This is discussed 
more fully on page 2-53 in the combinatorial logic section. 



The simpler devices all have active-LOW outputs. The 
PAL16RP8 family, PAL22V10, AmPAL22RP10 family, 
AmPAL22XRP1 family, PAL20RS1 family, and the PAL32R1 6 
have programmable polarity. 

The PAL20RS1 family and PAL32R1 6 have product term steer- 
ing, as was discussed above for the PAL20S10. 

The outputs of the PAL32R16 can also be programmed to be 
combinatorial in banks of eight. In addition, the PAL32R16 has 
multiple clock, enable, and TTL-level preload pins. 

The PAL22V10 has two programmable initialization product 
terms: a synchronous preset term and an asynchronous reset 
term. These terms control all flip-flops. 

A summary of the architecture codes for these types of devices 
is shown in Table 4. The devices are summarized in Table 5. 



CODE 


MEANING 


EXAMPLE 


R 
X 

RP 
RS 
V 


Registered outputs 
Exclusive-OR gates 
Registered with prog, polarity 
Registered with term steering 
Versatile 


PAL16R8 
PAL16X4 
PAL16RP8 
PAL20RS10 
AmPAL22V10 



Table 4. Registered Architectures 















STANDBY 












PRODUCT 


SPEED 


(m^) 


DATA SHEET 


DEVICE NAME 


INPUTS 


OUTPUTS 


FLIP-FLOPS 


TERMS/OUTPUT 


(f„„ In MHz) 


PAGE NO. 


PAL16R8D 


16* 


8 


8 


8 


55 


180 


5-29 


AmPAL16R8D 










55 


180 


5-183 


PAL16R8B 










37 


180 


5-31 


AmPAL16R8B 










40 


180 


5-197 


PALG16R8Z-25 










28.5 


0.1 


5-50 


PALG16R8Q-25 










28.5 


45 


5-33 


PAL16R8B-2 










25 


90 


5-35 


AmPAL16R8AL 










28.5 


90 


5-197 


PAL16R8A 










25 


180 


5-37 


AmPAL16R8A 










28.5 


180 


5-197 


PAL16R8B-4 










16 


55 


5-39 


AmPAL16R8Q 










18 


45 


5-197 


PAL16R8A-2 










16 


90 


5-41 


AmPAL16R8L 










18 


90 


5-197 


AmPAL16R8 










18 


180 


5-197 


PAL16R8A-4 










11 


50 


5-43 


PAL16R6D 


16* 


8 


6 


8 


55 


180 


5-29 


AmPAL16R6D 










55 


180 


5-183 


PAL16R6B 










37 


180 


5-31 


AmPAL16R6B 










40 


180 


5-197 


PALG16R6Z-25 










28.5 


0.1 


5-50 


PALC16R6Q-25 










28.5 


45 


5-33 


PAL16R6B-2 










25 


90 


5-35 


AmPAL16R6AL 










28.5 


90 


5-197 


PAL16R6A 










25 


180 


5-37 
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SPEED 


STANDBY 












PRODUCT 


('m« 


>cc 


DATA SHEET 


DEVICE NAME 


INPUTS 


OUTPUTS 


FLIP-FLOPS 


TERMS/OUTPUT 


In MHz) 


(mA) 


PAGE NO. 


AmPAL16R6A 










28.5 


180 


5-197 


PAL16R6B-4 










16 


55 


5-39 


AmPAL16R6Q 










18 


45 


5-197 


PAL16R6A-2 










16 


90 


5-41 


AmPAL16R6L 










18 


90 


5-197 


AmPAL16R6 










18 


180 


5-197 


PAL16R6A-4 










11 


50 


5-43 


PAL16R4D 


16* 


8 


4 


8 


55 


180 


5-29 


AmPAL16R4D 










55 


180 


5-183 


PAL16R4B 










37 


180 


5-31 


AmPAL16R4B 










40 


180 


5-197 


PALC16R4Z-25 










28.5 


0.1 


5-50 


PALC16R4Q-25 










28.5 


45 


5-33 


PAL16R4B-2 










25 


90 


5-35 


AmPAL16R4AL 










28.5 


90 


5-197 


PAL16R4A 










25 


180 


5-37 


AmPAL16R4A 










28.5 


180 


5-197 


PAL16R4B-4 










16 


55 


5-39 


AmPAL16R4Q 










18 


45 


5-197 


PAL16R4A-2 










16 


90 


5-41 


AmPAL16R4L 










18 


90 


5-197 


AmPAL16R4 










18 


180 


5-197 


PAL16R4A-4 










11 


50 


5-43 


PAL16X4 


16* 


8 


4 


8t 


14 


225 


5-51 


PAL16RP8A 


16* 


8 


8 


8 


25** 


180 


5-17 


PAL16RP6A 


16* 


8 


6 


8 


25** 


180 


5-17 


PAL16RP4A 


16* 


8 


4 


8 


25** 


180 


5-17 


PAL20R8B 


20* 


8 


8 


8 


37 


210 


5-125 


PAL20R8B-2 










25 


105 


5-126 


PAL20R8A 










25 


210 


5-128 


PALC20R8Z-35 










20 


0.1 


5-133 


PAL20R8A-2 










16 


105 


5-130 


PALC20R8Z-45 










15.3 


0.1 


5-133 


PAL20R6B 


20* 


8 


6 


8 


37 


210 


5-125 


PAL20R6B-2 










25 


105 


5-126 


PAL20R6A 










25 


210 


5-128 


PALC20R6Z-35 










20 


0.1 


5-133 


PAL20R6A-2 










16 


105 


5-130 


PALC20R6Z-45 










15.3 


0.1 


5-133 


PAL20R4B 


20* 


8 


4 


8 


37 


210 


5-125 


PAL20R4B-2 










25 


105 


5-126 


PAL20R4A 










25 


210 


5-128 


PALC20R4Z-35 










20 


0.1 


5-133 


PAL20R4A-2 










16 


105 


5-130 


PALC20R4Z-45 










15.3 


0.1 


5-133 



Table 5. Simple Registered PAL Devices (Continued) 
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DEVICE NAME 


INPUTS 


OUTPUTS 


FLIP-FLOPS 


PRODUCT 
TERMS/OUTPUT 


SPEED 
I'max 

in MHz) 


STANDBY 

'cc 
(mA) 


DATA SHEET 
PAGE NO. 


PAL20RS10 

PAL20RS8 

PAL20RS4 


20* 
20* 
20* 


10 
10 
10 


10 
8 

4 


0-1 6tt 
0-1 6tt 
0-1 6tt 


20 
20 
20 


240 
240 
240 


5-103 
5-103 
5-103 


AmPAL22V10-15 
PALC22V10H-25 

AmPAL22V10A 
PALC22V10H-35 

AmPAL22V10 


22* 


10 


0-1 0§ 


8-1 6§§ 


40 

33.3 

28.5 

20 

18 


180 
90 
180 
90 
180 


5-249 

5-79 

5-260 

5-79 

5-260 


AmPAL20RP10B 

AmPAL20RP10AL 

AmPAL20RP10A 


22* 


10 


10 


8 


37 
25 
25 


210 
105 
210 


5-306 
5-306 
5-306 


AmPAL20RP8B 

AfnPAL20RP8AL 

AmPAL20RP8A 


22* 


10 


8 


8 


37 
25 
25 


210 
105 
210 


5-306 
5-306 
5-306 


AmPAL20RP6B 

AmPAL20RP6AL 

AmPAL20RP6A 


22* 


10 


6 


8 


37 
25 
25 


210 
105 
210 


5-306 
5-306 
5-306 


AmPAL20RP4B 

AmPAL20RP4AL 

AmPAL20RP4A 


22* 


10 


4 


8 


37 
25 
25 


210 
105 
210 


5-306 
5-306 
5-306 


PAL32R16 


32* 


16 


16§ 


0-1 6tt 


16 


280 


5-158 



* Includes feedback 
** With polarity fuse intact 
t Has an exclusive-OR gate 



tt Product term steering 
§ Flip-flops can be bypassed 
§§ Has varied product term distribution 

Table 5. Simple Registered PAL Devices (Continued) 



PAL Devices as Sequencers 

Several registered PAL devices have added features that make 
them particularly well suited to state machine applications. 

The basic differentiating aspects of these devices are: 

• The number of inputs 

• The number of outputs 

• The number of flip-flops 

• The type of flip-flop possible (D, T, J-K, S-R, latch, 
and/or buried) 

• The number of product terms per output 

• The speed: maximum clock frequency 

• Power consumption 

The PAL20X1 series, AmPAL20XRP10 series, PAL22RX8, and 
PAL32VX1 all have an exclusive-OR gate that makes it possible 
to design with D, T J-K, or S-R flip-flops. Since the PAL20X1 has 
fewer product terms, it is best for counter-like applications. The 
AmPAL20XRP1 series, PAL22RX8 and PAL32VX1 have more 
product terms, for much more sophisticated state machines. 



The AmPAL23S8 has six extra flip-flops that are permanently 
buried in addition to eight output flip-flops. The PAL32VX1 and 
AmPALG29M1 6 also have flip-flops that can be buried if they are 
not needed at the outputs. An output pin can be used as an input 
if its flip-flop is buried. All of these devices are ideal for applica- 
tions incorporating internal state machines as part of the overall 
design. 

The PAL22RX8, PAL32VX10, and AmPAL23S8 have global 
preset and reset control, making system initialization much 
easier. 

The AmPALC29M1 6 is an electrically erasable device that has an 
unusually flexible architecture. Almost all pins can be used as 
inputs or outputs; both inputs and outputs can be registered, 
latched, or combinatorial. There are two pins that can be used as 
clocks. Global preset and reset functions are also provided. 
Several of the flip-flops can be buried to provide internal state 
machines. This device is HO and HOT compatible. 

A summary of the architecture codes for these types of devices 
is shown in Table 6. 

The PAL devices best suited to state machine applications are 
listed in Table 7. 
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CODE 


MEANING 


EXAMPLE 


X 


Exclusive-OR gates 


PAl^OXIO 


XRP 


Exclusive-OR gates, prog, polarity 


AmPAL20XRP10 


RX 


Registered, exciusive-OR gates 


PAL22RX8 


S 


Sequencer 


AnnPAL23S8 


M 


Advanced nnacrocell 


AmPALC29M16 


VX 


Varied term distribution, XOR gate 


PAL32VX10 



Table 6. State Machine Architectures 



DEVICE NAME 


INPUTS 


OUTPUTS 


FLIP-FLOPS 


FLIP-FLOP 
TYPES 


PRODUCT 
TERMS/OUTPUT 


SPEED 

\ MAX 

In MHz) 


STAND 
BY 
Ice 

(mA) 


DATASHEET 
PAGE NO. 


PAL20X10A 

PAL20X8A 

PAL20X4A 


20* 
20* 
20* 


10 
10 
10 


10 
8 

4 


D,T,JK,SR 
D,T,JK,SR 
D,T,JK,SR 


2/2t 
2/2t 
2/2t 


22.2 
22.2 
22.2 


180 
180 
180 


5-113 
5-113 
5-113 


AmPAL20XRP10-20 
AmPAL20XRP10-30L 
AmPAL20XRP10-30 
AmPAL20XRP10-40L 


22* 


10 


10 


D,T,JK,SR 


2/6t 


30.3 
22.2 
22.2 

14.3 


210 
105 
180 
105 


5-286 
5-286 
5-286 
5-286 


AmPAL20XRP8-20 
AmPAL20XRP8-30L 
AmPAL20XRP8-30 
AmPAL20XRP8-40L 


22* 


10 


8 


D,T,JK,SR 


2/6, 8t 


30.3 
22.2 
22.2 
14.3 


210 
105 
180 
105 


5-286 
5-286 
5-286 
5-286 


AmPAL20XRP6-20 
AmPAL20XRP6-30L 
AmPAL20XRP6-30 
AmPAL20XRP6-40L 


22* 


10 


6 


D,T,JK,SR 


2/6, 8t 


30.3 
22.2 
22.2 
14.3 


210 
105 
180 
105 


5-286 
5-286 
5-286 
5-286 


AmPAL20XRP4-20 
AmPAL20XRP4-30L 
AmPAL20XRP4-30 
AmPAL20XRP4-40L 


22* 


10 


4 


D.TJK.SR 


2/6. 8t 


30.3 
22.2 
22.2 
14.3 


210 
105 
180 
105 


5-286 
5-286 
5-286 
5-286 


PAL22RX8A 


22* 


8 


8§ 


D,T,JK,SR 


1/8t 


28.5 


210 


5-87 


AmPAL23S8-20 
AmPAL23S8-25 


23* 


8 


14§ 


D,BO 


6-1 2§§ 


33 
28.5 


200 
200 


5-169 
5-169 


AmPALC29M16-35 
AmPALC29M16-45 


29* 


16 


16§ 


D.B,LO 


8-1 6§§ 


20 
15 


120 
120 


5-231 
5-231 


PAL32VX10A 
PAL32VX10 


32* 


10 


10§ 


D,TJK,SR, 

BO 


1/8-1 6t 


25 
22.2 


180 
180 


5-70 
5-70 



* Includes feedbacl< §§ Has varied product term distribution 

t Has an exciusive-OR gate B=fiip-fiops are or can be buried; L=latched outputs possible 

§ Some flip-flops can be bypassed 

Table 7. State Machine PAL Devices 
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Asynchronous PAL Devices 

Several PAL devices have been designed for applications that 
require asynchronous control of flip-flops. The simplest devices 
are the PAL20RA1 and the PAL1 6RA8. These devices essen- 
tially consist of several flip-flops driven by a programmable AND 
array and a fixed OR array. Thus the D-type flip-flops have 
individually programmable clock, set, and reset lines. They also 
have programmable polarity, and the flip-flops can be bypassed 
to give combinatorial outputs. 

These devices are also useful for replacing random discrete logic. 
Each flip-flop is independent; the clocks can be gated if desired, 
as can the set and reset lines. This provides for flexibility both 
when replacing random gates and when designing logic that is 
asynchronous in nature. 

The AmPALC29MA1 6 combines some of the advantages of the 
AmPALC29M1 6 with the advantages of the PAL20RA1 0. Most of 
the AmPALC29M16 macrocell is preserved, except that each 
flip-flop is given a programmable clock, set, and reset line. Each 
output also has individual three-state control. This device is 
electrically erasable, and is HC and HCT compatible. 

A summary of the architecture codes for these types of devices 
is shown in Table 8. The asynchronous devices available are 
summarized in Table 9. 



CODE 


MEANING 


EXAMPLE 


RA 
MA 


Registered, asynchronous 
Asynchronous macrocell 


PAL20RA10 
AmPALC29MA16 



Tablo 8. Asynchronous Architectures 



devices, but with reduced power consumption. The functions of 
these devices have been discussed above, along with the TTL 
devices. The particular characteristics of the CMOS versions are 
discussed below. 

Zero-Power vs Low-Power 

Two basic types of CMOS PAL devices are available: those that 
dissipate essentially no power when in a quiescent state, and 
faster devices which draw a nominal amount of current even 
when quiescent. Devices are thus classified as "zero-power" or 
"low-power". The low-power devices still require far less current 
than their bipolar equivalents. 

The basicdifference between zero-power and low-power devices 
is the wake-up circuitry used in the zero-power devices. Any time 
a signal changes, the device wakes up in order to respond to the 
signal. After the transition is complete, the device goes "back to 
sleep", as long as no other signals are changing. This wake-up 
circuitry exacts a slight speed penalty, usually about 5 ns, but in 
applications where minimal power dissipation is crucial, the 
benefits of zero-power operation greatly outweigh the extradelay. 

The PALC16R8Z family actually has a 'lurbo product term". This 
product term allows the device to operate either in zero-power or 
low-power mode. Since this choice has been provided by a 
product term, it is actually possible to have a device idle in zero- 
power mode, and then switch into low-power mode for faster 
operation. This is particularly appropriate for devices that tend to 
operate in bursts, with idle periods between bursts. 

HC vs HCT 

CMOS devices can be characterized as being HC and/or HCT 
compatible. HC-compatible devices have outputs that swing 
"rail-to-rail", or from ground to VCC. They also have an input 
threshold which is about 2.5 V. These devices are normally used 
when driving other CMOS devices. 



CMOS PAL Devices 

PAL devices have recently become available in CMOS technol- 
ogy, providing performance competitive with that of many bipolar 



DEVICE NAME 


INPUTS 


OUTPUTS 


PRODUCT 
TERMS/OUTPUT 


SPEED 
(tpD in ns) 


STANDBY 
lcc(mA) 


DATA SHEET 
PAGE NO. 


PAL16RA8 


16* 


8 


4 


30** 


170 


5-11 


PAL20RA10-20 
PAL20RA10 


20* 


10 


4 


20** 
30** 


200 
200 


5-95 
5-97 


AmPALC29MA16-35 
AmPALC29MA16-45 


29* 


16 


4-1 2tt 


35 
45 


120 
120 


5-209 
5-209 



* Includes feedback 
** With polarity fuse intact 
tt Has product term steering 



Table 9. Asynchronous PAL Devices 
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HCT-compatible devices have TTL-level output voltages; that is, 
a VOH of about 3.5 V and a VOL of about 0.5 V. The input 
threshold is usually adjusted to 1 .5 V, which isthe threshold of TTL 
devices. These devices are more suitable for a mixed TTL and 
CMOS system. 

Roughly speaking, HC and HCT devices can be interchanged. 
However, since HCT outputs do not swing rail-to-rail, if they drive 
another CMOS device, that device will draw more power. HCT 
outputs are faster, have more drive, and are more latch-up 
immune. The input threshold is less critical, and is important only 
in very touchy situations. 

Note that some devices are both HC and HCT compatible. This 
meansthat they have aquick, high drive output rise to aTTL VOH, 
and then a slower rise to VCC, ultimately providing rail-to-rail 
levels. This is illustrated in Table 10. 



FAMILY 


INPUT 
THRESHOLD (V) 


Vo. 
(V) 


(V) 


HC 

HCT 

HC/HCT 


2.5 

1.5 
1.5 




0.5 



Vcc 

3.5 

3.5 -> Vcc 



Table 10. CMOS/TTL Compatibility 



UV Erasable vs Electrically Erasable 

CMOS PAL devices come in one of two basic technologies: 

• UV erasable 

• Electrically erasable 

UV-erasable devices use EPROM technology in implementing 
the array. They can be erased by exposure to ultraviolet light ifthe 
device is packaged in a windowed package. These packages are 
more expensive, so the devices are also available in lower-cost 
plastic packages. UV-erasable devices in plastic packages are 
not erasable, and are thus referred to as "One-Time Program- 
mable", or OTP devices. 

Electrically erasable devices can be erased by providing the 
appropriate signals to the device. No window is required in the 
package, and the erase time is much faster than the erase time 
of UV-erasable devices. Erasure is implemented by the device 
programmer as a portion of the overall programming algorithm. 

Summary of CMOS PAL Devices 

The PAL devices available in CMOS have been described along 
with their bipolar counterparts in the preceding pages. They are 
summarized in Table 1 1 , along with their power, compatibility, and 
erasability characteristics. 



DEVICE NAME 


INPUTS 


OUTPUTS 


SPEED 
(tpB In ns) 


STANDBY 
U,(mA) 


COMPATIBILITY 


ERASABILITY 


DATA SHEET 
PAGE NO. 


PALC16L8Z-25 
PALC16R8Z-25 
PALC16R6Z-25 
PALC16R4Z-25 


16* 
16* 
16* 
16* 


8 
8 
8 
8 


25 
25 
25 
25 


0.1/45" 
0.1/45" 
0.1/45** 
0.1/45** 


HC/HCT 
HC/HCT 
HC/HCT 
HC/HCT 


UV 
UV 
UV 
UV 


5-50 
5-50 
5-50 
5-50 


PALC16L8Q-25 
PALC16R8Q-25 
PALC16R6Q-25 
PALC16R4Q-25 


16* 
16* 
16* 
16* 


8 
8 
8 
8 


25 
25 
25 
25 


45 
45 
45 
45 


HCT 
HCT 
HCT 
HCT 


UV 
UV 
UV 
UV 


5-33 
5-33 
5-33 
5-33 


PALC20L8Z-35 
PALC20L8Z-45 


20* 
20* 


8 
8 


35 
45 


0.1 
0.1 


HC/HCT 
HC/HCT 


UV 
UV 


5-133 
5-133 


PALC20R8Z-35 
PALC20R8Z-45 


20* 
20* 


8 
8 


35 
45 


0.1 
0.1 


HC/HCT 
HC/HCT 


UV 
UV 


5-133 
5-133 


PALC20R6Z-35 
PALC20R6Z-45 


20* 
20* 


8 
8 


35 
45 


0.1 
0.1 


HC/HCT 
HC/HCT 


UV 
UV 


5-133 
5-133 


PALC20R4Z-35 
PALC20R4Z-45 


20* 
20* 


8 
8 


35 
45 


0.1 
0.1 


HC/HCT 
HC/HCT 


UV 
UV 


5-133 
5-133 


PALC22V10H-25 
PALC22V10H-35 


22* 
22* 


10 
10 


25 
35 


90 
90 


HCT 
HCT 


UV 
UV 


5-79 
5-79 


AmPALC29M16-35 
AmPALC29M16-45 


29* 
29* 


16 
16 


35 
45 


120 
120 


HC/HCT 
HC/HCT 


EE 
EE 


5-231 
5-231 


AmPALC29MA16-35 
AmPALC29MA16-45 


29* 
29* 


16 
16 


35 
45 


120 
120 


HC/HCT 
HC/HCT 


EE 
EE 


5-209 
5-209 



* Includes feedback 
** With turbo term 



Table 11. Summary of CMOS PAL Devices 
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ECL PAL Devices 



Programmable Sequencers 



For a long period of time, PLDs were available only for TTL- 
compatible systems. More recently, however, PAL devices have 
been introduced which address the users of ECL logic. 

Two basic families of ECL logic are addressed by the ECL PAL 
devices. These families differ in their voltage ranges and the 
compensation built into the circuitry (Table 1 2). These differences 
also affect noise margins and thresholds. Refer to page 3-1 50 for 
more information on ECL design. 

The ECL PAL devices presently have a 6-ns propagation delay; 
the differences between the devices can be found in the architec- 
tures. The PAL10H20P8 and PAL10H20G8 are simpler devices 
which are compatible with other 10KH logic; the former is a 
combinatorial device with programmable polarity, the latter has 
output latches. Both devices have product term steering for 
added flexibility in allocating logic resources. These features are 
described in more detail on page 5-382. 

The PAL10H20EV/EG8 and PAL10020EV/EG8 are 10KH and 
1 0OK compatible options of the same device, respectively. These 
devices have more product terms than the first two devices, and 
have outputs that can be programmed as combinatorial or regis- 
tered/latched. These devices also have global reset and preset 
controls. 

A summary of the architecture codes for these types of devices 
is shown in Table 13. The ECL PAL devices and their basic 
characteristics are summarized in Tables 14 and 15. 



FAMILY 


v.. 


COMPENSATION 


10KH 
100K 


-5.2V ±5% 
-4.5V ± 0.3V 


Voltage 
Voltage, temperature 



Many PAL devices can be used to design very fast, efficient 
sequencers. Optimal among these devices are the PAL32VX1 
and the AmPAL23S8. However, there are additional devices with 
entire architectures dedicated to the task of sequencing. 

Since these devices are primarily intended for use as state 
machines, the important characteristics to be evaluated are: 

• The number of inputs 

• The number of outputs 

• The number of states that can be implemented 

• The number of branches available from any state 

• The speed: maximum clock frequency 

• Power consumption 

The programmable sequencers available are shown in Table 1 6. 

The PMS14R21 (also known as the PROSE™ device) actually 
consists of a PAL device for branching, followed by a PROM to 
store the states. This device also has the Diagnostics-On-Chip™ 
(DOC™) testability circuitry, allowing the device to be inserted 
into a serial scan path. This is described more thoroughly on page 
3-123 of the testability section. 

The PLS105, PLS167, and PLS168 are based on the standard 
PLS architecture, where both the AND array and the OR array are 
programmable. S-R flip-flops are used to minimize the need for 
specifying 'HOLD' conditions. There is also a "complement 
array", which makes it easier to define default state transitions. 



CODE 


MEANING 


EXAMPLE 


P 
G 
EG 
EV 


Combinatorial, prog, polarity 
Latched outputs 
Latched or combinatorial 
Registered or combinatorial 


PAL10H20P8 
PAL10H20G8 
PALI OH/1 0020EG8 
PAL10H/10020EV8 



Table 12. 10KH ECL vs 100K ECL 



Table 13. ECL PAL Dcvico Architectures 



DEVICE NAME 


INPUTS 


OUTPUTS 


FLIP-FLOPS 


PRODUCT 
TERMS/OUTPUT 


SPEED 
(tpoO'-'MJ 


I^JmA) 


DATA SHEET 
PAGE NO. 


PAL10H20P8 
PAL10H20G8 
PAL10H20EV/EG8 


20* 
20* 
20* 


8 
8 
8 




8§ 
8§ 


0-8tt 
0-8tt 
8-1 2§§ 


6 ns 

6 ns 

125 MHz 


210 
225 
220 


5-385 
5-382 
5-381 



• Includes feedback 
tt Has product term steering 



§ Flip-flops can be bypassed 
5§ Has varied product term distribution 

Table 14. lOKH-Compatible PAL Devices 



DEVICE NAME 


INPUTS 


OUTPUTS 


FLIP-FLOPS 


PRODUCT 
TERMS/OUTPUT 


SPEED 


I^C^A) 


DATA SHEET 
PAGE NO. 


PAL10020EV/EG8 


20* 


8 


8§ 


8-1 2§§ 


125 MHz 


220 


5-381 


* Includes feedback 
§§ Has varied product term distribution 


§ Flip-flops can be bypassed 
Table 15. lOOK-Compatlble PAL Devices 
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The PMS14R21. PLS105. PLS167, and PLS168 are all sup- 
ported by PALASM software, using aconvenient entry formatthat 
has been optimized for state machines. This essentially makes 
it possible to transfer a state diagram directly to a design file 
withoutthe need for manual logictransformations or calculations. 

The Am29PL141 Fuse Programmable Controller (FPC) is an in- 
struction-based sequencer. This is especially convenient for de- 
signers who are more comfortable using instruction-based de- 
vices. The design is implemented more as microprocessor 
program might be written. Indeed, the architecture itself, which is 
described in more detail on page 5-339, resembles that of a 
simple processor. This device has its own assembler which is 
used to reduce the sequencer "program" into a file that can be 
used to program the device. 

The Am2971 is a Progammable Event Generator. It is a versatile 
timing device that can be used as a digital substitute for analog 
delay lines or as a general-purpose user-programmable timing/ 
waveform generator. 

LCA Devices 

Large-scale designs are now possible with the introduction of the 
Logic Cell Array (LCA) devices. These devices can be thought of 



as programmable gate arrays, and provide an efficient, cost- 
effective means of implementing dense designs within a single 
PLD. 

Unlike other PLDs, which are non-volatile, LCAdevices make use 
of static RAM technology. This makes them easy to develop, easy 
to test and debug, and reprogrammable. 

There is an entire family of support software and hardware 
packages available to aid in the design of LCA applications. 
These are described in more detail on page 3-17. 

The architecture of an LCA device resembles that of a gate array 
more closely than a traditional PLD. It has an array of Configur- 
able Logic Blocks (CLBs) that can be connected via a variety of 
interconnect resources. There are also Input/Output Blocks 
(lOBs) for transferring the signals on and off the chip. 

The main features that distinguish one LCA device from another 
are: 

• The number of I/O pins 

• The number of Configurable Logic Blocks 

• The speed: internal maximum toggle frequency 

• Power consumption 

The available devices are summarized in Table 17. 



DEVICE NAME 


INPUTS 


OUTPUTS 


STATES 
(MAX) 


BRANCHES 
PER STATE 


SPEED 
(f„„ In MHz) 


•cc 

(mA) 


DATA SHEET 
PAGE NO. 


PMS14R21A 
PMS14R21 


8 


8 


128 


4 


30 
25 


210 
210 


5-315 
5-315 


PLS 105-37 
PLS1 67-33 
PLS1 68-33 


16 
14 
12 


8 
6 
8 


<64* 
<128* 
<1028* 


: 


37 
33 
33 


200 
200 
200 


5-331 
5-331 
5-331 


Am29PL141 


6 


16 


64 


2 


20 


450 


5-339 


Am2971 


6 


14 


N/A 


N/A 


85 


310 


5-365 



Depends highly on state diagram topology. May be limited by number of product terms or number of flip-flops. 

Table 16. Programmable Sequencers 



DEVICE NAME 


I/O PINS 


CLBs 


SPEED 

(INTERNAL TOGGLE 

f„„ In MHz) 


STANDBY 
l„(mA) 


DATA SHEET 
PAGE NO. 


M2064-70 
M2064-50 
M2064-33 


58 


64 


70 
50 
33 


5 
5 
5 


5-483 
5-483 
5-483 


M2018-50 
M2018-33 


74 


100 


50 
33 


5 
5 


5-483 
5-483 



Table 17. LCA Devices 
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Installing PALASM Software 



This section is intended as a beginner's introduction to PLD 
design, although experienced users may find it agood review. We 
will take a step-by-step approach through two very simple de- 
signs to demonstrate the basic PLD design implementation 
process. 

By "beginner", we mean a logic designer who is just beginning to 
use programmable logic. You may have a lot of experience with 
discrete digital logic, or you may have just graduated from 
college. We assume a basic understanding of digital logic, 
although there is a logic reference on page 6-1 should you 
need a refresher. Some computer experience is helpful, but not 
essential. 

Through this effort, you will be introduced to PALASM software 
and to the concept of device programming. Because of the 
simplicity of the designs, we will be using PAL devices, but the 
implementation concepts will be applicable to all of Monolithic 
Memories' and Advanced Micro Devices' PLDs. 

Only enough detail of the software will be presented to allow you 
to implement the examples. This will also give you enough 
information to understand most of the other design examples in 
this handbook. More software details can be found in the actual 
software documentation in section 4. 

To keep the discussion simple, we will make some assumptions 
about the system being used to process your designs. We will 
assume an IBM PC/XT™ orcompatible system with one hard disk 
and one floppy drive, running PC-DOS™ or MS-DOS™ version 
2.1 or higher. The installation procedure for the software will be 
described for this setup. The procedures for other setups and 
machines are not radically different; details can be found in the 
PALASM software documentation, section 4. 

We will also take no significant shortcuts for these examples, 
even though there may be times when wo could. In this way, you 
can gain a better understanding of exactly what is happening as 
you implement your design. 

We will talk about device programming, describing all of the steps 
that are necessary to program a PLD. However, due to the wide 
variety of prog rammers available, we will not get down to the level 
of detail that tells you exactly which buttons to push. Although we 
will get as close as we can, we must defer the details to your 
programmer manual. 

As we work the design examples, we will, of course, have to 
choose devices into which to put the designs. However, we will 
not dwell here on the parameters that go into choosing a device. 
This is covered more thoroughly in the rest of this section. 



Before we actually start implementing a design, we must install 
PALASM software. If you already have the software installed, you 
may disregard this discussion. The procedure described here will 
allow you to install the software package onto an IBM PC/XT or 
compatible with a hard disk and at least one floppy drive. 

After you have turned on the computer and allowed it to boot up, 
you can install the software by placing disk #1 into drive A, and 
typing: 

A:PAL2INST<CR> 

If you are new to the PC, <CR> is the 'RETURN' key on the right 
side of the keyboard. 

The program will ask you which disk you wish to install the 
programs on. Enter the disk name, or just hit <CR> if installing 
onto disk C. The installation program creates directories for all of 
the programs. If you presently have an older version of PALASM 
software installed in a similar directory structure, the installation 
program will warn you that the contents of the directories will be 
changed. Otherwise, it will start Installing. 

Note that the installation procedure may modify two files which 
are used when the computer boots up: AUTOEXEC.BAT and 
CONFIG.SYS. These changes should not affect any other 
programs you run. 

The installation program will then give you a choice of programs 
to install; hit 

1<CR> 

to install the software. The installation program will ask you to 
insert various disks as it installs the programs. The disks are 
labelled, so this should be a simple procedure. After the software 
has been installed, reboot the system by holding down 

<CTRLXALTXDEL> 

at the same time. This only needs to be done after installation. In 
the future when you turn the computer on, the computer will boot 
automatically. 

You can now call up the menu by typing 

PALASM<CR> 

Now you need to install the editor or word processing program 
you wish to use. You may use any editor you wish, as long as it 
can generate a clean ASCII text file. In addition, you need to 
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install a communication program; any one will do. A simple one 
called PC2 is provided with the PALASM software. You can install 
these programs by hitting 

<F6> 

When the setup screen comes up, hit the down-arrow key until it 
gets to the line that says "Editor". Enter the directory path and 
program name for the editor you wish to use. For example, if you 
use WordStar™, and you have the program located in the 
directory called "WS" on disk C, then enter 

C:\ws\ws 

Add enough spaces to delete whatever was there before. 

This takes you into the next field. Here you need to enter the 
filetype of your editor, which will be either .COM or .EXE. 

Once this is complete, for the WordStar example, the line should 
now look like: 

C:\ws\ws .com 

Hit <CR> again to get to the next field. Here you enter the name 
of your communication software. If you wish to use PC2, then 
type 

\PALASM\SUPL\PC2 

(adding spaces to delete what was there before.) 

<CR> 
.EXE 

If you are not using PC2, then replace "PC2" with the name of your 
program. This procedure works regardless of the editor or 
communication software you wish to use. When you are 
finished, hit 

<ESO 

to get back to the menu. 
You are now ready to start. 



Constructing a Combinatorial Design 
-—Basic Gates 

The first example we will try is a very simple combinatorial circuit 
consisting of all of the basic logicgates, as shown in Figure 1 . This 
will be helpful forthose designs where you are integrating random 
logic into a PAL device to save space and money. 

As can be seen from the fig u re, there will be six separate functions 
involving atotal of twelve inputs, it is important to bear in mind that 
programmable logic provides a convenient means of implement- 
ing designs. With a real design, some work would be required 
before this point to conceptualize the design, but due to the 
simplicity of these circuits, we are already in a position to start the 
implementation. 




Figure 1. The Basic Logic Gates 

Building The Equations 

We will start by generating Boolean equations. The first function 
to be generated is an inverter. This is specified according to 
Figure 1 as: 

B = /A 

Here the "equals" sign ('=') is used to assign a function to output 
B. The slash ('/') is used to indicate negation, since it is impossible 
to put a bar over a letter in an ASCII file. Thus this equation may 
be read: 

B is TRUE if NOT A is TRUE 

The next function is a simple AND gate. As shown in Figure 1 , we 
can write 

E = C*D 

Here we use the "equals" sign again, but this time we have 
introduced the asterisk ('*') to indicate the AND operation. This 
equation may be read: 

E is TRUE if C AND D are TRUE 

The third function is an OR gate, which may be written: 

H = F + G 

The "plus" sign ('+') is used to specify the OR operation here. 
Because of the sum-of-products nature of logic as implemented 
in PLDs, it is often easy to place product terms on separate lines, 
which improves the readability. We may rewrite this equation as: 

H = F 
+ G 

This equation may be read: 

H is TRUE if F OR G is TRUE 
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For the moment, we will assume that we have active-HIGH 
outputs on our device. The functions we have generated so far 
have essentially been active-HIGH functions. At times we wish to 
generate active-LOW functions; the next two functions are 
active-LOW functions that we wish to implement in an 
active-HIGH device. 

When we talk in terms of an active-HIGH or an active-LOW 
device, the real question is whetherthere is an extra inverter at the 
output. An active-HIGH device has an AND-OR structure; an 
active-LOW device has an AND-OR-INVERT structure which 
inverts the function at the output (see Figure 2). 




b. AND-OR-INVERT Structure 
403 02 

Figure 2. Active HIGH vs. Active LOW 

NAND and NOR gates could be generated very simply in an 
active-LOWdevice, because we would just have togenerate AND 
and OR functions, and let the output inverter generate their 
complements. However, given that we wish to implement these 
functions in an active-HIGH device, we must invoke DeMorgan's 
theorem, as shown in Figure 3. 

/(X*Y) = /x + /Y 

/(X + Y) = /X*/Y 

Figures. DeiVIorgan's Theorem 

PALASN/I software has the capability of applying DeMorgan's 
theorem to functions, so we may generate our NAND function by 
writing: 

L = /(I*J*K) 

or, if preferred, 

L = /I 
+ /J 
+ /K 

Likewise the NOR function may be specified as 

o = /(M 

+ N) 

or 

O = /M*/N 



Finally, an exclusive-OR (XOR) gate may be specified either as 

R = P :-!-: Q 

where :+: represents the XOR operation, or more explicitly as 

R = P*/Q 
+ /P*Q 

We have now specified all of the functions in terms of their 
Boolean equations. The equations are summarized in Figure 4. 

B = /A 

E = C*D 

H = F 
+ G 

L = /I 
+ /J 
+ /K 

= /M*/N 

R = P*/Q 
+ /P*Q 

Figure 4. Basic Gates Equations 

Understanding the Logic Diagram 

We will use a PAL12H6 for this function, since it has all of the 
resources needed to implement all of these functions. A portion 
of the logic diagram for this device is shown in Figure 5. The full 
logic diagram for this device can be found on page 5-62. 

The logic diagram shows all of the logic resources available in a 
particular device. In each device, inputs are provided in true and 
complementversions, as shown in Figures. These drive what are 
often called "input lines", which are the vertical lines in the logic 
diagram. These input lines can then be connected to product 
terms. The name "product term" is really just a fancy name for an 
AND gate. However, PLDs provide very wide gates, which can be 
cumbersome to draw. To save space, the product terms are 
drawn as horizontal lines with a small AND gate symbol at one end 
to indicate the function being performed. 

So we see that in the PAL12H6, there are twelve inputs and six 
outputs. Four of the outputs, on pins 14 through 17, have two 
product terms connected to the OR gates. The outputs on pins 
1 3 and 1 8 have four product terms connected to the OR gates. 
Thus pins 13 and 18 can be used to implement more complex 
functions than the other four outputs. 

Although you really do not need to be concerned with the actual 
implementation ofthesefunctionsinsidethePALdevice, you may 
be curious. Figure 6 shows howthe inverter and the AND gate are 
implemented in the PALI 2H6. An 'X' indicates a connection. A 
product term that is not used is indicated by an 'X' in the small 
AND gate. 
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Figure 6. Implementation of NOT, AND Gates 
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Building the Design File 

Once the design has been conceptualized, the design file must be 
generated. This can be done with any editor program. The only 
requirement is that the editor produce a clean ASCII file. Most 
word processing programs have this capability, so you can use 
your favorite program. Any hidden control characters for format- 
ting may cause problems when assembling; thus only "non- 
document" or "clean" modes should be used. The filename is 
usually given the extension '.PDS' (for PAL device Design Speci- 
fication), although this is not required. 

We now know exactly what our functions are going to be. We 
have twelve inputs, six outputs, and the NAND function requires 
three product terms. Note that if we had specified 

L - /(I*J*K) 

instead of 

L - /I 
+ /J 
+ /K 

for the NAND gate, it would not be as obvious how many product 
terms would be needed. 

We are now in a position to create the design file. First you need 
to enter the PALASM menu. This menu will let you perform all of 
the tasks you need, including editing your design file. Call up the 
menu by typing 

PALASM <CR> 



The first Step is to provide some documentation information. This 
is done with six keywords: 

TITLE (The title of your design goes here) 

PATTERN (The pattern name or number goes here) 

REVISION (The revision number or level goes here) 

AUTHOR (Your name goes here) 

COMPANY (Your company name goes here) 

DATE (The date of creation or modification goes here) 

This is followed by the CHIP declaration, which tells the assem- 
bler which device is being used and what the pin names are going 
to be. We do this by typing: 

CHIP (any name goes here) (the device type goes here) 
(the pin list goes here) 

Thus, we can create the declaration section for our design as 
shown in Figure 7. 



TITLE 




Basic gates 


PATTERN 




POOOO 


REVISION 




A 


AUTHOR 




Stateyour Namehere 


COMPANY 




Name of your Co., Inc 


DATE 




7/22/87 


CHIP GATES 


PAL12H6 


A C D F G 


I 


J K M GND 


N P B E H 


R 


L Q VCC 



Figure 7. The Dociaration Section 



When prompted, hit any key to bring the menu onto the screen. 

Enter the file name of the design file you wish to create in the field 
called "Input PDS file". The cursor should be there when the 
menu appears on the screen. Then hit 

<CR> 

This will take you to the filetype field, which should already say 
".PDS". We recommend that you use ".PDS"for all of your design 
files. Hit 



again. At this time, the menu will say that the file cannot be 
opened; this is just because you have not actually created the file 
yet, so there is no need to be concerned. You can now create your 
design file by hitting 

<F3> 

This will call up your editor, and you can begin building the file. 

We start with what is called the DECLARATION SECTION. This 
section allows you to document your design, and also provides 
some definitions for the assembler. 



Note that the pin list has all pins in order, from pin 1 to pin 20, 
including VCC and GND. You can see that the inputs were 
assigned to pins 1-12 and 19, whereas the outputs have been 
assigned to pins 13-18. Of course these assignments must be 
made knowing which pins are inputs and outputs, as per the logic 
diagram. Knowing that the NAND gate requires three product 
terms, we assigned this function to pin 1 8, since it can provide up 
to four product terms. 

We can make the pin list easier to read by adding the pin numbers 
as comments. Anything following a semicolon (;) is treated as a 
comment, and ignored by the processor. This lielps make the 
design file more readable. The pin list with comments is shown 
in Figure 8. Notice how much easier it is to read. 

CHIP GATES PAL12H6 

;PINS 123456789 10 
ACDFGI JKMGND 

;PINS 11 12 13 14 15 16 17 18 19 20 
NPBEHROLQ VCC 

Figure 8. Pin List with Comments 
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Next we enter the equations, 
keyword 

EQUATIONS 



This section begins with the 



after which all of the equations are entered, exactly as shown in 
Figure 4. Note that the equations may be defined in any order, not 
necessarily in the order presented in the pin list. 

The fact that this is an active-HIGH device does have some 
bearing on the way the file is built. This is discussed at length on 
page 6-19 and in the software documentation; we will need to 
address the issue again below, when we do an active-LOW 
design. For now, suffice it to say that the design as specified here 
will work correctly for an active-HIGH device. 

Finally, add comments to the equations to document the design. 
Liberal commenting is encouraged to make it easier for you and 
others to understand your design in the future. 

The equation section, with comments, is shown in Figure 9. 



EQUATIONS 




B = /A 


; inverter 


E = C*D 


;AND gate 


H ° F 


;0R gate 


+ G 




L = /I 


;NAND gat< 


+ /J 


; DeMorgai 


+ /K 





/M*/N 



R = P*/Q 
+ /P*Q 



;NOR gate after applying 
; DeMorgan's theorem 

;XOR gate, expanded 

Figure 9. The Equation Section 



When you have completed the file, save the file and get out of the 
editor. 

Generating a JEDEC File 

Once the design file has been entered, you can assemble the 
design to get a JEDEC file. We have two purposes here: to make 
sure there are no basic mistakes in the file, and to generate a 
JEDEC file for programming. 

To start processing, hit 

<F5> 



This gives you a choice of processing options on the right of the 
screen. To run all of the processing steps automatically, hit 



The first program will now run, as evidenced by the file scrolling 
up the bottom of the screen. If there are any errors, they will 
appear on the screen and in an error file. These errors will be 
syntactical in nature; typos, misspelled words, missing declara- 
tions, malformed expressions. 

Once the first program has been run with no errors, the equations 
are minimized, it is a good practice to run the expander and 
minimizer for all designs. Minimization may allow you to fit a 
design into a smaller device, it also improves the testability of 
your design. More complicated designs may have to be ex- 
panded and minimized. 

Afterthe minimization has been performed, the JEDEC file will be 
generated, if there are any reported errors, it will likely be 
because you requested some function that the part you chose 
could not provide. Some examples are: 

• Active-HIGH equations in an active-LOW device 

• Too many product terms 

• Registered equations in a device that has no registers 

Since you have used the "autorun" feature, the simulation pro- 
gram will automatically be run. Since we have not specified the 
simulation yet, this will generate an error, ignore the error 
message for now. 

When processing is complete, hit 

<ESO 

to get back to the main menu. 

If there were any errors during processing, you can see what they 
were by viewing the run-time log. This is done by hitting 

<F7> 

to get a choice of things to view. To view the run-time log, hit 



You can scroll down until you find any errors and make a note of 
what needs correcting. Once you have found the error(s), you can 
get back to the menu by hitting 

<ESCXESC> 

Any corrections should be made by editing the design file. After 
making any changes, the design needs to reprocessed. 



2-6 



Monolithic 



m 



Memories 



Beginner's Guide 



The JEDEC file will have the same file name as your design file, 
except that it will have the extension '.JED'. The program also 
creates an 'xplot' file (with file extension '.XPT') which relates the 
final design implementation to the logic diagram of the PLD being 
used. The xplot file is usually not needed, but it confirms the 
physical implementation of the gates. Figure 1 shows a portion 
of the xplot for this design; the inverter and the AND gate are 
shown. You can see how this relates to the logic diagram by 
referring back to Figure 6. 

You may visw the complete xplot file by hitting 

<F7XPgDnXl> 

To return to the main menu after inspecting the xplot file, hit 
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Figure 10. A Portion of an Xplot 
Simulating the Gates 

After you have verified that your design file is correct, it is time to 
verify that the design itself is correct. This is done by simulating 
the design. Simulation provides a way for you to see whetheryour 
design is working as you expect it to. You provide a series of 
commands, or events, which are then simulated by the software. 
If requested, the software can tell you if the simulation matches 
what you expect, and, if not, where the problems are. 

The simulation section is the last part of the design file. It is not 
required, but is invariably helpful both in debugging the design, 
and in generating what can eventually be used as a portion of a 
test vector sequence, it is introduced by the keyword: 

SIMULATION 

The events to be simulated are specified by placing logic levels 
Dn pins of interest. Any pins not specifically mentioned will either 



maintain their present level, or, if no level was ever defined, 
remain undefined. The SETF command is used to set levels. 

We can start by simulating the inverter. To prove that the inverter 
works as expected, first we wish to set input A HIGH to verify that 
the output goes LOW. This is done with the statement: 

SETF A 

Since all we did here was set A HIGH, all other inputs are still 
undefined. However, this is enough to determine the output for 
signal B. 

The simulator will calculate the value for B and put it in the output 
file (to be discussed later), but it is helpful to write what you expect 
B to be. if the simulator calculates a result that is different from 
what you expect, it will alert you, and place a marker in the output 
file to tell you where the problem is. You can do this with the 
CHECK statement. Since A is HIGH here, we expect B to be 
LOW. To verify that B is indeed LOW, use the statement: 

CHECK /B 

Note that CHECK statements are not required for simulation. 
However, without them, you must examine the simulation output 
to see if the design functioned correctly. 

Wealsoneedto verify that when A goes LOW, B goes HIGH, as 
expected. This can be done with the pair of statements: 

SETF /A 
CHECK B 

We have now fully exercised the inverter. However, the standard 
output file forthe simulator, called the history file, tracks all signals 
on the device, in increasing pin order. From Figure 8, this would 
be in the order 

ACDFGI JKMNPBEHROLQ 

instead, we may wish to place related signals near each other. 
We can do this with the TRACE_ON and TRACE_OFF com- 
mands. These commands cause a second output file to be 
generated, which will only show signals and events as deter- 
mined by the TRACE_ON command. 

in our case, we would like to have the output of the gate appear 
right after its inputs, so that the workings of the gate can be easily 
examined. This can be done with the command: 

TRACE_ON ABCDEFGHIJKLMNOPQR 

At the end of the simulation, the TRACE_OFF command is used 
to end the special trace. 

Note that both a history file and a trace file will now be generated. 
The history file will have the simulation results in increasing pin 
order; the trace file will have the signals in the order requested 
with the TRACE ON command. 
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Now we can exercise all of the other gates in the circuit. The full 
simulation section is shown in Figure 11. 



TRACE ONABCDEFGHIJKLMNOPQR 



;look at the inverter 
SETF A 
CHECK /B 
SETF /A 
CHECK B 



;look at AND gate 

SETF /C /D 

CHECK /E 

SETF D 

CHECK /E 

SETF C /D 

CHECK /E 

SETF D 

CHECK E 



;look at OR gate 

SETF /F /G 

CHECK /H 

SETF G 

CHECK H 

SETF F 

CHECK H 

SETF /G 

CHECK H 



;look at NAND gate 

SETF /I /J /K 

CHECK L 

SETF K 

CHECK L 

SETF J /K 

CHECK L 

SETF K 

CHECK L 

SETF I /J /K 

CHECK L 

SETF K 

CHECK L 

SETF J /K 

CHECK L 

SETF K 

CHECK /L 



;look at NOR gate 

SETF /M /N 

CHECK 

SETF N 

CHECK /O 

SETF M 

CHECK /O 

SETF /N 

CHECK /O 



;look at XOR gate 

SETF /P /Q 

CHECK /R 

SETF Q 

CHECK R 

SETF P /Q 

CHECK R 

SETF Q 

CHECK /R 



;set A HI 

; verify that B is LO 

;set A LO 

;verify that B is HI 

;end of inverter trace 



set C, D LO 

verify E LO 

C stays LO, D goes HI 

E should stay LO 

set C HI, D LO 

E should stay LO 

C stays HI, D goes HI 

E should now be HI 

end of AND gate trace 



;set F, G LO 

; verify H LO 

;F stays LO, G goes HI 

;H should go HI 

;G stays HI, F goes HI 

;H should stay HI 

;F stays HI, G goes LO 

;H should stay HI 

;end of OR gate trace 



IJK = 000 

verify L HI 

IJK = 001 

L should still be HI 

IJK = 010 

L should still be HI 

IJK = Oil 

L should still be HI 

IJK = 100 

L should still be HI 

IJK = 101 

L should still be HI 

IJK = 110 

L should still be HI 

IJK = 111 

L should go LO 

end of NAND gate trace 



set M, N LO 

verify HI 

M stays LO, N goes HI 

should go LO 

M, N now both HI 

should stay LO 

M stays HI, N goes LO 

should stay LO 

end of NOR gate trace 



set P, Q LO 

verify R LO 

P stays LO, Q goes HI 

R should go HI 

now P HI, Q LO 

R should stay HI 

both inputs HI 

R should go LO 



This completes the simulation. The entire design file is shown in 
Figure 21 at the end of this section. You can now run the 
simulation to make sure that your design is correct. 

Running tlie Simulator 

After adding the simulation, the file needs to be reprocessed. Do 
this again by hitting 

<F5> 

and then hitting 

6 

This completely reprocesses the file and runs the simulation. 

Each of the events specified in the simulation section of the 
design file is executed. The resulting outputs are calculated, and, 
if CHECK statements are used, the calculated outputs are com- 
pared to the expected outputs. If the two values do not match, 
then you are expecting an output that the circuit is not generating. 
This means either that the expected values are wrong, or that 
there is a problem with the circuit itself. In either case, the design 
file must be modified to fix the problem, and then the file should 
be completely reprocessed. 

The main output of the simulator is the history file. In addition, a 
trace file will be generated if the TRACE_ON statement was used, 
as in the example above. These files will have the same name as 
your original design, except that the extension for the history file 
is '.HST and that for the trace file is '.TRF'. 

You may look at the history and trace files with your editor, or print 
them out for closer analysis. Note that the history file contains the 
results of every signal and every event in increasing pin order, 
whereas the trace file contains signals in the order specifically re- 
quested by the TRACE_ON commands. 

It is usually easier to look at the simulation output as waveforms. 
To generate the waveforms, hit 

<F7> 

To see the waveforms for the trace file, hit 



Once the waveforms are on the screen, you may scroll around 
using the arrow keys. The vertical bar cursor will help you line up 
events. 

After you are through examining the file, hit 

<ESO 

to get out of the waveform generator, and then, 

<ESO 

to get back to the main menu. 



TRACE OFF 



;end of trace 

Figure 11. The Simulation Section 
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This makes it much easier to debug your design. You can iool< at 
the entire simulation in the history file, or break it up into manage- 
able bits in the trace file. 

The simulator also converts the simulation results into test 
vectors, and appends the vectors to the JEDEC file. This gives 
you two JEDECf lies; one with vectors and one without. The new 
file has the same name as your original design file, but with the 
extension '.JDC. This file can be used with programmers that 
provide functional tests. 

Constructing a Registered Design — 
Basic Flip-Flops 

Next we will do a very simple registered design: we will be 
designing all of the basic flip-flop types (Figure 12). We will 
conceptualize the design by reviewing briefly the behavior of the 
D-type flip-flop. We will then present the results for T, J-K, and S- 
R flip-flops. More detail on these flip-flop types can be found in 
the logic reference, on page 6-8. 

Rememberthat the devices we will be using only have D-type flip- 
flops. Thus we will be emulating the other flip-flops with D-type 
flip-flops. 



CLK 
D 



dCq 
> pQ 



T C Q 

> pQ 



'-> 



J C Q 



>— > 



8 C Q 



Building the D-TVpe Flip-Flop 
Equations 

A D-type flip-flop merely presents the input data at the output after 
being clocked. Its basic transfer function can be expressed as 

DT := D 

where we have used pins DT ("D True") and D as shown in 
Figure 12. 

Note the use of ':=' here instead of '='. This indicates that the 
output is registered for this equation. The difference is illustrated 
in Figure 13. 

















D Q 

> 






CLK 

























Figure 12. Basic Flip-Flops 



Figure 13. Registered vs. Combinatorial Equations 

We can also generate the complement signal (named DC) with 
the statement 

DC := /D 

As per Figure 12, we want to add synchronous preset and clear 
functions to the flip-flops. This can be done with two input pins, 
called PR and CLR. To add these functions to the true flip-flop 
signal, we add /CLR to every product term and add one product 
term consisting only of PR. Likewise, for the complement 
functions, we add /PR to every product term, and add one product 
term consisting only of CLR. This is shown in Figure 14. 

DT :- D*/CLR 
+ PR 

DC := /D*/PR 
-I- CLR 

Figure 14. D-Type Flip-Flops with Clear and Preset 
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In this way, when clearing the flip-flops, the adive-HIGH flip-flops 
have no product terms true, and go LOW; the active-LOW flip- 
flops have the last product term true, and will therefore go HIGH. 
The reverse will occur for the preset function. 

There is still one hole in this design: what happens if we preset 
and clear at the same time? As it is right now, both outputs will go 
HIGH. This makes no sense since one signal is supposed to be 
the inverse of the other. To rectify this, we can give the clear 
function priority over the preset function. We can do this by 
placing /CLR on every product term for the true flip-flop signal. 
The results are shown in Figure 15. 



DT :- D*/CLR 
+ PR* /CLR 



DC :- /D*/PR 
+ CLR 



Figure 15. D-iype Fllp-Flopswith Clear Priority 

Building the Remaining Equations 

The same basic procedure can be applied to all of the other flip- 
flops. The equations are shown in Figure 16. 



EQUATIONS 

; emulating all flip-flops with D-type flip-flops 



DT 


:- 


D*/CLR 




+ 


PR* /CLR 


DC 


:. 


/D*/PR 




+ 


CLR 


TT 


:" 


T*/TT*/CLR 




+ 


/T*TT*/CLR 




+ 


PR* /CLR 


TC 


._ 


T*/TC*/PR 




+ 


/T*TC*/PR 




+ 


CLR 


JKT 


:=, 


J*/JKT*/CLR 




+ 


/K*JKT*/CLR 




+ 


PR* /CLR 


JKC 


i" 


/J* /JKC* /PR 




+ 


K* /JKC* /PR 




+ 


CLR 


SRT 


._ 


S*/CLR 




+ 


/R*SRT*/CLR 




+ 


PR* /CLR 


SRC 


:» 


R*/PR 




+ 


/S*SRC*/PR 




+ 


CLR 



; output is D if not clear 

;or 1 if preset and not clear at the same time 

; output is /D if not preset 
;or 1 if clear 

;go HI if toggle and not clear 

;stay HI if not toggle and not clear 

;go HI if preset and not clear at the same time 

;go HI if toggle and not preset 
;stay HI if not toggle and not preset 
;go HI if clearing 

;go HI if J and not clear 

;stay HI if not K and not clear 

;go HI if preset and not clear at the same time 

;go HI if not J and not preset 
;stay HI if K and not preset 
;go HI if clear 

;go HI if set and not clear 

;stay HI if not reset and not clear 

;go HI if preset and not clear at the same time 

;go HI if reset and not preset 

; stay HI if not set and not preset 

;go HI if clear 

Figure 16. Flip-Fiop Equations Section 
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PR 




[Tfl:=T'[?TTlvCLR 
+ PRVCLR 



Figure 17. Feedback in tlie Equation for TT 



Notice that in some of tlie equations above, the output signal itself 
shows up in the equations. This is the way in which feedbackfrom 
the flip-flop can be used to determine the next state of the flip-flop. 
An equivalent logic drawing of the TT equation is shown in 
Figure 17. 

Completing the Design File 

We are now in a position to select a device and complete the 
device file. The device used will be a PALI 6R8, which is actually 
an active-LOW device. The logic diagram for this device can be 
found on page 5-47. The implications of an active-LOW device 
are discussed more fully on page 6-1 9. For now, suffice it to say 
that one way of implementing active-LOW logic is to declare 
outputs in the pin list with a slash {'f) in front of them. This 
essentially defines them as 'negative' or 'inverted' pins. 

The declaration section is built up just like the last one, and is 
shown in Figure 18. 



TITLE 


Basic flip-flops 


PATTERN 


POOOl 


REVISION 


A 


AUTHOR 


Stateyour Namehere 


COMPANY 


Nameofyour Co . , Inc 


DATE 


7/22/87 



CHIP Flip-flops PAL16R8 

;PINS 1 2345 6 789 10 

CLK J K T PR CLR D S R GND 

;PINS 11 12 13 14 15 

OE /SRC /SRT /DC /DT 

;PINS 16 17 18 19 20 

/TC /TT /JKC /JKT VCC 

Figure 18. Basic Fiip-Flops Deciaration Section 

Notice that this device has aclock and an output enable pin, called 
CLK and OE, respectively. The input and output pins are named 
as shown in Figure 12. The output pins are all defined with 
slashes, to indicate that they are inverted pins. 



The design can now be processed by hitting 

<F5> 

and then hitting 



This will tell you whether there are any basic problems with your 
design. You can correct any mistakes by editing the file and then 
reprocessing. 

Simulating the Flip-Flops 

After processing the design and correcting any mistakes, we can 
write the simulation. We have all of the simulation instructions we 
need, except for one new instruction which simplifies manipula- 
tion of the clock signal. It is possible to use SETF instructions with 
the clock pin, but then each clock transition would require two 
instructions: one to set the clock HIGH, and one to bring it 
back LOW. 

Instead, we can use the CLOCKF instruction. This pulses a clock 
pin in one instruction. Of course, registered outputs will not 
change state until after the rising edge of the clock signal. Since 
we have named the clock pin CLK, we can clock this device with 
the instruction 

CLOCKF CLK 

We can simulate the true D-type flip-flop as follows: 

SETF D ;set the D input HI 

CLOCKF CLK ; clock the device 

CHECK DT ; verify that the output went HI 

SETF /D ;set the D input LO 

CLOCKF CLK ; clock the device 

CHECK /DT ; verify that the output went LO 

Before we actually simulate any registered design like this, two 
items must be initialized: the clock and the output enable pin (OE). 
Since the CLOCKF statement executes a HIGH-LOW pulse on 
the clock pin, we must first make sure that the clock is set LOW 
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to begin with. We must also enable tlio outputs, which is done by 
setting the OE pin LOW. These tasl<s can be accomplished with 
the statement: 

SETF /CLK /OE 

We also need to initialize the flip-flops. At the same time we can 
verify that the CLR function is working correctly: 



SETF CLR /PR 
CLOCKF CLK 



; set the clear pin 
; clear the circuit 



;make sure the clear function worked 
CHECK /DT DC /TT TC /JKT JKC /SRT SRC 
SETF /CLR ; remove the clear signal 

We are now in a position to simulate the entire circuit. A thorough 
simulation is shown in Figure 19. 



SIMULATION 

TRACE ON CLR PR D DT DC T TT TC JK JKT JKC SR SRT SRC 



; initialize the circuit 
SETF CLR /PR 
CLOCKF CLK 



;set the clear pin 
; clear the circuit 



;make sure the clear function worked 

CHECK /DT DC /TT TC /JKT JKC /SRT SRC 

SETF /CLR /remove the clear signal 

; check out the preset function 

SETF PR ;set the preset pin 

CLOCKF CLK /preset the circuit 

CHECK DT /DC TT /TC JKT /JKC SRT /SRT 

SETF /PR ' /remove the preset signal 

/verify that clear has priority 

SETF PR CLR / set both clear and preset 

CLOCKF CLK 

CHECK /DT DC /TT TC /JKT JKC /SRT SRC 

SETF /PR /CLR /remove both preset, clear 

/disable all flip-flop inputs for now 
SETF /D /T /J /K /S /R 

/check out D-type flip-flops 

SETF D /set the D input HI from LO 

CLOCKF CLK /clock the device 

CHECK DT /DC /verify that the true output went HI 

/and the conplement went LO 



SETF D 
CLOCKF CLK 
CHECK DT /DC 

SETF /D 
CLOCKF CLK 
CHECK /DT DC 



SETF /D 
CLOCKF CLK 
CHECK /DT DC 



/hold a HI 

/verify that state maintained 

/set the D input LO from HI 

/ clock the device 

/verify that the true output went LO 

/and complement went HI 

/hold a LO 

/verify that state maintained 



/check out T-type flip-flops 
SETF /T /hold a LO 

CLOCKF CLK 

CHECK /TT TC /true output should still be LO 

/complement output HI 



SETF T 
CLOCKF CLK 
CHECK TT /TC 



/toggle from a LO 

/both outputs should have changed state 



Figure 19. Fllp-Flop Simulation Section 
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SETF /T 
CLOCKF CLK 
CHECK TT /TC 

SETF T 
CLOCKF CLK 
CHECK /TT TC 



;hold a HI 

;both outputs should have held their state 

; toggle from a HI 

;both outputs should have changed state 



; check out J-K flip-flops 
SETF /J /K 
CLOCKF CLK 
CHECK /JKT JKC 

SETF J /K 
CLOCKF CLK 
CHECK JKT /JKC 

SETF /J /K 
CLOCKF CLK 
CHECK JKT /JKC 

SETF /J K 
CLOCKF CLK 
CHECK /JKT JKC 

CLOCKF CLK 
CHECK /JKT JKC 

SETF J K 
CLOCKF CLK 
CHECK JKT /JKC 

SETF J /K 
CLOCKF CLK 
CHECK JKT /JKC 

SETF J K 
CLOCKF CLK 
CHECK /JKT JKC 



;hold a LO 

;both outputs should have held their state 

;set a HI 

/outputs should have changed 

;hold a HI 

;both outputs should have held their state 

; reset a LO 

; outputs should have changed 

; reset an output that is already LO 
;iTiake sure that the outputs didn't change 

; toggle from a LO 

(•verify that outputs changed 

;set an output that is already HI 

;make sure the outputs didn't change 

(•toggle from a HI 

; verify that outputs changed 



; check out S-R flip-flops 
SETF /S /R 
CLOCKF CLK 
CHECK /SRT SRC 

SETF S /R 
CLOCKF CLK 
CHECK SRT /SRC 

SETF /S /R 
CLOCKF CLK 
CHECK SRT /SRC 

SETF S /R 
CLOCKF CLK 
CHECK SRT /SRC 

SETF /S R 
CLOCKF CLK 
CHECK /SRT SRC 

CLOCKF CLK 
CHECK /SRT SRC 

TRACE OFF 



;hold a LO 

;both outputs should have held their state 

; set a HI 

; outputs should have changed 

;hold a HI 

;both outputs should have held their state 

;set an output that is already HI 

;make sure the outputs didn't change 

; reset a LO 

; outputs should have changed 

; reset an output that is already LO 
;make sure that the outputs didn't change 



Figure 19. Flip-FIop Simulation Section (Cont'd.) 
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The file can now be simulated in the same manner as the basic 
gates design. The simulation results can be viewed either by 
examining the history or trace file, or by generating waveforms. 
The complete design file is shown in Figure 22 at the end of this 
section. 

Programming a Device 

After simulating the design, and verifying that it works, it is time to 
program a device. There are several steps to programming, but 
the exact operation of the programmer naturally depends on the 
type of programmer being used. We will be as explicit as we can 
here, but you will need to refer to your programmer manual for the 
specifics. 

The first thing that must be done after turning the programmer on 
is to select the device type. This tells the programmer what kind 
of programming data to expect. The device type is usually 
selected either from a menu or by entering a device code. Your 
programmer manual will have the details. 

Next a JEDEC file must be downloaded. To transfer the JEDEC 
file from the computer to your programmer, you will need to 
provide a connection, as shown in Figure 20. This is normally 
done with an RS-232 cable connected between the 
programmer's port and a serial port on the computer (usually 
C0M1). 




Figure 20. A Connector Must Be Provided Between the 
Computer and the Programmer 

If your programmer can perform functional tests, and you wish for 
those tests to be performed, you should download the '.JDC file; 
otherwise you should download the '.JED' file. 



To download data, the programmer must first be set up to receive 
data. The programmer manual will tell you how to do this. The 
data can then be sent from the computer by hitting 

<F4> 

This sets up communication between the computer and the 
programmer. Whichevercommunication program is installed will 
be invoked. This is used to transmit the JEDEC file to the 
programmer. The details below assume you are using PC2;if not, 
follow the instructions for your program to accomplish the same 
steps. 

Before actually sending the data, you must verify the correct com- 
munication protocol. Check to make sure you know what protocol 
the programmer is expecting; then hit the <F2> key on the 
computer. This allows you to set up the baud rate, data bits, stop 
bits, and parity. 

Once the protocol has been set up, hit the <F1> key. You will be 
asked for the file name; enterthe name of the JEDECfile you wish 
to use. The computer will then announce that it is sending the 
data, and tell you when it isf inished. Note that just because it says 
it has finished sending data does not mean that the data was 
received. Your programmer will indicate whether or not data was 
received correctly. 

Once the data has been received, the programmer is ready to 
program a device. Place a device in the appropriate socket, and 
follow the instructions for your programmer to program the 
device. This procedure programs and verifies the connections in 
the device, and, if a '.JDC file was used, will perform a functional 
test. 

The programmer will announce when the programming proce- 
dure has been completed. You may then take the device and plug 
it into your application. 

If you have actually programmed one of the examples that we 
created above, you naturally don't have a board into which you 
can plug the device. If you do have a lab setup, you may wish to 
play with the devices to verify for yourself that the devices perform 
just as you expected them to. 

You will find much more detail on many issues that were not 
discussed in this section in the remaining sections of this hand- 
book. This section should have provided you with the basic 
knowledge you need to understand the remaining design ex- 
amples in this book, and to start your own designs. 
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TITLE 


Basic gates 


PATTERN 


POOOO 


REVISION 


A 


AUTHOR 


Stateyour Namehere 


COMPANY 


Nameofyour Co., Inc 


DATE 


7/22/87 



CHIP GATES PAL12H6 

;PINS 123456789 10 
ACDFGIJKM GND 

;PINS 11 12 13 14 15 16 17 18 19 20 
NPBEHROLQ VCC 



EQUATIONS 
B = /A 



C*D 



= F 
+ G 



L = /I 
+ /J 
+ /K 

= /M*/N 

R = P*/Q 
+ /P*Q 

SIMULATION 



; inverter 
;AND gate 
;0R gate 

;NAND gate after applying DeMorgan's theorem 

;N0R gate after applying DeMorgan's theorem 
;XOR gate, expanded 



TRACE ONABCDEFGHIJKLMNOPQR 



;look at the inverter 
SETF A 
CHECK /B 
SETF /A 
CHECK B 



;look at AND gate 

SETF /C /D 

CHECK /E 

SETF D 

CHECK /E 

SETF C /D 

CHECK /E 

SETF D 

CHECK E 



;look at OR gate 

SETF /F /G 

CHECK /H 

SETF G 

CHECK H 

SETF F 

CHECK H 

SETF /G 

CHECK H 



;set A HI 

/verify that B is LO 

;set A LO 

; verify that B is HI 

;end of inverter trace 

;set C, D LO 

/verify E LO 

;C stays LO, D goes HI 

;E should stay LO 

;set C HI, D LO 

;E should stay LO 

;C stays HI, D goes HI 

;E should now be HI 

;end of AND gate trace 

;set F, G LO 

/verify H LO 

;F stays LO, G goes HI 

;H should go HI 

;G stays HI, F goes HI 

;H should stay HI 

;F stays HI, G goes LO 

;H should stay HI 

;end of OR gate trace 



Figure 21. Complete Basic Gates Design File 
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;look at NAND gate 

SETF /I /J /K 

CHECK L 

SETF K 

CHECK L 

SETF J /K 

CHECK L 

SETF K 

CHECK L 

SETF I /J /K 

CHECK L 

SETF K 

CHECK L 

SETF J /K 

CHECK L 

SETF K 

CHECK /L 



IJK = 000 

verify L HI 

IJK = 001 

L should still be HI 

IJK - 010 

L should still be HI 

IJK = Oil 

L should still be HI 

IJK ° 100 

L should still be HI 

IJK = 101 

L should still be HI 

IJK " 110 

L should still be HI 

IJK = 111 

L should go LO 

end of NAND gate trace 



;look at NOR gate 

SETF /M /N 

CHECK 

SETF N 

CHECK /O 

SETF M 

CHECK /O 

SETF /N 

CHECK /O 



rset M, N LO 

r verify HI 

;M stays LO, N goes HI 

;0 should go LO 

;M, N now both HI 

rO should stay LO 

;M stays HI, N goes LO 

;0 should stay LO 

;end of NOR gate trace 



/look at XOR gate 
SETF /P /Q 
CHECK /R 
SETF Q 
CHECK R 
SETF P /Q 
CHECK R 
SETF Q 
CHECK /R 



;set P, Q LO 

; verify R LO 

;P stays LO, Q goes HI 

;R should go HI 

;now P HI, Q LO 

;R should stay HI 

;both inputs HI 

;R should go LO 



TRACE OFF 



rend of trace 



Figure 21. Complete Basic Gates Design File (Cont'd.) 
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TITLE 


Basic flip-flops 


PATTERN 


POOOl 


REVISION 


A 


AUTHOR 


Stateyour Namehere 


COMPANY 


Nameof your Co . , Inc 


DATE 


7/22/87 



CHIP Flip-flo PAL16R8 

;PINS 1 2345 6 789 10 
CLK J K T PR CLR D S R GND 

;PINS 11 12 13 14 15 
OE /SRC /SRT /DC /DT 

;PINS 16 17 18 19 20 
/TC /TT /JKC /JKT VCC 



EQUATIONS 

/emulating all flip-flops with D-type flip-flops 



DT := D*/CLR 
+ PR* /CLR 



/output is D if not clear 

;or 1 if preset and not clear at the same time 



DC := /D*/PR 
+ CLR 



/output is /D if not preset 
/or 1 if clear 



TT 



:= T*/TT*/CLR 
+ /T*TT*/CLR 
+ PR* /CLR 



/go HI if toggle and not clear 

/stay HI if not toggle and not clear 

/go HI if preset and not clear at the same time 



:= T*/TC*/PR 
+ /T*TC*/PR 
+ CLR 



/go HI if toggle and not preset 
/stay HI if not toggle and not preset 
/go HI if clearing 



JKT 



:= J* /JKT* /CLR 
+ /K*JKT*/CLR 
+ PR* /CLR 



/go HI if J and not clear 

/stay HI if not K and not clear 

/go HI if preset and not clear at the same time 



JKC 



:= /J*/ JKC* /PR 
+ K* JKC* /PR 
+ CLR 



;go HI if not J and not preset 
;stay HI if K and not preset 
/go HI if clear 



SRT := S*/CLR 

+ /R*SRT*/CLR 
+ PR* /CLR 



rgo HI if set and not clear 

rstay HI if not reset and not clear 

rgo HI if preset and not clear at the same time 



:= R*/PR /go HI if reset and not preset 

+ /S*SRC*/PR /stay HI if not set and not preset 

+ CLR /go HI if clear 

Figure 22. Complete Basic Flip-Fiops Design File 
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SIMULATION 

TRACE ON CLR PR D DT DC T TT TC JK JKT JKC SR SRT SRC 



/•initialize the circuit 
SETF CLR /PR 
CLOCKF CLK 



; set the clear pin 
; clear the circuit 



;make sure the clear function worked 

CHECK /DT DC /TT TC /JKT JKC /SRT SRC 

SETF /CLR ; remove the clear signal 

; check out the preset function 

SETF PR ;set the preset pin 

CLOCKF CLK ; preset the circuit 

CHECK DT /DC TT /TC JKT /JKC SRT /SRT 

SETF /PR /remove the preset signal 

; verify that clear has priority 

SETF PR CLR ; set both clear and preset 

CLOCKF CLK 

CHECK /DT DC /TT TC /JKT JKC /SRT SRC 

SETF /PR /CLR /remove both preset, clear 

/disable all flip-flop inputs for now 
SETF /D /T /J /K /S /R 

/check out D-type flip-flops 

SETF D /set the D input HI from LO 

CLOCKF CLK /clock the device 

CHECK DT /DC 



SETF D 
CLOCKF CLK 
CHECK DT /DC 

SETF /D 
CLOCKF CLK 
CHECK /DT DC 

SETF /D 
CLOCKF CLK 
CHECK /DT DC 



/verify that the true output went HI and the coitplement went LO 

/hold a HI 

/verify that state maintained 

/ set the D input LO from HI 

/clock the device 

/verify that the true output went LO and complement went HI 

/hold a LO 

/verify that state maintained 



/check out T-type flip-flops 
SETF /T 
CLOCKF CLK 
CHECK /TT TC 



SETF T 
CLOCKF CLK 
CHECK TT /TC 

SETF /T 
CLOCKF CLK 
CHECK TT /TC 

SETF T 
CLOCKF CLK 
CHECK /TT TC 



/hold a LO 

/true output should still be LO, cortplement output HI 
/toggle from a LO 

/both outputs should have changed state 
/hold a HI 

/both outputs should have held their state 
/toggle from a HI 

/both outputs should have changed state 
Figure 22. Complete Basic Flip-Fiops Design File (Cont'd.) 
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; check out J-K flip-flops 
SETF /J /K 
CLOCKF CLK 
CHECK /JKT OKC 

SETF J /K 
CLOCKF CLK 
CHECK JKT /JKC 

SETF /J /K 
CLOCKF CLK 
CHECK JKT /JKC 

SETF /J K 
CLOCKF CLK 
CHECK /JKT JKC 

CLOCKF CLK 
CHECK /JKT JKC 

SETF J K 
CLOCKF CLK 
CHECK JKT /JKC 

SETF J /K 
CLOCKF CLK 
CHECK JKT /JKC 

SETF J K 
CLOCKF CLK 
CHECK /JKT JKC 



;hold a LO 

;both outputs should have held their state 

; set a HI 

/outputs should have changed 

;hold a HI 

;both outputs should have held their state 

; reset a LO 

; outputs should have changed 

; reset an output that is already LO 
;make sure that the outputs didn't change 

(•toggle from a 1X3 

; verify that outputs changed 

; set an output that is already HI 

.•make sure the outputs didn't change 

; toggle from a HI 

; verify that outputs changed 



/check out S-R flip-flops 
SETF /S /R 
CLOCKF CLK 
CHECK /SRT SRC 

SETF S /R 
CLOCKF CLK 
CHECK SRT /SRC 

SETF /S /R 
CLOCKF CLK 
CHECK SRT /SRC 

SETF S /R 
CLOCKF CLK 
CHECK SRT /SRC 

SETF /S R 
CLOCKF CLK 
CHECK /SRT SRC 

CLOCKF CLK 
CHECK /SRT SRC 

TRACE OFF 



;hold a LO 

;both outputs should have held their state 

; set a HI 

/outputs should have changed 

/hold a HI 

/both outputs should have held their state 

/ set an output that is already HI 

/make sure the outputs didn't change 

/reset a LO 

/outputs should have changed 

/reset an output that is already LO 
/make sure that the outputs didn't change 



Figure 22. Complete Basic Fiip-Flops Design File (Cont'd.) 
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Programmable logic devices (PLDs) are used in digital systems 
design for implementing a wide variety of logic functions. These 
logicf unctions range from simple random logic replacement (dis- 
cussed earlier in the Beginner's Guide to PLDs on page 2-2) to 
complex control sequencers (discussed in the subsequent sec- 
tions). Programmable logicdevicesofferthe multiple advantages 
of low cost, high integration, ease of use, and easier design 
debugging capability not available in other systems design op- 
tions. These are described in the introduction (on page 1 -1 ) to this 
handbook. In the following discussion we will detail the PLD 
design process. 

Most PLDs have an AND-OR array structure with programmable 
connections in either or both of the arrays. A programmable array 
implies that the connections can be programmed by the user. 
PLDs are classified depending on which of the two arrays is 
programmable or fixed. The popular PAL (Programmable Array 
Logic) devices have a programmable AND array and a fixed OR 
array. PAL devices are used for a wide variety of combinatorial 
and registered logic functions. PROM (Programmable Read- 
only Memory) devices — used often as memory and seldom as 
logic — have a fixed AND array and a programmable OR array. 
Both arrays are programmable in PLS (Programmable Logic Se- 
quencer) devices. These devices are used for special state 
machine applications discussed on page 2-1 1 7. In addition, there 
are other programmable logic devices that combine program- 
mable arrays with dedicated logic for providing optimal function- 
ality for a specific system application. The PROSE (PROgram- 
mable SEquencer) device is one such device, with its architec- 
ture optimized for state machine designs. A discussion of the 
various PLD architectures is included on page 1 -7. In this discus- 
sion we will also examine the various design constraints to be 
considered when selecting the correct architecture for a given 
application. 

All digital logic can be efficiently reduced to two fundamental 
gates, AND and OR, provided boih true and complement versions 
of all input signals are available. Such logic is generally built 
around what is known as the sum-of-products (AND-OR) form. 
Please see page 6-1 for details on sum-of-products form and 
Boolean logic. Programmable logic devices are ideal for imple- 
menting such two-stage logic in the AND and OR arrays. 

Various process technologies offer many design options for 
PLDs. The connections in the programmable arrays can be fuse- 
based, commonly used in both ECL and TTL bipolar technolo- 
gies. E/EEPROM cell-based in UV-EPROM and EEPROM 
CMOS technologies and RAM cell-based in CMOS RAM technol- 
ogy. ECL PLDs are used for very high-speed designs (greater 
than 125 MHz bandwidth), while CMOS is used for low-power 
designs. Bipolar TTL fuse-based PLDs coverthe vast midrange 
3f applications and are the most popular PLDs. The selection of 
echnology is mostly dependent upon the system speed and 
jower constraints. Most design engineers are familiar with these 



constraints, which not only dictate the technology of PLDs but of 
all other logic used in a system. In the following discussion, we 
will assume TTL fuse-based technology for simplicity. A detailed 
discussion of the technology is included on page 3-130. 

Designing with PLDs involves the use of design software and a 
device programmer (Figure 1). The design software eliminates 
the need to identify every connection to be programmed for 
implementing the desired sum-of-products logic. The design 
process begins with the creation of a design file which specifies 
the desired function. The function is typically represented by its 
sum-of-products form and can be derived directly from the timing 
diagram and/or truth tables. Occasionally Karnaugh maps and 
state diagrams are also used. The design file is then assembled 
to produce the "JEDECfile. The JEDEC file gets its name from 
the fact that it is an approved JEDEC standard for specifying the 
state of every connection on the device. Simulation can then be 
performed. If the design is correct, the JEDEC file is downloaded 
into a device programmer for programming the connections on 
the device. The device can then be plugged into the PC board 
where it will function. The entire procedure can often be per- 
formed with the designer never having to leave the desk. Most 
programmers interface to personal computers, so that the design 
file can be edited, assembled, simulated, and downloaded, and 
the device programmed, all in one place. 





PLD 




Figure 1 . PLDs are Designed using Software 
and a Device Programmer 
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The first stage in a PLD design process (Figure 2) is the concep- 
tualization of a design problem; the second is the selection of the 
correct device; the third is the implementation of the design, which 
also includes simulating the design with test vectors; and finally, 
the actual programming and testing on a system board. We will 
take a simple design example and go through the various stages 
of this design process. 



CONCEPTUALIZE A 
DESIGN PROBLEM 




• ' 




SELECT 
DEVICE 
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IMPLEMENT 
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PLD 
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TEST 
PLD 






' 
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INTO BOARD 





Conceptualizing a Design 

The first step in the PLD design process is also required for any 
SSI/MSI design. An advantage of PLDs is that at this stage the 
designer needs to be concerned only with the required logic 
function. With SSI or MSI, various device logic limitations must 
be accounted for before the design can be started. Clearly a 
designer needs to develop a brief and complete functional de- 
scription, based upon the system design requirements. 

We will take the example of a simple address decoder circuit 
required for a 68000 microprocessor. The microprocessor has 24 
address lines along with separate read and write signals. It 
requires some ROM to store the boot-up code as well as some 
RAM for storing and executing programs. The purpose of the 
address decoder circuitry is to select one of the memory ad- 
dresses at a time. The RAMs and ROMs are assigned addresses 
on the 68000 microprocessor address space. The address 
decoder circuit has to select one of the RAMs or ROMs for a 
specific range of addresses, called the address space. This 
selection is accomplished by asserting the specific chip-select 
signal for the RAM or ROM when the microprocessor accesses 
one of the addresses in the address space. There is additional 
circuitry in a typical microprocessor system for addressing I/O 



Figure 2. Programmable Logic Device Design Process 
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devices (such as disk controllers). These devices also require 
that chip-select signals be asserted when the microprocessor 
addresses them. Figure 3 shows an example address map for a 
68000 microprocessor. 

Figure4shows the circuitdiagram. The address signalsfrom the 
68000 microprocessor are inputs to the interface logic block. The 
outputs generated are R0MCS1 , R0MCS2, and RAMCS. The 
generation of signals for selecting device !/0s is similar and is not 
shown here for the sake of simplicity. Other system inputs to the 
interface are the address strobe signal generated by the 68000 
microprocessor as well as the read/write signal. The truth table 
for generating the outputs is shown in Figure 5. This truth table 
is derived from the memory address map and the functional de- 
scription of the design. 



ADDRESSES 
HEX 


SIZE 


A23 


A22 


A21 


SIGNAL 


00000-OFFFF 
1 0000-1 FFFF 
20000-2FFFF 
30000-3FFFF 
40000-4FFFF 
50000-5FFFF 


1 Mbytes 
1 Mbytes 
1 Mbytes 
1 Mbytes 
1 Mbytes 
1 Mbytes 







1 

1 





1 
1 







1 



1 



1 




R0MCS1 


R0MCS2 


RAMCS 


RAMCS 


RAMCS 


RAMCS 



Figure 5. Truth Table for Chip-Select Signals 

Device Selection Considerations 

Thefirsttaskforthe designer is to identify the design problem and 
classify it as a combinatorial function or a registered function, 
depending upon whether or not registers are required. In most 
cases, this decision depends upon the functional nature of the 
problem. Sometimes timing and logic considerations can also 
dictate the use of registers; this will be discussed later. Registers 
are usually not required for such simple combinatorial functions 
such as encoders, decoders, multiplexers, demultiplexers, ad- 
ders, and comparators. Registers are required, however, for 
functions such as counters, timers, control signalgeneratlon, and 
state machines. No registers are required for this simple address 
decoding example. 

A brief look at the places where various architectural features are 
used will help us select a device for our design example. Based 
on historic record of usage and architectural functionality, PLDs 
can be classified into three basic application segments. Figure 6 
shows the optimal placement of PAL, PLS, and PROSE/FPC 
(Fuse Programmable Controller) devices for these three seg- 
ments; combinatorial designs, simple registered designs and 
complex registered (sequencer) designs. This classification also 
shows the available functionality and device speeds. These are 
rough estimates that most designers implicitly assume and de- 
velop with experience. These estimates vary drastically among 
different designers, based on personal preferences and experi- 
ence. They have been included here to provide a complete road 
map for device selection and should be considered strictly as a 
rule of thumb. 
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Based on these criteria, the best choice for our combinatorial 
design would be a PAL device. The task now is to select a PAL 
device for implementing the desired function. The main function- 
specific selection considerations are shown in Figure 7. These 
items are applicable to most designs. There are other function- 
specific issues which need addressing when selecting a device; 
these will be discussed in the later sections of the book, when we 
address advanced designs. 

1. Number of Input pins 

2. Number of output pins 

3. Number of I/O pins 

4. Device speed 

5. Device power requirements 

6. Number of registers (If any) 

7. Number of product terms 

8. Output polarity control 

Figure 7. General Device Selection Considerations 

The first resource that must be provided in a PLD is the number 
of pins needed for the basic logic function. This consists of the 
number of input and output pins. Many PLDs have internal 
feedback, which allows the generated output signal to be reused 
as an input. The same feedback also allows the pin to be used 
as a dedicated input, if required. This is especially useful forf itting 
various designs with different input/output requirements on the 
same device. The I/O pin capability of certain PLDs can also be 
very useful forcertain bus applications and will be discussed later. 
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Figure 8 and 9 show the pin resources available on various 
combinatorial and registered PLDs. The task is as simple as 
counting the number of input, output and I/O pins required by the 
design and picking a PLD from the table which has the requisite 
number of pins. We will pick a PAL1 6L8 for our design, since it 
has sufficient pin resources. 



The next selection issue is the device speed. The most important 
timing consideration for combinatorial PLDs is the propagation 
delay (tp^) of signals from the input to the output of the device. For 
registered PLDs, the importanttiming consideration is the device 
clocking frequency. This clocking frequency (shown in Figure 9) 
is in turn determined by sum of the register setup time (t J, and 
clock-to-output propagation delay (t;,^). Most systems impose 
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Figure 10. System Timing Requirements 



some timing restrictions on tiie internal logic functions. Tiiese 
restrictions will determine the necessary tp^ (for combinatorial 
devices) or f^^ (for registered devices). Details on clocking 
frequency are provided on page 2-64. 

In our design example, the PLD will primarily perform address 
decoding. The critical system timing constraint is determined by 
the read/write cycle time of the microprocessor and the memory 
access time available (Figure 10). Most microprocessors allow 
anywhere from 1 to 35 ns for address decoding. That is, 1 0-35 
ns after the address is available, the correct memory chip-select 
signal should be asserted. In our design example the available 
cycle time of 240 ns and memory access time of 220 ns leaves 
barely 10 ns for address decode time. From the table in 
Figure 8, we can check the propagation delay and select the 
appropriate speed of PAL1 6L8 for our design, which is tpp=1 ns. 

We have already briefly discussed the types of applications 
where registers are needed. Sometimes the consideration of 
system timing can affect whether or not registers are needed. 
Devices with registers can hold a signal stable for the long 
durations required by the addressed peripheral or memory. 
However, this slows the initial response or access time of the 
device since the chip select must wait for the setup time before the 
rising edge of the clock cycle. Devices without registers provide 
fast access time but hold the signal valid only as long as the input 
conditions are valid. In most address decoders, the address 
signals are kept asserted by the microprocessor until the read/ 
write cycle is completed. In this case, the registers are not 
required for holding the signals asserted. 

The remaining two general design considerations are the number 
of product terms and output polarity. We will discuss these two as 
we implement the design in the next section. 

Implementing a design 

Implementing a design (Figure 11) requires the creation of a 
design file. This design file is called the PAL device Design 
Specification (PDS) in the PALASM 2 software package. Details 
of PALASM 2 software can be found in section 4 of this handbook. 
The design file contains three types of information. 

1. Basic bookkeeping information 

2. Design syntax 

3. Simulation syntax 



The first section provides documentation information, along with 
the name of the device selected, and the signal names. All of the 
signal names are assigned to pins and are defined as inputs or 
outputs. The Declaration Section contains the signal names in 
the ascending order of pin numbers. Figure 12 shows the pin 
assignments of the signals used in our example. Signals which 
are used as l/Os are assigned as outputs and are used as both 
inputs and outputs in the sum-of-products logic explained later. 

Once the design file is complete, it is then assembled and 
simulated. Once it passes assembly and simulation, the resultant 
JEDECfile is downloaded to a device programmerfor configuring 
the device. 
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Figure 11. Implementing a Design 
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Title M68000_ADDRESS_DECODER 

Pattern P8000 

Revision A 

Author ENGINEER 

Company MMI SUNNYVALE, CALIFORNIA 

DATE 07/21/87 



CHIP DECODER PAL16L8 



NC A23 A22 A21 TOP BOTTOM AS RW NC GND 
OE ROMCSl R0MCS2 RAMCS INIT NC NC NC NC VCC 

Figure 12. Declaration Section of a Design File 



Design Syntax 

As shown in Figure 11, tliere are two options available to the 
designer for expressing the design. The first is through traditional 
Boolean logic equations; the second is through a state machine 
syntax. The Boolean logic equations are the only option for 
combinatorial designs and can also be efficient for some regis- 
tered designs. The Boolean equations can be derived from a 
combination of the functional description, the truth table and/or 
the timing diagrams (Figure 13). The state machine approach is 
ideal for large registered control designs, and can be derived from 
the functional description, state table, state diagram and/or the 
timing diagram (Figure 14). The latter approach can also be used 
with PROSE and PLS devices, and will be discussed in a later 
section on state machine design. 

Boolean Logic Equations 

Boolean equations are used to represent the sum-of-products 
logic form (see page 6-1 for a logic design review). The Boolean 
equations are ideally suited for representing the two-level 
AND-OR logic available in most PLDs. 

A conventional approach to the design is to convert the design 
problem to its discrete logic implementation. Such random SSI 
and MSI logic can be easily implemented in PLDs (see the 
Beginner's Guide, page 2-2). This usually involves converting to 
sum-of-products Boolean logic form. This approach can be a 
chore, and much effort can be saved by implementing a design 
with PLDs in a sum-of-products form right from the start. This 
essentially means that the designer does not have to design 
around the limitations of fixed SSI and MSI functions. A direct 
implementation of a design in sum-of-products form in a PLD can 
also yield a faster circuit. 

Boolean equations can be directly derived from the truth table or 
timing diagram (Figure 1 3). The truth table is used more often in 
simple combinatorial designs. The timing diagram method is 
used more often in registered control designs. We will first 
discuss the truth table method and then discuss the details of the 
timing diagram method. The state-machine-based registered 
design approach will be discussed on page 2-101. 



In addition to specifying the logic function, the Boolean equations 
in the design file help document the design. There is no need to 
draw out an equivalent schematic. This allows design modularity; 
the schematic can just show a blocl< for a particular PLD. 
Separate supporting documentation (the design file) provides the 
details without cluttering the drawing. 

Truth-Table-Based Design 

The requirements for our particular design example can be easily 
converted to a truth table format (Figure 15). This truth table is 
based upon the functional description of the design, and is 
derived from the address map (Figure 3) and the truth table 
(Figure 5). 
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Figure 13. Writing Boolean Logic Equations 
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Figure 14. State Machine Dsecrlptlon 
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Figure 15. Truth Table for the Address Decoder 



There are three additional input signals in this design example. 
The first, RW, is generated by the microprocessor, and distin- 
guishes between read and write cycles. Since the ROM data is 
only for reading, the R0MCS1 and ROMCS2 signals are asserted 
only when RW is high (when the microprocessor attempts to read 
the ROM) and are not asserted for the write cycle. On the other 
hand, RAMCS is generated for both read and write cycles and the 
state of signal RW is "don't care". 

The second additional signal, AS, is the address strobe signal 
generated by the microprocessor, and is asserted only when the 
address lines carry a valid address. All of the chip select signals 
need to be gated with the AS signal to ensure that they are only 
generated for valid addresses, and no spurious chip selects are 
generated. 

The last signal is the INIT signal, which is a system initialization 
signal. This signal is used to initialize the microprocessor for a 
"warm boot," and none of the chip selects is allowed when this 
INIT signal is asserted. 

Writing Boolean equations from the above logic is very straight- 
forward. The output signal names, along with their polarity, are 



assigned to sum-of-product equations, which are based upon 
inputs and their polarities. 
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Figure 16. The Implementation In Boolean Equations 



The equations are derived directly from the truth tables. Each one 
of the AND equations uses up one product term of the device as 
shown in Figure 16. One device selection consideration is to 
ensure that all the outputs have sufficient product terms to 
accommodatethedesiredfunction. The device we have selected 
for this design is the popular PAL16L8D. This device has 
sufficient inputs, outputs, and product terms, and has the requi- 
site speed for the design. 
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This brings us to the issue of output polarity. Suppose we had to 
generate active-HIGH outputs. In that case the output equations 
for the R0MCS1 signal would be : 

ROMCSl = /A23 * /A22 * /A21 * INIT * /AS * RW 

As the PALI 6L8 has active-LOW outputs only, this equation's 
output polarity needs to be inverted to be able to fit the device. 
Using DeMorgan's theorem for Boolean logic we get: 

/ROMCSl = A23 + A22 + A21 + /INIT + AS + /RW 

This equation requires a large number of product terms (six). 
Some signals are efficient and use fewer product terms in their 
true form, v/hilo others are more efficient in their inverted form. 
This poses no problems until the signals exceed the device limit 
of 8 product terms provided by the PALI 6L8 and other standard 
PALdevices. In such cases, an output polarity control is required. 
The PAL1 6RP8 family provides just the control for fitting the 
required number of product terms on the device. The device 
selection issues of product terms and output polarity also apply to 
registered designs. A detailed discussion of output polarity is 
included on page 6-15. 

Timing-Diagram-Based Design 

Until now, we have discussed a PLD design using truth tables as 
the primary design vehicle. In this section we will attempt a design 
using a timing diagram as a design vehicle. 

Earlier in the address decoder design we mentioned the INIT 
signal. This INITsignal is essentially an initialization signal forthe 
entire system, and can be generated by the PAL16L8. Note that 
the PALI 6L8D used for decoding still has several unused outputs 
left, one of which can be used to generate the INIT signal. The 
INIT signal is used internally (via feedback) for disabling the chip 
selects during initialization. Externally it can be used to initialize 
othersystem signals. This INITsignalisgeneratedfromaRESET 
switch connected to the inputs of the PAL16L8 device as shown 
in Figure 17. 

To avoid unwanted initialization, the RESET switch must be 
debounced. That is, we wantthe INITsignal to remain HIGH until 
the switch actually contacts the bottom side. Once the bottom 
side is hit, INIT should be asserted active LOW. Once asserted, 
it should stay LOW and not change until the top side is hit again. 
The timing requirements of the debounce circuitry are shown in 
Figure 1 8. Signals TOP and BOTTOf\/1 are inputs to the program- 
mable logicdevice. These signals are activated when the RESET 
switch touches the top and the bottom contacts, respectively. 
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We can formulate the equations by looking at the timing require- 
ments of the debounce circuitry shown in Figure 18. The idea is 
to identify the key elements of this timing diagram. The arrows in 
Figure 18 show the critical events. The first arrow shows the 
normal state of all the pins when the RESET switch is not 
asserted. Subsequent arrows show each event in the timing of 
the INIT signal, depending upon the movement of the switch. 




BOTTOM 



t, t. is \. t = 
Figure 18. Timing Diagram for the Debounce Switch 

The logic level of the signals at each critical event carries useful 
logic information for deriving Boolean equations. This logic 
information for each event is converted into direct Boolean 
equations as shown in Figure 19 below. For example, at instant 
1 the INITsignal remains HIGH as long as the TOP signal remains 
LOW; this is converted to INIT = /TOP * BOTTOM. 



1. Normal state 

2. Switch travels 

from TOP to BOTTOM 

3. Switch contacts 
BOTTOM 



INIT = /TOP 

INIT = TOP * BOTTOM * INIT 

/INIT = /BOTTOM 



4. Switch travels /INIT = /init * bottom * top 
form BOTTOM to TOP 

5. Normal State Again 

Figure 19. Boolean Logic at Every Instant of Timing 



Since we are working with an active-LOW device, we can com- 
bine the two active-LOW events into one equation: 

/INIT = /BOTTOM 

+ /INIT * BOTTOM * TOP 

Minimizing, this becomes: 

/INIT = /BOTTOM 

+ /INIT * TOP 

This can also be done by way of a truth table and Karnaugh map. 



Figure 17. RESET Switch for System initialization 
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TOP 


BOTTOM 


INIT- 


INIT+ 


1 


1 


1 


1 


1 


1 








1 





1 





1 














1 


1 


1 





1 





1 








1 


X 











X 



Figure 20. Truth Table of INIT logic 

Here TOP or BOTTOM will be LOW if contacted. Note that both 
TOP and BOTTOM can not be contacted at the same time. The 
truth table of Figure 20 yields the Karnaugh map shown in Figure 
21 . Grouping thezeros (because the PAL1 6L8 offers adive-LOW 
outputs) yields the Boolean equation identical to the one derived 
from the timing diagram. 



then checked for an appropriate response. The simulation test 
vectors identify any flaws in the design equations which could 
affect the logical operation of the devices programmed. Thus the 
simulation vectors serve as a design debugging tool. 

Simulation test vectors will eventually make up part of a larger set 
of test vectors called "functional test vectors". These functional 
test vectors are used to exercise a real device after programming 
to identify any individual devices which are defective. Other 
means of identifying defective devices, such as signature 
analysis, are also available. All of these are discussed in detail on 
page 3-128. in this section we will strictly focus on simulation 
vectors. 

Simulation is included in the design file along with the logic 
equations. There is little standardization in these simulation 
expressions among various PLD design software packages, 
although most of them rely on test vectors to exercise the logic. 
PALASM 2 software offers an advanced event-driven capability 
that retains a closer design perspective than simulation vectors. 
Details of PALASM 2 simulation syntax are included on page 
4-137. 



/RESET 



/TOP 
/BOTTOM 

S, 00 01 11 



10 



1 


/i 


5n 


1 


'^0^ 


^ 


^ 


1 



The simulation vectors or events can be directly derived from the 
truth table and the timing diagram of the design. The logic level 
and functions of all signals can be expanded and rewritten in a test 
vector form by the software. For example, the truth table for the 
address decoder example discussed earlier can be easily rewrit- 
ten as shown in Figure 23. 



Figure 21. Karnaugh Map of INIT Signal Logic 

There is essentially no difference between the truth table and 
timing diagram techniques for writing Boolean logic. A careful 
analysis will indicate that we implicitly assumed a truth table in the 
timing diagram example also. Some designers prefer to make a 
separate truth table (at least in the first few PLD designs), while 
others prefer to design directly from timing diagrams. While the 
truth table method allows a more optimal utilization of product 
terms, the timing diagram method is easier to visualize as it 
retains the design perspective, in both cases the logic should be 
minimized by the design software to ensure that the design is 
testable. Refer to page 3-1 08 for a discussion on testability. 

Most experienced designers understand the tradeoffs for device 
selection. They implicitly go through the steps of design concep- 
tualization and device selection, explained earlier. They typically 
draw a block around the logic being designed, with the previous 
knowledge that it would fit a PLD which has sufficient inputs, 
outputs, i/Os and product terms. Now that we have selected a 
device and implemented the design, the next step is design 
simulation. 

Simulation 

Design simulation is an integral part of the design process, as 
shown in Figure 22. The purpose is to exercise all of the inputs 
and test the response of outputs to verify that they will work as 
desired in the system. These are essentially test vectors which 
designate the state of every input on the device; the outputs are 



ASSEMBLE 
DESIGN FILE 



SIMULATE THE 

DESIGN WITH 

SIMULATION VECTORS 



FIX 
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DOWNLOAD JEDEC 

FILE TO THE DEVICE 

PROGRAMMER 



PROGRAM PLD 
FUSE MATRIX 



VERIFY FUSE MATRIX 



T=^- 



PROGRAM 
THE PLD 



Figure 22. Device Simuiation and Programming 
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Figure 23. Truth Table Used to Derive Simulation Vectors 



These are essentially tlie simulation vectors which will allow us to 
define the inputs to the device and checl< the outputs of the 
device. The PALASM 2 syntax for these simulation vectors is 
shown in Figure 24. 

There is a direct relationship between the truth table vectors of 
Figure 23 and the simulation shown in Figure 24. The simulator 
then interprets the design file and generates the output logic 
levels and/or waveforms, which can be checked by the designer. 



Once the simulation is complete, the design file can be as- 
sembled to generate the JEDEC file. In the preceding discus- 
sions we have assumed prior l<nowledge of the design file 
assembly. The procedure for assembly varies with different 
software packages and is covered for the PALASM 2 software 
package in the Beginner's Guide (page 2-6). 
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SETF /A23 /A22 /A21 /TOP BOTTOM AS RW 
CHECK ROMCSl R0MCS2 RAMCS INIT 

SETF /AS ; READ CYCLE VALID ROMCSl ADDRESS 

CHECK ROMCSl R0MCS2 RAMCS INIT 

SETF AS 

SETF /RW /AS ; WRITE TO ROMCSl NOT POSSIBLE 

CHECK ROICSl R0MCS2 RAMCS INIT 

SETF RW AS 

SETF /A23 /A22 A21 ; SET UP R0MCS2 ADDRESS 

SETF /AS ; READ CYCLE VALID R0MCS2 ADDRESS 

CHECK ROMCSl R0MCS2 RAMCS INIT 

SETF AS 

SETF /RW /AS ; WRITE TO R0MCS2 NOT POSSIBLE 

CHECK ROMCSl R0MCS2 RAMCS INIT 

SETF RW AS 

SETF /A23 A22 /A21 ; SET UP RAMCS ADDRESS 

SETF /AS ; READ CYCLE VALID RAMCS ADDRESS 

CHECK ROMCSl R0MCS2 RAMCS INIT 

SETF AS 

SETF /RW /AS ; WRITE CYCLE VALID RAMCS ADDRESS 

CHECK ROMCSl R0MCS2 RAMCS INIT 

SETF RW AS 

SETF /A23 A22 A21 ; SET UP RAMCS ADDRESS 

SETF /AS ; READ CYCLE VALID RAMCS ADDRESS 

CHECK ROMCSl R0MCS2 RAMCS INIT 

SETF AS 

SETF /RW /AS ; WRITE CYCLE VALID RAMCS ADDRESS 

CHECK ROMCSl R0MCS2 RAMCS INIT 

SETF RW AS 

SETF A23 /A22 /A21 ; SET UP RAMCS ADDRESS 

SETF /AS ; READ CYCLE VALID RAMCS ADDRESS 

CHECK ROMCSl R0MCS2 RAMCS INIT 

SETF AS 

SETF /RW /AS ; WRITE CYCLE VALID RAMCS ADDRESS 

CHECK RCMCSl R0MCS2 RAMCS INIT 

SETF RW AS 

SETF A23 /A22 A21 ; SET UP RAMCS ADDRESS 

SETF /AS ; READ CYCLE VALID RAMCS ADDRESS 

CHECK ROMCSl R0MCS2 RAMCS INIT 

SETF AS 

SETF /RW /AS ; WRITE CYCLE VALID RAMCS ADDRESS 

CHECK ROMCSl R0MCS2 RAMCS INIT 

SETF RW AS 

SETF TOP ; RESET SWITCH TRAVELING DOWN 

SETF /BOTTOM ; RESET HITTING BOTTOM 

CHECK ROMCSl R0MCS2 RAMCS INIT 

SETF BOTTOM ; RESET TRAVELING TO TOP 

CHECK ROMCSl R0MCS2 RAMCS INIT 

SETF TOP ; RESET HITTING TOP 

CHECK ROMCSl R0MCS2 RAMCS INIT 

Figure 24. PALASM 2 Software Simulation for Address Decoder 
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I 
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Figure 25. Device Programming and Testing 



Device Programming and Testing 

Once tiie design simulation is completed, thie final step is device 
programming and testing (Figure 25). Programmers are avail- 
able from a variety of vendors. It is important to note that 
Monolitliic Memories qualifies programmers upon verifying ttiat 
tiie algorithms used by the programmers, arc correct and that 
other basic criteria are met. When purchasing a programmer, 
check that the programmer is qualified for the devices you intend 
to use. 

There are two types of programmer available; menu-driven or 
device code based. The menu-driven programmer directly indi- 
cates the part type being programmed, whereas the latter type 
requires the user to enter the device code before programming. 



Once the JEDECfusefile has been downloaded, the programmer 
can program the device; the PLD is then ready for use. The 
programmer also verifies the connections after the programming 
cycle. Programmers also provide the capability of reading a 
previously programmed device and creating duplicates of that 
device. 

Testing PLDs 

The testing of PLDs can be performed by the device programmer 
or by other test equipment. For a manufacturing environment 
where high yields are required, device testing is critical. This 
subject merits a separate discussion and has been included on 
page 3-128. After testing is complete, the device security cells 
may be programmed if desired to secure the design from copying. 
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The resulting Boolean equations are: 



ci = /A * B * /c * /D 
+ /A * /B * /C * D 



Combinatorial Logic Design 



In this section we will take a detailed look at several aspects of 
combinatorial logic design. Most combinatorial design applica- 
tions can be easily segmented into five major fields. 

• Encoders and Decoders 

• Multiplexers 

• Comparators 

• Adders and Arithmetic Logic 

• Latches 

We will not only focus on the design methodology for these func- 
tions, but will also explore further function-specific PLD selection 
requirements. Generalized designs will be developed, which can 
be customized later to suit specific system applications. Ways of 
optimizing the design will also be discussed. 

Encoders and Decoders 

Two of the most important functions required in digital design are 
encoding and decoding. The encoding and decoding of data are 
used extensively in digital communications as well as in periph- 
erals. Both these areas use various complex encoding and de- 
coding techniques, which are described in more detail on pages 
2-357 and 2-507. Most of these techniques are extensions of the 
simple encoding and decoding techniques often used in other 
digital designs. In this discussion we will focus on simple encod- 
ing and decoding techniques. 

Encoders 

A binary code of n bits can be used to represent 2" distinct pieces 
of coded data. A simple combinatorial encoder is a circuit which 
generates n bits of output information based upon one of the 2" 
unique pieces of input data information. This encoding of infor- 
mation is controlled by other independent control signals in a 
typical digital circuit. 

An illustration of a typical encoder is shown in Figure 1. The 
design methodology typically followed is based on truth tables 
(Figure 2), from which the Boolean equations are directly derived 
for the design. The same genericdevice selection considerations 
discussed in the section on PAL device design methodology 
apply for encoder and decoder designs. 
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Figure 2. Truth Table of a Typical Encoder 

The Boolean equations can then be optimized using Karnaugh 
maps or the PALASM 2 software minimizer. 



CO = /A 
+ /A 



/B 

/B 



c 
/c 



/D 
D 



Co 



ENCODED 
OUTPUTS 



The complete PALASM 2 design file (including the simulation) for 
this simple encoder design is shown in Figure 3. 

A Priority Encoder 

Let us take another look at the encoder example of Figure 2. In 
this example it is assumed that only one of the inputs A, B, C or 
D is asserted HIGH at any one time. If two of the inputs are 
asserted HIGH simultaneously, aconflict would be created. To re- 
solve this, a priority needs to be assigned to each of the inputs. 
Such a priority assignment is used to select a particular element 
when several inputs are asserted simultaneously. Each input is 
assigned a priority with respect to the other inputs. The output 
code generated is the code assigned to the highest priority 
input asserted. 

Thus, a priority encoder is a combinatorial circuit block similar to 
a general encoder, except that the inputs are assigned a priority. 
Such priority encoders are used often in state machine applica- 
tions, where they detect the occurrence of the highest priority 
event. They are also used for microprocessor interrupt control- 
lers, where they detect the highest priority interrupt. Another use 
for priority encoders is in bus control, where they are used in 
arbitration schemes for allowing selective access to the bus. 

The model of a priority encoder is shown in Figure 4. The four 
input signals are A, B, C and D. These are to be encoded as LL, 
LH, HL and HH outputs. Let us assign priority to D over C, C over 
B, and B over A. The next design step would be to modify the truth 
table (Figure 5) to reflect these priorities. 



Figure 1. A Block Diagram of an Encoder 
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TITLE 


SIMPLE ENCODER 




PATTERN 


ENCODE. PDS 




REVISION 


01 




AUTHOR 


JOE ENGINEER 




COMPANY 


MONOLITHIC MEMORIES, 


INC 


DATE 


8/20/87 





This is an example of a simple encoder. The output CI and CO 
are generated based upon inputs A, B, C and D. The function 
truth table is following: 



B 



CHIP ENCODER PAL16H2 



A B C D NC NC NC NC NC GND 
NC NC NC NC CO CI NC NC NC VCC 



EQUATIONS 




CI = /A * B * /C * 


/D 


+ /A * /B * /C * 


D 


CO = /A * /B * C * 


/D 


+ /A * /B * /C * 


D 


SIMULATION 




TRACE ON 




SETF A /B /C /D 




CHECK /CO /CI 




SETF /A B /C /D 




CHECK /CO CI 




SETF /A /B C /D 




CHECK CO /CI 




SETF /A /B /C D 




CHECK CO CI 





CO 



CI 



1 











L 


L 





1 








L 


H 








1 





H 


L 











1 


H 


H 



Figure 3. PALASM 2 File for a Simple Encoder 
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Figure 4. A Four-input Priority Encoder Blocl< Diagram 



Figure 5. Priority Encoder Trutii Table 
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The Boolean equations, directly derived from the truth table, are: 



ci = /A 
+ /A 

+ 
+ 



B 

/B 

B 



/C 
/C 
/C 



/D 
D 

/D 
D 



CO 



/A 
/A 



/B 
/B 



C 

/C 

C 



/D 
D 

/D 
D 



These equations can be further optimized by the design software 
to the following: 



ci = D + /c 
CO = D + c 



Although a priority encoder is a purely combinatorial function, 
output registers are frequently used to hold the output signal 
stable for longer durations. An example of a registered 1 6-input 
priority encoder is shown in Figure 6. 



TITLE 


16 INPUT REGISTERED PRIORITY ENCODER 


PATTERN 


P7090 


REVISION 


01 


AUTHOR 


J. ENGINEER 


COMPANY 


MONOLITHIC MEMORIES, INC. 


DATE 


8/20/87 


CHIP IN_PRTY 


PAL20R4 


; DESCRIPTION 





THE 16 INPUT REGISTERED PRIORITY ENCODER ACCEPTS SIXTEEN ACTIVE-LOW INPUTS 
(10-115) TO LOAD THE BINARY WEIGHTED CODE OF THE PRIORITY ORDER INTO THE 
OUTPUT REGISTER (03-QO) ON THE RISING EDGE OF THE CLOCK (CLK). A PRIORITY 
IS ASSIGNED TO EACH INPUT SO THAT WHEN TWO INPUTS ARE SIMULTANEOUSLY 
ACTIVE, THE INPUT WITH THE HIGHEST PRIORITY IS LOADED INTO THE OUTPUT 
REGISTER. THEREFORE THE HIGHEST PRIORITY INPUT (IO=H) PRODUCES HHHH IN 
THE OUTPUT REGISTER AND THE LOWEST PRIORITY INPUT (I15=H) PRODUCES LLLL IN 
THE OUTPUT REGISTER. 



OPERATIONS TABLE 



/OC CLK 115-10 Q3-Q0 



OPERATION 



H 


X 


> 




Z 


HI-Z 




L 


c 


10 


=H 


15 


10 


INTERRUPT 


(HIGHEST PRIORITY INPUT) 


L 


c 


11 


=H 


14 


11 


INTERRUPT 




L 


c 


11 


=H 


13 


12 


INTERRUPT 




L 


c 


11 


=H 


12 


13 


INTERRUPT 




L 


c 


11 


=H 


11 


14 


INTERRUPT 




L 


c 


11 


=H 


10 


15 


INTERRUPT 




L 


c 


11 


=H 


9 


16 


INTERRUPT 




L 


c 


11 


=H 


8 


17 


INTERRUPT 




L 


c 


11 


=H 


7 


IB 


INTERRUPT 




L 


c 


11 


=H 


6 


19 


INTERRUPT 




L 


c 


11 


=H 


5 


110 


INTERRUPT 




L 


c 


11 


=H 


4 


111 


INTERRUPT 




L 


c 


11 


=H 


3 


112 


INTERRUPT 




L 


c 


11 


=H 


2 


113 


INTERRUPT 




L 


c 


11 


=H 


1 


114 


INTERRUPT 




L 


c 


11 


=H 





115 


INTERRUPT 


(LOWEST PRIORITY INPUT) 



CLK 10 II 12 13 14 15 16 17 IB 19 GND 
/OC 110 111 112 03 Q2 Ql QO 113 114 115 VCC 



EQUATIONS 



/QO := /lO* II 

+ /I0*/I1*/I2* 13 

+ /I0*/I1*/I2*/I3*/I4* 15 

+ /I0*/I1*/I2*/I3*/I4*/I5*/I6* 17 

+ /I0*/I1*/I2*/I3*/I4*/I5*/I6*/I7*/I8* 19 

+ /I0*/I1*/I2*/I3VI4*/I5*/I6VI7*/I8*/I9*/I10* 111 

+ /IO*/I1*/I2VI3*/I4*/I5*/I6*/I7*/I8*/I9*/I10*/I11*/I12* 113 

+ /I0*/I1*/I2*/I3*/I4*/I5*/I6*/I7*/I8*/I9*/I10*/I11*/I12*/I13*/I14*I15 



Figure 6. A 16-lnput Registered Priority Encoder Design File 
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/Ql : = 



/Q2 : = 



/Q3 



10*/] 


[1* ] 


2 




10*/] 


[1*/] 


2* ] 


3 


10*/] 


[1*/] 


2*/] 


3*/I 


10*/] 


[1*/] 


2*/] 


3*/I 


10*/ 


[1*/] 


2*/] 


3*/I 


10*/ 


[1*/] 


2*/] 


3*/I 


10*/] 


[1*/] 


2*/] 


3*/I 


10*/] 


[1*/] 


2*/] 


3*/I 


10*/] 


[1*/] 


2*/] 


3* I 


10*/ 


[1*/] 


2*/] 


3*/I 


10*/] 


tl*/] 


2*/] 


3*/I 


10*/ 


tl*/] 


2*/] 


3*/I 


10*/ 


[1*/] 


2*/] 


3*/I 


10*/] 


[1*/] 


2*/] 


3*/I 


10*/] 


[1*/] 


2*/] 


3*/I 


10*/ 


[1*/] 


2*/] 


3*/I 


10*/ 


[1*/] 


2*/] 


3*/I 


10*/ 


[1*/] 


2*/] 


3*/I 


10*/ 


[1*/] 


2*/] 


3*/I 


10*/ 


[1*/] 


2*/] 


3*/I 


10*/] 


[1*/] 


2*/] 


3*/I 


10*/] 


[1*/] 


2*/] 


3*/I 


10*/ 


[1*/] 


2*/] 


3*/I 


10*/ 


[1*/] 


2*/] 


3*/I 



4*/I5* 16 
4*/I5*/I6* 17 

4*/I5*/I6*/I7*/I8*/I9* 110 
4*/I5*/I6*/I7*/I8*/I9*/I10* 111 

4*/I5*/I6*/I7*/I8*/I9*/I10*/Ill*/I12*/I13* 114 
4*/I5*/I6*/I7*/I8*/I9*/I10*/Ill*/I12*/I13*/I14*I15 

4 

4* 15 

4*/I5* 16 

4*/I5*/I6* 17 

4*/I5*/I6*/I7*/I8*/I9*/X10*/Ill* 112 

4*/I5*/I6*/I7*/I8*/I9*/I10*/Ill*/I12* 113 

4*/I5*/I6*/I7*/I8*/I9*/I10*/Ill*/I12*/I13* 114 

4*/I5*/I6*/I7*/I8*/I9*/I10*/Hl*/I12*/I13*/I14*I15 

4*/I5*/I6*/I7* 18 
4*/I5*/I6*/I7*/I8* 19 
4*/I5*/I6*/I7*/I8*/I9* 110 
4*/I5*/I6*/I7*/I8*/I9*/I10* 111 
4*/I5*/I6*/I7*/I8*/I9*/I10*/Ill* 112 
4*/I5*/I6*/I7*/I8*/I9*/I10*/Ill*/I12* 113 
4*/I5*/I6*/I7*/I8*/I9*/I10*/Ill*/I12*/I13* 114 
4*/I5*/I6*/I7*/I8*/I9*/I10*/Ill*/I12*/I13*/I14*I15 



; SIMULATION NOT INCLUDED HERE 



Figure 6. A 16-lnput Registered Priority Encoder Design File (Cont'd.) 
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INPUT SELECT LINES 



OUTPUT LINES 



QO Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9 Q10 Q11 Q12 Q13 Q14 Q15 



111111111111111 
10 11111111111111 
110 1111111111111 
1110 111111111111 
11110 11111111111 
111110 1111111111 
1111110 111111111 
11111110 11111111 
111111110 1111111 
1111111110 111111 
11111111110 11111 
111111111110 1111 
1111111111110 111 
11111111111110 11 
111111111111110 1 
1111111111111110 



Figure 7. The Truth Table of an Active-LOW 4-to-16 Decoder 



Decoders 



A decoder performs the reverse function of an encoder, it 
converts an n-bit code to one of its 2" unique items. It is a 
combinatorial circuit designed such that at most one of its several 
outputs will be asserted based upon the unique input codes. 

Adecoder may have as many outputs as there are possible binary 
input selection combinations. As shown in the truth table 
(Figure 7), only one output may be asserted at any time. When 
a new combination is applied, another output is asserted and the 
original output is returned to its non-asserted state. 

The Boolean logic equations can be directly derived from the truth 
table shown in Figure 7. The procedure is the same as explained 
in the previous section on PLD design methodology on page 2-27. 
The Boolean equations derived are shown in Figure 8. 
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/Q7 


= 


/D 
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/Q8 
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/B * /A 
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/c 
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/QIO 
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B * /A 


/Qll 
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/c 
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B * A 


/Q12 


■= 
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■k 


/B * /A 


/Q13 
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/B * A 


/Q14 
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■k 


B * /A 


/Q15 
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* 


c 


■k 


B * A 



Notice from the truth table that there is no combination of inputs 
that will send all the outputs to their non-asserted state. Many 
designs actually need to be able to make all outputs inactive. This 
can be done simply by putting enable lines in all of the output AND 
gates. Many such design modifications can be easily added once 
the basic Boolean equations have been derived, instead of 
redoing the truth table. Figure 9 shows the PALASM 2 software 
design file of one such 4-to-1 6 decoder with the enable function. 

Probably the most commonly used decoders are the address 
decoders required by most microprocessors and bus interfaces. 
These also constitute the most common application of PLDs in 
digital designs. The design considerations for address decoders 
have been covered earlier in the PLD Design Methodology 
section, page 2-23. Later we will develop a general Boolean 
equation for an address decoder circuit when we discuss range 
decoders. 



Figure 8. Decoder Boolean Logic Equations 
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Title 4tol6 Decoder 

Pattern 4-16DEC.PDS 

Revision A 

Author Mehrnaz Hada 

Company Monolithic Memories, Santa Clara, CA 

Date 1/9/85 

CHIP Decoder PAL6L16 



QO Ql Q2 A B C D ENl EN2 Q3 Q4 GND 
Q5 Q6 Q7 Q8 Q9 QIO Qll Q12 Q13 Q14 Q15 VCC 

;The 4 to 16 decoder decodes four binary encoded inputs 
;into one of 16 mutually exclusive outputs, whenever the 
;two enable lines ENl and EN2 are high. When one or both 
;of the enable lines are low the outputs are all set to 
;high values. 



EQUATIONS 



/QO 

/Ql 

/Q2 

/Q3 

/Q4 

/Q5 

/Q6 

/Q7 

/OS 

/Q9 

/QIO 

/Qll 

/Q12 

/Q13 

/Q14 

/Q15 



/DVC 

/D*/C 

/D*/C 

/DVC 

/D* C 

/D* C 

/D* C 

/D* C 

D*/C 

D*/C 

D*/C 

D*/C 

D* C 

D* C 

D* C 

D* C 



*/B*/A* 
*/B* A* 

* B*/A* 

* B* A* 
*/B*/A* 

/B* A* 

B*/A* 

B* A* 

*/B*/A* 

/B* A* 

B*/A* 

B* A* 

*/B*/A* 

*/B* A* 

* B*/A* 

* B* A* 



ENl* EN2 
ENl* EN2 
ENl* EN2 
ENl* EN2 
ENl* EN2 
ENl* EN2 
ENl* EN2 
ENl* EN2 
ENl* EN2 
ENl* EN2 
ENl* EN2 
ENl* EN2 
ENl* EN2 
ENl* EN2 
ENl* EN2 
ENl* EN2 



Decode 
Decode 
Decode 
Decode 
Decode 
Decode 
Decode 
Decode 
Decode 
Decode 
Decode 
Decode 
Decode 
Decode 
Decode 
Decode 



0000 
0001 
0010 
0011 
0100 
0101 
0110 
0111 
1000 
1001 
1010 
1011 
1100 
1101 
1110 

1111 



SIMULATION 



TRACE ON 



D B C A QO Ql Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9 QIO 
Qll Q12 Q13 Q14 Q15 



SETF /D /C /B /A ENl EN2 

SETF A 

SETF B 

SETF C 

SETF D 

SETF /D 

SETF /C 

SETF /B 

SETF /A 

SETF /ENl 

SETF ENl /EN2 
SETF /ENl 



;Set outputs to high 

;Set outputs to high 
;Set outputs to high 



TRACE OFF 



Figure 9. Design File of Four-to-Slxteen Decoder with Enable 



Encoder/Decoder Device Selection Considerations 

The general device selection considerations are listed in 
Figure 10. Based upon the number of inputs and outputs 
required, a device can be selected from the table in Figure 11 for 
combinatorial devices or Figure 12 for registered devices. 



Number of Input Pins 
Number of Output Pins 
Number of I/O Pins 
Device Speed 

Device Power Requirements 
Number of Registers 
Number of Product Terms 
Output Polarity Control 



Figure 10. General Encoder/Decoder Device Selection 
Considerations 
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DEVICE 


TOTAL 


DEDICATED 


DEDICATED 


I/O 


LOGICAL 


SPEED GRADES 




NUMBER 


INPUT 


OUTPUT 


PINS 


PRODUCT 


tp„ IN ns 




OF PINS 


PINS 


PINS 




TERMS/ 
OUTPUT 




TTL 


10H8 


20 


10 


8 


. 


2 


35 


10L8 


20 


10 


8 


- 


2 


35 


12H6 


20 


12 


6 


- 


2-4 


35 


12L6 


20 


12 


6 


- 


2-4 


35 


14H4 


20 


14 


4 


- 


4 


35 


14L4 


20 


14 


4 


- 


4 


35 


16H2 


20 


16 


2 


- 


8 


35 


16L2 


20 


16 


2 


- 


8 


35 


16C1 


20 


16 


2 


- 


16 


40 


16L8 


20 


10 


2 


6 


7 


10.15,25.35 


16P8 


20 


10 


2 


6 


7 


25 


16RA8 


20 


8 


- 


8 


4 


30 


18P8 


20 


10 


- 


8 


8 


15,25 


6L16 


24 


6 


16 


. 


1 


25 


8L14 


24 


8 


14 


- 


1 


25 


12L10 


24 


12 


10 


. 


2 


40 


14L8 


24 


14 


8 


- 


2-4 


40 


16L6 


24 


16 


6 


- 


2-4 


40 


18L4 


24 


18 


4 


- 


4-6 


40 


20L2 


24 


20 


2 


- 


8 


40 


20C1 


24 


20 


2 


- 


16 


40 


20L8 


24 


14 


2 


6 


7 


15,25,35 


20L10 


24 


12 


2 


8 


3 


15,20,25,30 


20RA10 


24 


10 


- 


10 


4 


20,30 


20S10 


24 


12 


2 


8 


8-16 


35 


22P10 


24 


12 


- 


10 


8 


15.25 


22XP10 


24 


12 


- 


10 


8 


20, 30. 40 


22RX8 


24 


14 


- 


8 


8 


25 


22V10 


24 


12 


- 


10 


8-16 


15.25,35 


32VX10 


24 


12 


- 


10 


8-16 


25,30 


CMOS 


C16L8 


20 


10 


2 


6 


7 


25 


C20L8 


24 


14 


2 


6 


7 


35.45 


C22V10 


24 


12 


- 


10 


8-16 


25. 35 


C29M16 


24 


5 


- 


16 


8-16 


35,45 


C29MA16 


24 


5 


- 


16 


4-12 


35.45 


ECL 


10H20P8 


24 


12 


- 


8 


4-8 


6 



Figure 11. Combinatorial Programmable Logic Devices 
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DEVICE 


TOTAL 


DEDICATED 


DEDICATED 


I/O 


LOGICAL 


SPEED GRADES 




NUMBER 


INPUT 


REGISTERED 


PINS 


PRODUCT 


FREQUENCY IN 




OF PINS 


PINS 


OUTPUT 
PINS 




TERMS/ 
OUTPUT 


MHz 
'max 


TTL 


16R8 


20 


8 


8 


. 


8 


55.5,37.25,16 


16R6 


20 


8 


6 


2 


8 


55.5,37.25,16 


16R4 


20 


8 


4 


4 


8 


55.5,37.25,16 


16RP8 


20 


8 


8 


- 


8 


22.2 


16RP6 


20 


8 


6 


2 


8 


22.2 


16RP4 


20 


8 


4 


4 


8 


22.2 


16RA8 


20 


8 


0-8 


8-0 


4 


20 


16X4 


20 


8 


4 


4 


8 


14 


23S8 


20 


9 


4 


4 


8-12 


33.3, 28.5 


20R8 


24 


12 


8 


, 


8 


37.25.16 


20R6 


24 


12 


6 


2 


8 


37.25.16 


20R4 


24 


12 


4 


4 


8 


37.25,16 


20X10 


24 


10 


10 


- 


4 


22.2 


20X8 


24 


10 


8 


2 


4 


22.2 


20X4 


24 


10 


4 


6 


4 


22.2 


20RP10 


24 


10 


10 


. 


8 


25,37 


20RP8 


24 


10 


8 


2 


8 


25,37 


20RP6 


24 


10 


6 


4 


8 


25,37 


20RP4 


24 


10 


4 


6 


8 


25.37 


20XRP10 


24 


10 


10 


- 


8 


30,22.2,14 


20XRP8 


24 


10 


8 


2 


8 


30,22.2,14 


20XRP6 


24 


10 


6 


4 


8 


30,22.2,14 


20XRP4 


24 


10 


4 


6 


8 


30,22.2,14 


20RS10 


24 


10 


10 


. 


8-16 


19.2 


20RS8 


24 


10 


8 


2 


8-16 


19.2 


20RS4 


24 


10 


4 


6 


8-16 


19.2 


20RA10 


24 


10 


0-10 


10-0 


4 


33,20 


22RX8 


24 


14 


0-8 


8-0 


8 


28.5 


22V10 


24 


12 


0-10 


10-0 


8-16 


40,28.5,18 


32VX10 


24 


12 


0-10 


10-0 


8-16 


25, 22.2 


32R16 


40 


16 


16 


- 


8-16 


16 


CMOS 


G16R8 


20 


8 


8 


. 


8 


28.5 


C16R6 


20 


8 


6 


2 


8 


28.5 


C16R4 


20 


8 


4 


4 


8 


28.5 


C20R8 


24 


12 


8 


- 


8 


20, 15.3 


C20R6 


24 


12 


6 


2 


8 


20, 15.3 


C20R4 


24 


12 


4 


4 


8 


20, 15.3 


C22V10 


24 


12 


0-10 


10-0 


8-16 


33.3, 20 


C29M16 


24 


5 


. 


16 


8-16 


20,15 


C29MA16 


24 


5 


- 


16 


4-12 


20,15 


ECL 


10H/020EV/G8 


24 


12 


0-8 


8-0 


8-12 


125 


10H20G8 


24 


12 


0-8 


8-0 


4-8 


NA 


(Latched) 















Figure 12. Registered Programmable Logic Devices 
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Encoders typically require a large number of inputs and fewer 
outputs, whereas decoders typically require a large number of 
outputs and fewer inputs. We can see from these tables that the 
PAL device for the priority encoder application (Figure 6) should 
be the PAL20R4 and for the decoder application (Figure 7) it 
should be the PAL6L16. 

Another important device selection consideration for encoders 
and decoders is the number of product terms required for a 
design. A careful selection of code values (and priority assign- 
ments in priority encoders) can often reduce the required number 
of product terms. This can sometimes determine whether or not 
a design fits a device successfully. Figure 13 shows the truth 
tables of two simple partial 3-to-2 encoders. The product terms 
required for the two designs are different due to the different 
assignment of encoded bits. 



INPUTS 


OUTPUTS 


INPUTS 


OUTPUTS 


ABC 


X, x„ 


ABC 


X, X, 


1 
1 
1 




1 

1 


1 
1 
1 


1 

1 

1 1 



XI = /A * /B * C 



XO = /A * B * /C 



XI 


= 


/A 


* 


B 


* 


/C 




+ 


/A 


* 


/B 


* 


c 


XO 


= 


A 


* 


/B 


* 


/c 




+ 


/A 


* 


/B 


* 


c 



Figure 13. Two Encoders With Different Product Term 
Requirements 

Another way of looking at a decoder is as a logic function which, 
depending upon the select code applied, connects one data input 
to the selected outputs. Also known as a Demultiplexer, a 
decoder essentially connects an input to one of 2" outputs based 
upon n select code bits. The reverse logic function, which 
combines data from multiple sources to an output signal, is called 
a multiplexer and is discussed next. 



Multiplexers 

A multiplexer (sometimes referred to as a data selector) is a 
special combinatorial circuit, widely used in digital design. It is 
designed to gate one of several inputs to a single output. The 
input selected for connection to the output is controlled by a 
separate set of select inputs. 

The traditional use of a multiplexer is for "time division multiplex- 
ing" in data communication, when gating several data lines to a 
single data transmission line for short intervals of time. The data 
received is then demultiplexed by using a demultiplexer, de- 
scribed earlier. 

The design methodology employed for multiplexer design is the 
truth-table approach. As an example, we can look at a three- 
input-to-one-output (3:1) multiplexer, which uses two select sig- 
nals A and B. Based on these two select bits, the data on one of 
the three inputs is sent to the output. The truth table is shown in 
Figure 14. 



SELECT 




INPUTS 




OUTPUT 


B A 


11 CO 


11 C1 


11 C2 


01 Y 








X 


X 








1 


X 


X 


1 


1 


X 





X 





1 


X 


1 


X 


1 


1 


X 


X 








1 


X 


X 


1 


1 



Figure 14. Truth Table for a Three-to-One Multiplexer 

Deriving the Boolean equation from this truth table is a straight- 
forward task, in this case no further minimization is possible. The 
Boolean equation is: 



/OlY 


= 


/B 


* 


/A 


* 


/IICO 




+ 


/B 


* 


A 


* 


/IlCl 




-1- 


B 


* 


/A 


* 


/I1C2 



The complete PALASf^ 2 design file for this design example 
showing four such three-to-one multiplexers is shown in 
Figure 15. 
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TITLE 

PATTERN 

REVISION 

AUTHOR 

COMPANY 

DATE 

DESCRIPTION 



QUAD 3:1 MULTIPLEXER 

P7074 

01 

COLI/VOLPIGNO 

MONOLITHIC MEMORIES, INC. 

8/20/87 



THIS IS AN EXAMPLE OF A QUAD 3-TO-l MULTIPLEXER USING A PAL14L4. SELECT 
LINES A,B ARE ENCODED IN BINARY, WITH A REPRESENTING THE LSB. THE OUTPUTS 
(Y) ARE ALL HIGH IF THE SELECT LINES ARE BOTH HIGH (B,A=H), 

OPERATIONS TABLE: 



INPUT 


OUTPUTS 


SELECT 




B A 


Y 


L L 


CO 


L H 


CI 


H L 


C2 


H H 


H 



CHIP QUAD_MUX PAL14L4 

IICO IlCl I1C2 I2C0 I2C1 I2C2 I3C0 I3C1 I3C2 GND 
I4C0 I4C1 I4C2 04Y 03Y 02Y OlY B A VCC 

EQUATIONS 

/OlY = /B*/A * /IICO 
+ /B* A * /IlCl 
+ B*/A * /I1C2 

/02Y = /B*/A * /I2C0 
+ /B* A * /I2C1 
+ BVA * /I2C2 

/03Y = /B*/A * /I3C0 
+ /B* A * /I3C1 
+ B*/A * /I3C2 



/04Y 



/B*/A * /I4C0 

/B* A * /I4C1 

B*/A * /I4C2 



[SELECT INPUT ICO 

[SELECT INPUT ICl 

; SELECT INPUT 1C2 

SELECT INPUT 2C0 

SELECT INPUT 2C1 

SELECT INPUT 2C2 

SELECT INPUT 3C0 

SELECT INPUT 3C1 

SELECT INPUT 3C2 

[SELECT INPUT 4C0 

[SELECT INPUT 4C1 

[SELECT INPUT 4C2 



; SIMULATION NOT INCLUDED HERE 

Figure 15. Quad 3:1 Multiplexer 
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The equations derived in the above example can be easily 
generalized for other multiplexers. The symbol for a general 
2"-inputs-to-one-output multiplexer is shown in Figure 16 where 
n select lines are used. 




So Si Sn-1 

405 15 

INPUT SELECT LINES 

Figure 16. General Model of a 2"-to-1 Multiplexer 

The Boolean equations are: 

n=2 

Y = /S1*/S0*(I0) 

+ /SI* SO* (II) 

+ S1*/S0*(I2) 

+ SI* SO* (13) 



Y = /S2*/S1*/S0*(I0) 

+ /S2*/S1* SO* (II) 

+ /S2* S1*/S0*(I2) 

+ /S2* SI* SO* (13) 

+ S2*/S1*/S0*(I4) 

+ S2*/S1* SO* (15) 

+ S2* S1*/S0*(I6) 

+ S2* SI* SO* (17) 

Multiplexer Device Selection Considerations 

Multiplexers typically require more inputs than outputs, so the 
devices with a large number of inputs and l/Os are usually more 
useful. Careful consideration must also be given to the number 
of product terms available on each output. 



Several multiplexers are often used simultaneously to route 
multiple address and data bits, under the control of the same 
select lines. In such cases, multiple devices can be cascaded 
when the number of inputs and outputs exceeds device limits. 
Cascading is also possible for large multiplexers that do not fit in 
a single device. In such cases, the select bits should also be 
judiciously selected for each PLD, to minimize the number of 
product terms. 

Another common trick for designing a multiplexer is to connect a 
number of outputs together and control the output enables using 
the select bits to multiplex data. This multiplexing technique is 
used extensively in DRAM controllers for multiplexing address 
signals. Tming considerations for such designs include the 
output enable and disable times, which should be carefully 
selected to avoid output contentions. 

Comparators 

A comparator is a combinatorial circuit designed primarily to 
compare the relativemagnitudeoftwobinarynumbers. Figure 17 
shows the truth table for a two-bit comparator. 



INPUTS 


OUTPUTS 




A B 


EQL LES GTR 


A. 


A, B, 


B, 


A=B A<B A>B 











1 








1 


1 





1 





1 





1 


1 


1 





1 





1 





1 


1 


1 





1 1 





1 





1 1 


1 


1 










1 







1 


1 




1 





1 




1 


1 


1 




1 





1 




1 


1 


1 




1 1 
1 1 




1 


1 

1 



Figure 17. Truth Table for a Comparator 

A basic comparator compares two numbers only for equality, 
and generates the EQL signal (indicating A=B). An extension, 
called a magnitude comparator, also generates the LES signal 
(indicating A<B) and GTR signal (indicating A>B). Based on this 
truth table, the equations for the three output signals EQL, LES 
and GTR can be easily derived. These equations can then be 
optimized by using Boolean algebra, Karnaugh maps or the 
minimization routine available with PALASM 2 software. 
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The final Boolean equations are: 

EQL = /A2 * /Al * /B2 * /Bl 

+ /A2 * Al * /B2 * Bl 

+ A2 * /Al * B2 * /Bl 

+ A2 * Al * B2 * Bl 



LES 



/A2 * /Al * /B2 * Bl 



+ 


/A2 


* 


/Al 


* 


B2 


* 


/Bl 


+ 


/A2 


* 


/Al 


* 


B2 


* 


Bl 


+ 


/A2 


* 


Al 


* 


B2 


* 


/Bl 


+ 


/A2 


* 


Al 


* 


B2 


* 


Bl 


+ 


A2 


* 


/Al 


* 


B2 


* 


Bl 


= 


/Al 


* 


B2 


* 


Bl 






+ 


/A2 


* 


/Al 


* 


Bl 






+ 


/A2 


* 


B2 












/A2 


* 


Al 


* 


/B2 


* 


/Bl 


+ 


A2 


* 


/Al 


* 


/B2 


* 


/Bl 


+ 


A2 


* 


Al 


* 


/B2 


* 


/Bl 


+ 


A2 


* 


/Al 


* 


/B2 


* 


Bl 


+ 


A2 


* 


Al 


* 


/B2 


* 


Bl 


+ 


A2 


* 


Al 


* 


B2 


* 


/Bl 


= 


Al 


* 


/B2 


* 


/Bl 






+ 


A2 


* 


Al 


* 


/Bl 






+ 


A2 


* 


/B2 











LES = B2 


* /A2 




+ (B2 


:+: /A2) 


* Bl * /Al 


GTR = A2 


* /B2 




+ (A2 


:+: /B2) 


* Al * /Bl 



These equations can tiien be extended for a general comparison 
of n-bit comparands as follows: 

IBS = Bn * /An 

+ (Bn :+: /An) * Bn-1 * /An-1 

+ (Bn :+: /An) * (Bn-1 :+: /An-1) * Bn-2 * 

/An-2 
+ ... 
+ ... 
+ ... 

+ (Bn : + : /An) * (Bn-1 : + : /An-1) ... 
(B2 : + : /A2) * Bl * /Al 



GTR = An * /Bn 



(An :+: /Bn) 
(An :+: /Bn) 
/Bn-2 



An-1 * 
(An-1 



/Bn-1 
:+: /Bn-1) 



* An-2 * 



(An :+: /Bn) * (An-1 :+: /Bn-1) 
(A2 :+: /B2) * Al * /Bl 



Comparator Device Selection Considerations 

The number of product terms needed is directly related to the 
number of bits compared. For LES (less than) and GTR (greater 
than) functions, the number of product terms required depends 
upon the number of bits in the two operands compared, as well as 
their value. The LES and GTR equations can be rewritten as 
follows: 



The total number of product terms required for an n-bit compari- 
son is 2"-1 . Comparators require a large number of product terms 
and devices that offer many product terms can be used very ef- 
fectively. Figure 1 8 shows a list of devices that offer many product 
terms. It also includes registered devices, since registers are 
sometimes used to hold the comparator output signals asserted 
for long durations. 



DEVICE 


TOTAL 
NUMBER 
OF PINS 


DEDICATED 
INPUT 
PINS 


DEDICATED 

OUTPUT 

PINS 


I/O 
PINS 


LOGICAL 

PRODUCT 

TERMS/ 

OUTPUT 


SPEED GRADES 
tpJNns 


TTL 


1601 
20C1 


20 

24 


16 
20 


2 
2 


- 


16 
16 


40 
40 


20S10 
22V10 
32VX10 


24 
24 
24 


12 
12 
12 


2 


8 

10 
10 


8-16 
8-16 
8-16 


35 
25,35 
25,35 


CMOS 


C22V10 


24 


12 


- 


10 


8-16 


25.35 


C29M16 
C29MA16 


24 
24 


5 
5 


- 


16 
16 


8-16 
4-12 


35,45 
35.45 



Figure 18. PLDs With Large Numbers of Product Terms 
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As is obvious from these equations, comparators require exclu- 
sive-OR functions, and can also be efficiently implemented in the 



PAL1 6X4, which offers exclusive-OR functions. An example of a 
four-bit comparator implemented on a PAL16X4 is shown in 
Figure 19. 



PAL16X4 PAL DESIGN SPECIFICATION 

P7031 BIRKNER/COLI 07/12/81 

BETWEEN LIMITS COMPARATOR/REGISTER 
MMI SUNNYVALE, CALIFORNIA 

CLK LOAD CLEAR BO Bl B2 B3 NC /0C2 GND 
/OCX /NE /EQ A3 A2 Al AC /LT /GT VCC 

; DESCRIPTION 

;THE DEVICE CONTINUOUSLY COMPARES THE BUS VALUE (B3-B0) WITH THE VALUE OF 
;THE REGISTER (A3-A0) AND REPORTS THE STATUS ON OUTPUTS LT, EQ, NE, AND GT: 

* LT INDICATES THAT B IS LESS THAN A 

* EQ INDICATES THAT B IS EQUAL TO A 

* NE INDICATES THAT B IS NOT EQUAL TO A 

* GT INDICATES THAT B IS GREATER THAN A 



/OCl /0C2 CLK LOAD 



STATUS BUS REG 
CLEAR LT EQ NE GT B3-B0 A3-A0 



OPERATION 



H 


X 


X 


X 


X 


X 


X 


X 


X 


X 


Z 


REG HI-Z 


X 


H 


X 


X 


X 


z 


z 


z 


z 


X 


X 


STATUS HI-Z 


L 


X 


X 


L 


L 


X 


X 


X 


X 


X 


A 


READ REG 


X 


X 


c 


H 


L 


X 


X 


X 


X 


B 


B 


LOAD REG 


X 


X 


c 


L 


L 


X 


X 


X 


X 


X 


A 


HOLD 


X 


X 


c 


X 


H 


X 


X 


X 


X 


X 


L 


CLEAR REG 


X 


L 


X 


L 


L 




STATUS 




B 


A 


COMPARE 



IF(0C2) LT = (A3*/B3) 

+ (A3:*:B3) * 

+ (A3:*:B3) * 

+ (A3:*:B3) * 

IF(0C2) GT = (/A3*B3) 

+ (A3:*:B3) * 

+ (A3:*:B3) * 

+ (A3:*:B3) * 

IF(0C2) EQ = {A3:*:B3) * 

IF(0C2) NE = (A3:+:B3) + 

/A3 := (/A3)*/L0AD 
+ (/B3)* LOAD 
+ CLEAR 

/A2 := (/A2)*/L0AD 
+ (/B2)* LOAD 
+ CLEAR 



/Al 



/AO 



( /Al ) */LOAD 
(/Bl)* LOAD 
CLEAR 

(/A0)*/LOAD 
(/BO)* LOAD 
CLEAR 



(A2*/B2) 

(A2:*:B2) * (A1*/B1) 

(A2:*:B2) * (A1:*:B1) * (AO*/BO) 



(/A2*B2) 

(A2:*:B2) * (/A1*B1) 

(A2:*:B2) * (A1:*:B1) * (/AO*BO) 

(A2:*:B2) * (A1:*:B1) * (AO:*:BO) 

(A2:+:B2) + (A1:+:B1) + (AO:+:BO) 



HOLD REG A3 
LOAD REG A3 
CLEAR REG A3 

HOLD REG A2 
LOAD REG A2 
CLEAR REG A2 

HOLD REG Al 

LOAD REG Al 
CLEAR REG Al 

HOLD REG AO 
LOAD REG AG 
CLEAR REG AO 



;B3=L, A3=H 
!B2=L, A2=H 
!B1=L, A1=H 
;BO=L, AO=H 

;B3=H, A3=L 
;B2=H, A2=L 
;B1=H, A1=L 
;BO=H, AO=L 

; COMPARE EQUAL 

;NOT EQUAL 



Figure 19. PAL16X4 Based Four-Bit Comparator 
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FUNCTION TABLE 

CLK /OCl /0C2 LOAD CLEAR B3 B2 81 BO A3 A2 Al AO LT EQ NE GT 



; CONTROL 






BUS 


REG 














; 


/oc 


OPERATIONS 


BBBB 


AAAA 


— 


STATUS 





COMMENTS 




;CLK 


I 


2 


LOAD 


CLEAR 


3210 


3210 


LT 


EQ 


NE 


GT 


(HEX VALUES) 




C 


L 


X 


X 


H 


XXXX 


LLLL 


X 


X 


X 


X 


CLEAR REG 




C 


L 


X 


H 


L 


LLLL 


LLLL 


X 


X 


X 


X 


LOAD REG (0) 




X 


L 


L 


L 


L 


LLLL 


LLLL 


L 


H 


L 


L 


COMPARE (0 EQ 


0) 


X 


L 


L 


L 


L 


LLLH 


LLLL 


L 


L 


H 


H 


COMPARE (1 GT 


0) 


X 


L 


X 


L 


L 


XXXX 


LLLL 


X 


X 


X 


X 


READ REG (0) 




c 


L 


X 


X 


H 


XXXX 


LLLL 


X 


X 


X 


X 


CLEAR REG 




c 


L 


X 


H 


L 


LHLH 


LHLH 


X 


X 


X 


X 


LOAD REG (5) 




X 


L 


L 


L 


L 


LLLL 


LHLH 


H 


L 


H 


L 


COMPARE (0 LT 


5) 


X 


L 


L 


L 


L 


LHLH 


LHLH 


L 


H 


L 


L 


COMPARE (5 EQ 


5) 


X 


L 


L 


L 


L 


HHHH 


LHLH 


L 


L 


H 


H 


COMPARE (F GT 


5) 


X 


L 


X 


L 


L 


XXXX 


LHLH 


X 


X 


X 


X 


READ REG (5) 




c 


L 


X 


X 


H 


XXXX 


LLLL 


X 


X 


X 


X 


CLEAR REG 




c 


L 


X 


H 


L 


HLHL 


HLHL 


X 


X 


X 


X 


LOAD REG (A) 




X 


L 


L 


L 


L 


LHLL 


HLHL 


H 


L 


H 


L 


COMPARE (4 LT 


A) 


X 


L 


L 


L 


L 


HLHL 


HLHL 


L 


H 


L 


L 


COMPARE (A EQ 


A) 


X 


L 


L 


L 


L 


HLHH 


HLHL 


L 


L 


H 


H 


COMPARE (B GT 


A) 


X 


L 


X 


L 


L 


XXXX 


HLHL 


X 


X 


X 


X 


READ REG (A) 




c 


L 


X 


X 


H 


XXXX 


LLLL 


X 


X 


X 


X 


CLEAR REG 




c 


L 


X 


H 


L 


HHHH 


HHHH 


X 


X 


X 


X 


LOAD REG (F) 




X 


L 


L 


L 


L 


HHHL 


HHHH 


H 


L 


H 


L 


COMPARE (E LT 


F) 


X 


L 


L 


L 


L 


HHHH 


HHHH 


L 


H 


L 


L 


COMPARE (F EQ 


F) 


X 


L 


X 


L 


L 


XXXX 


HHHH 


X 


X 


X 


X 


READ REG (F) 




c 


L 


X 


L 


L 


XXXX 


HHHH 


X 


X 


X 


X 


HOLD (F) 




X 


H 


X 


X 


X 


XXXX 


ZZZZ 


X 


X 


X 


X 


TEST HI-Z (/0C1=H) 


X 


X 


H 


X 


X 


XXXX 


XXXX 


z 


z 


z 


z 


TEST HI-Z (/0C2=H) 



Figure 19. PALI 6X4 Based Four-Bit Comparator (Cont'd.) 



The values of the comparands themselves affect the number of 
product terms used. When the comparison is made with compar- 
ands which are power-of-two numbers, the number of product 
terms required can be reduced drastically. This essentially relies 
on the fact that when the lower bits of a comparand are all zeros 
only the highest bit needs to be compared, requiring only one 
product term. For example in a two-bit comparator, if A1 is zero 
andA2isone, the equation forthegreater-than function becomes 
very simple and requires only one product term: 



This general GTR equation can also be considered as an equa- 
tion for comparing a number to a range of numbers extending 
from zero to number A. In factthis trick is used very often by many 
system designers for address decoder functions. In PLD Design 
Methodology (page 2-22) the R0MCS1 signal is one such signal 
which is generated for the address range from (000000) hex to 
(OFFFFF) hex. Forthis design n=23, the comparand A=(OFFFFF 
+ 1 )=1 00000 and p=21 . Substituting in the general equation we 
get the same address decoder Boolean logic equation. 



GTR = /B2 



/A23 * /A22 * /A21 



The general equation for the GTR signal can also be simplified 
when comparing a number B to a fixed power-of-two comparand 
A with p least significant zeros. 



A = 000010000 
n p 



00 

1 



As such designs require few product terms and no XOR gates, 
they are efficiently implemented on standard combinatorial PLDs. 
A list of such standard combinatorial PLDs is shown in Figure 20. 
A general form of range comparators with two boundary compar- 
ands will be discussed later. 



GTR = /Bn * /Bn-1 ... * /Bpfl * /Bp 
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DEVICE 


TOTAL 


DEDICATED 


DEDICATED 


I/O 


LOGICAL 


SPEED GRADES 




NUMBER 


INPUT 


OUTPUT 


PINS 


PRODUCT 


tpJNns 




OF PINS 


PINS 


PINS 




TERMS/ 
OUTPUT 




TTL 


16L8 


20 


10 


2 


6 


7 


10,15,25,35 


16P8 


20 


10 


2 


6 


7 


25 


18P8 


20 


10 


- 


8 


8 


15,25 


20L8 


24 


14 


2 


6 


7 


15, 25, 35 


20L10 


24 


12 


2 


8 


3 


15,20,25,30 


20S10 


24 


12 


2 


8 


8-16 


35 


22P10 


24 


12 


- 


10 


8 


15,25 


22XP10 


24 


12 


- 


10 


8 


20, 30, 40 


22RX8 


24 


14 


- 


8 


8 


25 


22V10 


24 


12 


- 


10 


8-16 


25.35 


32VX10 


24 


12 


- 


10 


8-16 


25.30 


CMOS 


C16L8 


20 


10 


2 


6 


7 


25 


C20L8 


24 


14 


2 


6 


7 


35,45 


C22V10 


24 


12 


- 


10 


8-16 


25,35 


C29M16 


24 


5 


. 


16 


8-16 


35,45 


C29MA16 


24 


5 


- 


16 


4-12 


35,45 



Figure 20. Standard Combinatorial PLDs witli Sum-of-Products Logic 



The tiiird output signal is the EQL signal. The EQL Boolean 
equation tells us whether the two numbers are identical. Such 
information is useful not only in address decoders, but also in 
digital signal processing designs. This equation requires a large 
number of product terms. A closer examination reveals that it is 
essentially an exclusive-OR function. 



EQL = /A2 * /B2 
+ A2 * B2 



(/Al 
(/Al 



/Bl + Al 
/HI + Al 



Bl) 

Bl) 



EQL = (Al :*: Bl) * (A2 :*: B2) ; Exclusive-NOR 

; function 

Inverting this: 

^EQL = (Al :+: Bl) + (A2 :+: B2) ; Exclusive-OR 

; function 

This equation can be extended to give a general equation for 
!qual-to comparison for two n-bit comparands. 



This inverted equation is implementable in the sum-of-products 
form of the exclusive-OR functions, and can be easily expanded 
to the following: 

/£QL = Al * /Bl + /Al * Bl 



+ A2 
+ A3 

+ ... 
+ ... 
+ An 



/B2 + /A2 
/B3 + /A3 



B2 
B3 



* /Bn + /An * Bn 



This gives us a general sum-of-products form of a comparator 
equation which is easily implemented in PAL devices. An n-bit 
comparator requires 2n product terms. An eight-bit comparator 
is implemented in the design shown in Figure 21. This design 
uses the PALI 601 , which is specifically designed for comparator 
applications, and provides a large number of product terms 
connected to EQL and /EQL (NE) outputs. All of the devices that 
have a largo number of product terms are illustrated in Figure 1 8. 
and would be appropriate for such designs. 



'EQL = (An : + : Bn) 

+ (An-1 :+: Bn-1) 

-I- (An-2 :+: Bn-2) 

+ (An-3 :+: Bn-3) 

+ ... 

+ ... 

+ (Al :+: Bl) 



^ 
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Title Octal_Comparator 

Pattern OctComp.pds 

Revision A 

Author Mehrnaz Hada 

Company Monolithic Memories Inc. 

Date 1/29/85 



Santa Clara, CA 



;The octal comparator establishes when two 8-bit data 
.-strings (A7-A0) and (B7-B0) are equivalent (EQ=H) or not 
.•equivalent (NE=H). 

CHIP OctalCom PAL16C1 

A7 AO BO Al Bl A2 B2 A3 B3 GND 
A4 B4 A5 B5 EQ NE A6 B6 B7 VCC 



EQUATIONS 








NE = AO*/BO 


+ 


/AO* 


BO 


+ A1*/B1 


+ 


/Al* 


Bl 


+ A2*/B2 


+ 


/A2* 


B2 


+ A3*/B3 


+ 


/A3* 


B3 


+ A4*/B4 


+ 


/A4* 


B4 


+ A5*/B5 


+ 


/A5* 


B5 


+ A6*/B6 


+ 


/A6* 


B6 


+ A7*/B7 


+ 


/A7* 


B7 


SIMULATION 









;A0 


: + : 


BO 


;A1 


: + : 


Bl 


;A2 


: + : 


B2 


;A3 


: + : 


B3 


;A4 


: + : 


B4 


;A5 


: + : 


B5 


;A6 


: + : 


Be 


;A7 


: + : 


B7 



TRACE_ON A7 A6 A5 A4 A3 A2 Al AO NE 
B7 B6 B5 B4 B3 B2 Bl BO 

SETF A7 /A6 /A5 /A4 /A3 /A2 /Al /AO 
/B7 /B6 /B5 /B4 /B3 /B2 /Bl /BO 

SETF /A7 A6 

SETF /A6 A5 

SETF /A5 A4 

SETF /A4 A3 

SETF /A3 A2 

SETF /A2 Al 

SETF /Al AO 

SETF /A7 /A6 /A5 /A4 /A3 /A2 /Al /AO 
B7 

SETF /B7 B6 

SETF /B6 B5 

SETF /B5 B4 

SETF /B4 B3 

SETF /B3 B2 

SETF /B2 Bl 

SETF /Bl BO 

SETF /BO 

SETF A7 A6 A5 A4 A3 A2 Al AO 
B7 B6 B5 B4 B3 B2 Bl BO 

SETF /A7 A6 /A5 A4 /A3 A2 /Al AO 
/B7 B6 /B5 B4 /B3 B2 /Bl BO 

SETF A7 /A6 A5 /A4 A3 /A2 Al /AO 
B7 /B6 B5 /B4 B3 /B2 Bl /BO 

TRACE OFF 



A7=H, 


B7=L 


A6=H, 


B6=L 


A5=H, 


B5=L 


A4=H, 


B4=L 


A3=H, 


B3 = L 


A2=H, 


B2=L 


A1=H, 


B1=L 


AO=H, 


BO=L 


A7 = L, 


B7=H 


A6=L, 


B6=H 


A5 = L, 


B5 = L 


A4 = L, 


B4=H 


A3 = L, 


B3=H 


A2 = L, 


B2=H 


A1=L, 


B1=H 


AO = L, 


BO=H 


Test 


all L's 


Test 


all H's 


Test 


even ones 


Test 


odd ones 



Figure 21. Octal Comparator Design Example 
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Note that the EQL equation, as well as GTR and LBS equations, 
rely upon the XOR function. Often the logic represented by the 
equations is implemented in two or more devices. There are 
several PAL devices that provide dedicated XOR capability, and 
can efficiently implement the logic. These devices, listed in Figure 
22, can be used for comparator applications. The PAL32VX10 
and the PAL22RX8 offer an architecture with a multiple number 
of product terms on one side of the XOR gate, which allows a 
number to be compared against multiple comparands. The de- 



sign example in Figure 1 9 shows the XOR implementation of the 
LES, GTR and EQL functions. 

Let us analyze these equations further. The LES and GTR 
outputs indicate whether one number is greater than or less than 
another. In fact, these equations can also be judiciously com- 
bined to get a comparison of a range of numbers such as 
A > X > B. Such range comparisons are very useful for address 
decoder circuits. 



DEVICE 


TOTAL 
NUMBER 
OF PINS 


DEDICATED 
INPUT 
PINS 


DEDICATED 

REGISTERED 

OUTPUT 

PINS 


I/O 
PINS 


LOGICAL 

PRODUCT 

TERMS/ 

OUTPUT 


SPEED GRADES 
FREQUENCY IN 

MHz 

'max 


TTL 


16X4 


20 


8 


4 


4 


8 


14 


20X10 
20X8 
20X4 


24 
24 
24 


10 
10 
10 


10 
8 
4 


2 
6 


4 
4 
4 


22.2 
22.2 
22.2 


22XP10 


24 


12 


- 


10 


8 


20, 30, 40 


20XRP10 
20XRP8 
20XRP6 
20XRP4 


24 
24 
24 
24 


10 
10 
10 
10 


10 
8 
6 
4 


2 

4 
6 


8 
8 
8 
8 


30,22.2,14 
30,22.2,14 
30,22.2,14 
30,22.2, 14 


22RX8 
32VX10 


24 
24 


14 
12 


0-8 
0-10 


8-0 
10-0 


8 
8-16 


28.5 
25, 22.2 



Figure 22. PAL Devices with XOR Capability for Comparators 
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Range Decoders 

Range decoders implemented as address decoders are one of 
the most commonly used applications of PLDs in digital systems. 
Agood example is tlie address decoder illustrated earlieron page 
2-22, in the PLD Design Methodology section. Range decoders 
compare a number (address) to a given range of comparands 
(addresses). One way to arrive at the range decoder Boolean 
equations is to use the traditional truth table approach. Another 
way is to use the Boolean equations generated earlier in the 
comparator section for greater-than and less-than functions. To 
decode a range of three-bit numbers from B to A, we must 
compare another number X such that A>X>B. The Boolean 
equations for the GTR (A>X) and LES (B<X) functions are 
illustrated below: 



Combining these two equations can give us a range signal which 
will be asserted only when Ais greaterthan X and X is greaterthan 
B. The combined Boolean equation follows: 



RANG 



( 


A3 * /X3 






+ 


(A3 :+: /X3) 


* A2 * 


/X2 


+ 

* 

( 


(A3 :+: /X3) 
X3 * /B3 


* (A2 


:+: /X2) 


-1- 


(X3 :+: /B3) 


* X2 * 


/B2 


+ 


(X3 :+: /B3) 


* (X2 


:+: /B2) 



* Al * /Xl ) 



* XI * /Bl ) 



GTR = 


A3 


* /X3 




+ 


(A3 


:+: /X3) * A2 * 


/X2 


+ 


(A3 


:+: /X3) * (A2 


:+: /X2) 


LES = 


X3 


* /B3 




+ 


(X3 


:+: /B3) * X2 * 


/B2 


+ 


(X3 


:+: /B3) * (X2 


:+: /B2) 



* Al * /Xl 



* XI * /Bl 



Using Boolean algebra we get the following equation: 



RANG 

(A3 

(A3 

(A3 

(X3 



: + : /X3) * (X3 :-l- 

:+: /X3) * (X3 :+ 

: + : /X3) * (A2 :-l- 

:+: /B3) * (X2 :+ 

(A3 :+: /X3) * A2 * /X2 * X3 * /B3 

(X3 :+: /B3) * A3 * /X3 * X2 * /B2 



/B3) 


* 


(A2 


: + : 


/X2) * 


Al * 


/Xl 


* 


X2 


* 


/B2 


/B3) 


* 


(X2 


: + : 


/B2) * 


A2 * 


/X2 


* 


XI 


* 


/Bl 


/X2) 


* 


Al * 


/Xl 


* X3 


* /B3 












/B2) 


* 


A3 * 


/X3 


* XI 


* /Bl 













The general equation for n-bit comparands can also be obtained by extending these equations. 

RANG = ( An * /Xh 

+ (An :+: /Xn) * An-1 * /Xn-1 

+ (An :+: /Xn) * (An-1 :+: /Xh-1) * An-2 * /Xn-2 
-I- ... 
+ ... 

+ (An : + : /Xn) * (An-1 : + : /Xh-1) ... (A2 :-f-: /X2) * Al * /Xl) 
* 

(Xh * /Bn 
+ (Xn :+: /fin) * Xn-1 * /Bn-1 

+ (Xn :+: /Bn) * (Xn-1 :+: /Bn-1) * Xn-2 * /Bn-2 
+ ... 
-I- ... 
+ ... 
+ (Xn : + : /Bn) * (Xn-1 : + : /Bn-1) ... (X2 :+: /B2) * XI * /Bl) 
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The number of product terms required is clearly very large and 
can easily exceed one hundred for an eight-bit range comparator. 
Most microprocessors have addresses which exceed 16 bits. In 
order to fit the design on a PAL device, one commonly used 
technique is to select the address range defined by A and B such 
that the range extendsfrom address B+1 to A-1 , where Aand B+1 
are power-of-two numbers. Because the address space is 
aligned on the power-of-two boundaries, a number of bits of the 
address comparands will be zero. When implemented in Boolean 
equations, this substantially reduces the number of product terms 
required. 

The maximum number of product terms required for a three-bit 
range decoder shown above, with any comparand values, is 28. 
If the address is chosen from 2 to 3, resulting in A=4 and B=1 , then 
only one product term will be required. 

RANG = /X3 * X2 

Similarly for a range from one to five, B = 1 and A = 6 (a multiple- 
of-two), and the number of product terms required is only two. 



RANG = /X3 
+ X3 



X2 
/X2 



Thus a careful selection of range boundaries allows such logic 
functions to be implemented easily in PLDs. Such reduction in 
logic obviously also holdstrue for discrete implementations. Most 
address decoders are designed with address ranges with 
boundaries that are power-of-two numbers, and require few prod- 
uct terms for implementation. 

For a power-of-two address range, the comparand A v/ould be a 
power-of-two; 2, 4 or 8. These are numbers whose least signifi- 
cant bits are all zeros. Similarly, comparand B will be a power-of- 
two number (minus one); 1 , 3, and 7. These are numbers whose 
least significant bits are allones. Substituting these in the general 
equation for range comparators we arrive at: 

A = 0000100000000000 
n p 1 

B = 0000000000011111 
n q 1 

RANG = /Xn * /Xn-1 *..* /Xp * (Xp-1 + Xp-2 + ... +Xq) 

This is a general equation for a power-of-two range comparison. 
In the example on page 2-22, the R0MCS2 signal addresses the 
range from 100000 to 1FFFFF, in which case B=OFFFFF and 
A=2FFFFF Here n=23, p=22 and q=21. The address decode 
equation for the ROMCS2 signal can be arrived at by substituting: 

R0MCS2 = /A23 * /A22 * A21 

This is the same equation as was found from the truth table. 

Such designs are very common for address decoder appiica- 
ions. These do not require any XOR gates, and can be imple- 
nented in standard combinatorial PLDs with only sum-of-prod- 
icts logic. A list of such PLDs is shown in Figure 20. 



Adders/Arithmetic Circuits 

Digital systems are designed to carry out a variety of arithmetic 
instructions on binary numerical data. A good example is the 
ALU (Arithmetic Logic Unit) used in all digital computers. The 
basic function of an ALU is that of an adder performing addition 
on two binary numbers. A binary adder takes two inputs, adds 
them, and generates the binary sum. A full adder is a one-bit 
adder with carry-in and carry-out; this is the basic building block 
of any adding circuit. The truth table of such an adder is shown 
in Figure 23. 



INPUTS 


OUTPUTS 


A 


B 


c„ 


Y C 




















1 


1 





1 





1 





1 


1 


1 


1 








1 


1 





1 


1 


1 


1 





1 


1 


1 


1 


1 1 



Figure 23. Truth Table for a Full Adder 

This truth table is then used to form the Boolean equations in the 
manner described earlier. 



y 


= 


A 


* /B 


* 


/Cin 




+ 


/A 


* /B 


* 


Cin 




+ 


A 


* B 


* 


Cin 




+ 


/A 


* B 


* 


/Cin 


Cout 


= 


A * 


Cin 






+ 


A * 


B 








+ 


B * 


Cin 



Bigger binary adders can be made by cascading these full 
adders. Each carry-out is directed to carry-in for the next stage. 
Such adders are known as Ripple Adders. 

Combinatorial PAL devices are ideal for this purpose, since they 
also provide internal feedback. Thus one strong consideration in 
such designs is the internal feedback capability of the device, in 
addition to other general device selection considerations. 

These ripple adders have an advantage in that they can be 
cascaded to any length possible. But since the carry-out from the 
least significant bit has to travel all the way to the highest 
significant bit, this can take a long time, making such large adders 
inefficient. Adders with built in carry-look-ahead circuitry can 
save time by simultaneously generating the carry-in signal for all 
of the bits. 
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Rewriting the equations for a full adder from above gives: 

YO = AO :+: BO :+: Cin 

wlnere the carry-out signal is: 

CO = AO * BO + (AO + BO) * Cin 

Extending these equations for an n-bit carry-look-ahead adder, 
we can directly get the following equations: 

YO = AO :+: BO :+: Cin 

Yl = Al :+: Bl :+: CO 

where 

CO = AO * BO 

+(A0 + BO) * Cin 

Y2 = A2 :+: B2 :+: CI 

where 

Cl = Al * Bl 

+ {A1 + B1)*(A0 * BO) 

+ {A1 + B1)*(A0 + BO) * Cin 

Y3 = A3 :+: B3 :+: C2 

where 

C2 = A2 * B2 

+ (A2 + B2)*(A1 * Bl) 

+ (A2 + B2)*(A1 + B1)*(A0 * BO) 

+ (A2 + B2)*(A1 + B1)*(A0 + BO) * Cin 

In general 

Yn = An :+: Bn :+: Cn-1 

and Cn-1 = An-1 * Bn-1 

+ (An-l -I- Bn-l)*(An-2* Bn-2) 

+ ... 

+ ... 

+ (An-l + Bn-1)* ... * (AO -I- BO) * Cin 

and finally the carry-out is: 

Cn = An * Bn 

+ (An -f- Bn) * (An-1 * Bn-1) 

+ (An + Bn) * (An-1 + Bn-1) * (An-2 * Bn-2) 

+ ... 

+ ... 

+ (An + Bn) * ... * (AO + BO) * Cin 

These equations are essentially a combination of the traditional 
generate and propagate logic for ALU design. 



Adder Device Selection Considerations 

The number of product terms required for implementing a carry- 
look-ahead adder is enormous. The carry-out function alone for 
a four-bit carry-look-ahead adder requires over 36 product terms 
in the sum-of-products form. For a single-level AND-OR implem- 
entation the number of product terms required for the most 
significant bit Y3 is 28. This will not fit into any of the available 
standard combinatorial devices. A PLD with this many product 
terms per output will not only be slow, but will also allow very low 
silicon utilization. 

Special circuitry, known as bit-pair decoding, has been provided 
on the PAL1 6X4 to allow implementation of such functions with up 
tosixproductterms. Designed primarily forarithmeticoperations, 
the PALI 6X4 device has an architecture which implements three 
different stages of logic. The first is the bit-pair decoding between 
an input and a feedback. The second is the standard AND-OR 
array logic. The third is the XOR gate provided on selected 
outputs. 

Although adders are strictly a combinatorial function, the 
PAL16X4 provides four registered outputs. Not only are these 
registers used for storing the result, but they are also used 
temporarily to hold one of the two operands being added. The 
other operand is provided directly at the inputs. To keep the 
discussion simple we will not discuss how the first operand is 
loaded into the registers. This load function can be implemented 
by using an extra product term on each output. 

The logic trick lies in the bit -pair decoding function. All of the bits 
of the first operand in the registers (A) and the second operand at 
the inputs (B) are bit -pair decoded. As illustrated in Figure 24, the 
results of this bit-pair decoding are A + B, A + /B, /A + B, and 
/A + /B. These outputs are then fed to the AND array as inputs. 






:=0 



^=^> 



BIT-PAIR 
DECODING 

- A + B 

- A + B 

- A + B 

- A + B 



Figure 24. Bit-Pair-Decoding Function 
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Sixteen AND combinations of these four inputs can then be 
formed on every product term of the AND-OR array. These are 
shown in Figure 25, and include the standard true and comple- 
ments of both the bits as well as XOR, XNOR and various other 
combinations. This bit pair decoding essentially provides an extra 
two-level AND-OR logic level before the AND-OR array. The cost 
as well as extra propagation delay of the extra logic level is 
minimal, since the array size does not increase. 

The equationsfortheaddercan obviously benefit from multi-level 
logic. The bit-pair decoding is used to implement the first two 



levels of logic. The next level of logic is implemented in the 
standard AND-OR array. Every product term of this AND-OR 
array can combine one of the sixteen possible functions of 
different inputs/feedbacks of the device. 

The product terms are then combined together through an OR 
gate to implementthe CARRY-OUTfunction, shown in Figure 26. 
For adder outputs YO, Y1, Y2, and Y3, the product terms are 
combined through the XOR gate, as shown in Figure 27. This 
XOR gate on the outputs is the final logic level on the device. 



O- 
O 
-D- 

o- 

■D- 
O 



PRODUCT TERMS 



o- 

£> 

-o 



A 
A +1 



A'B 
A + B 



Figure 25. Sixteen Possible Input Logic Combinations 
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A3 



A2 



BIT PAIR 
DECODING 



A3*B3 
(A3 + B3) *A2*B2 

(A3 + B3)*(A2 + B2)*ArBi 

(A3 + B3)*(A2 + B2)*(Ai+Bi) *Ao*Bo 

(A3+B3)*(A2 + B2) "(Ai + Bi) *(Ao*Bo)*Cin 



B2 



BIT PAIR 
DECODING 



Bl 



Ao 



BIT PAIR 
DECODING 



BIT PAIR 
DECODING 



w 



> 



mlffl CO Im CO im m m 



CARRY-OUT = A3* B3 

(A3+B3) *A2*B2 

(A3+B3)*(A2+B2)*Ai*Bi 
(A3+B3) *(A2 + B2)*(Ai + Bi)*Ao*Bo 
(A3+ B3) * (A2+ B2) * {Ai+ Bi) * (Ao * Bo) *Ci 

Figure 25. Implementation of CARRY-OUT Function 



A3 



B3 



A2 



BIT PAIR 
DECODING 



A3 :+: B3 



(A2* B2) 

(A2 + B2)*(ArBi) 

(A2+B2} *(Ai+Bi)*(Ao*Bo) 

(A2 + B2)*(Ai + Bi)*(Ao*Bo)*Cin 



BIT PAIR 
DECODING 



BIT PAIR 
DECODING 



BIT PAIR 
DECODING 



V 




COCOCOCO CvJCJCMCM y~ y- t' t- OOOO 

cD|Cii m [CO m jco m |CQ m|m mim cn\CQ m|CQ 

+ + ++ + + + + ++ + + ++ + + 

cocococo cjcMcvjcj T-T-T-»- 0000 .G.C 

<<!<!< <<!<!< <<!<!< <<!<!< O lO 



Y3 = A3:+:B3 

:+: {(A2 * B2) 
+{A2+B2) *(Ai*Bi) 
+(A2+B2)*{Ai + Bi)*(Ao*Bo) 
+(A2+B2)*(Ai + Bi)*(Ao*Bo)*Cin) 

Figure 27. Relationship Between Adder Boolean Equation and PAL16X4 Device Logic 
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Thus the most significant bit of the output can he implemented 
using only five product terms; the carry-out function can be 
implemented using only six product terms. All of this is possible 
because the PALI 6X4 offers a modified programmable array with 
hardware dedicated to bit -pair decoding at the inputs and an XOR 
gate at the outputs. 

PAL16X4 

PNEW 

ADDER 

MMI SUNNYVALE, CALIFORNIA 



Figure 28 illustrates an example of one such adder, which also 
includes the logic required for loading the registers with one of the 
operands. 



PAL DESIGN SPECIFICATION 
J. ENGINEER 9/15/87 



CLK LOAD ADD BO Bl B2 B3 
GIN NC NC A3 A2 Al AO 



NC NC GND 
G P VCC 



; DESCRIPTION 

;THE DEVICE FUNCTIONS AS AN ADDER. IT FIRST STORES ONE OPERAND IN THE FOUR- 
;BIT OUTPUT REGISTER UNDER THE CONTROL OF LOAD SIGNAL. USING FEEDBACK AND A 
; SECOND OPERAND AT THE INPUTS, IT ADDS THE TWO UNDER CONTROL OF SIGNAL ADD. 
;THE RESULTS ARE STORED BACK IN THE OUTPUT REGISTERS. THE DEVICE ALSO 
; GENERATES THE PROPAGATE AND GENERATE SIGNALS WHICH ALLOW A NUMBER OF 
; SIMILAR DEVICES TO BE CASCADED. 



IF(VCC) 



P = (A3 + B3) 

* (A2 + B2) 

* (Al + Bl) 

* (AO + BO) 



; PROPAGATE SIGNAL 



IF(VCC) G = (A3*B3) 



(A3 
(A3 
(A3 



B3) 
B3) 
B3) 



CARRY O.EQU ADD * CIN 



; GENERATE SIGNAL 
(A2*B2) 

(A2 + B2) * (A1*B1) 
(A2 + B2) * (Al + Bl) 



(AO*BO) 



CARRY l.EQU ADD 
+ ADD 



CARRY2.EQU 



ADD 
ADD 
ADD 



(AO*BO) 
(AO + BO) 

(A1*B1) 
(Al + Bl) 
(Al + Bl) 



CARRY3.EQU ADD * (A2*B2) 

+ ADD * (A2 + B2) 
+ ADD * (A2 + B2) 
+ ADD * (A2 + B2) 



CIN 



(AO*BO) 
(AO + BO) 



(A1*B1) 
(Al + Bl) 
(Al + Bl) 



CIN 



(AO*BO) 
(AO + BO) 



/A3 



(/A3)*/L0AD * 
(/B3)* LOAD 
(A3:*:B3)*AD 
CARRY 3 



/ADD 



HOLD REG A3 
LOAD • REG B3 
ADD A3 AND B3 



/A2 := (/A2)*/L0AD * /ADD 
+ (/B2)* LOAD 
+ (A2:*:B2)*ADD 
:+: CARRY2 



/Al 



(/A1)*/L0AD * 
(/Bl)* LOAD 
(A1:*:B1)*ADD 
CARRYl 



/ADD 



;H0LD REG A2 
;L0AD REG B2 
:ADD A2 AND B2 



;H0LD REG Al 
;L0AD REG Bl 
rADD Al AND Bl 



/AO 



(/A0)*/LOAD * 
(/BO)* LOAD 
(AO:*:BO)*ADD 
CARRYO 



/ADD 



HOLD REG AO 
LOAD REG BO 
ADD AO AND BO 



FUNCTION TABLE 

; NOT INCLUDED HERE 



Figure 28. A Four-Bit Adder Implemented In a PAL16X4 



,m. 



Monollthio imnU Memories 



^ 



2-57 



Combinatorial Logic Design 



Latches 

PAL devices are often used to implement latches. One of the 
most common uses for latches is as temporary storage for data 
or addresses. PLD-based latches are often used in address 
decoders to assert the decoded signal for long durations. These 
latches are also very useful for asynchronous digital designs, and 
are used often for control and arbitration functions. 

A latch is essentially a simple combinatorial circuit in which the 
output is a function of inputs and feedback. The most commonly 
used latch is the D-type latch. When the control signal latch- 
enable (LEN) is HIGH, the latch is in the "transparent mode" and 
the input signal /D is available at the outputs. When the LEN 
signal is LOW, the input data is latched on the outputs and is 
retained until LEN goes back HIGH. In a typical address decoder, 
the input will be a combination of various address signals, 
decoded as explained earlier for range comparators. The latch- 
ing signal in most microprocessors is called AS (address strobe) 
or ALE (address latch enable). 

The truth table for a latch can be derived directly from this 
functional description, and is shown in Figure 29. 



INPUTS 


OUTPUTS 


/D 


LEN 


/Q 



1 
X 


1 
1 





1 

/Q (previous) 



Figure 29. Truth Table of a Simple Latch 

The Boolean equations for this latch can be directly derived from 
the truth table: 

/Q = /D * LEN 
+ /Q * /LEN 

The logic implementation for this latch is shown in Figure 30. 



are prime candidates for such hazards. As the input changes, the 
two outputs will not respond simultaneously and change state at 
the same instant. Although this will not change the steady-state 
output of the circuit, it may cause a spurious pulse or a "glitch". 
Such hazards are even more dangerous in latches, where the 
glitch can cause incorrect data to be latched. 

There are two types of hazards, static and dynamic. Static 
hazards occur wlien the steady-state output of combinatorial 
logic is not supposed to change due to an input transition, but a 
momentary change does occur. Such a glitch can be further 
classified as a static 1 or a static hazard as shown in Figure 31 . 

STATIC HAZARD STATIC 1 HAZARD 



Figure 31. Static Hazards 

Dynamic hazards involve situations where the the steady-state 
output is supposed to change due to an input transition. The 
hazard occurs when the transient output changes several times 
before settling. Figure 32 shows dynamic hazards. 






Figure 32. Dynamic Hazards 

A Karnaugh map is a very good way of detecting hazard condi- 
tions. When trying to detect a static or static 1 hazard, only the 
mapping of the zeros and the ones, respectively, are required. 
For example, the latch equations in Figure 30 can be mapped to 
a Karnaugh map shown in Figure 33. The relationship between 
the Karnaugh mapping and the Boolean equation product terms 
is also illustrated. 

DLEN 




Figure 30. A Transparent Latch 



Hazards 







Q- D'LEN— ^ 
+ Q* LEN —0 



r'\ 



<V 



^ 



KD 



kD 



l_ STATIC 1 
HAZARD. 



COVERING 
"HAZARD 



Even when a combinatorial circuit has been designed correctly, 
it may still have erroneous outputs due to "hazards". Hazards 
exist because physical circuits do not behave ideally. Combina- 
torial complementary output functions based on the same inputs 



Figure 33. Karnaugh Map for Transparent Latch Design 
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The possibility of a hazard exists when the signal LEN changes. 
Initially, when D and LEN are HIGH, the output Q is also HIGH. 
When LEN switches to LOW, it is possible for the output to go 
LOW momentarily. This is because when LEN goes LOW the first 
product term is disabled, and to maintain the output HIGH the 
second product term should be enabled exactly at the same 
instant. Due to the uneven gate delays or routing conditions on 
the board, these two events will not take place simultaneously. 
This is a static 1 hazard. It can also be identified directly in the 
Karnaugh map by the two adjacent but disjoint sets of ones, 
grouped together to form a product term each. 

The hazard conditions can be easily avoided in the PLDs by 
providing an extracoverproductterm. This productterm is shown 
with a dotted line in the Karnaugh map. This third product term 
will keep the output asserted during the transition of the LEN 
signal, when the control changes from the first product term to the 
second. The modified Boolean equation is shown above. 



/Q = /D * LEN 
+ /Q * /LEN 
+ /D * /Q (Cover product term) 

Devices on which latches are implemented need to provide 
output feedback. All devices with I/O pins provide this necessary 
feedback. The only other consideration for selecting a device 
would be the provision of sufficient number of product terms for 
addressing the needs of glitch-free and testable design. 

Acomplete latch design implemented in a PAL20L1 is shown in 
Figure 34. Although the design is now glitch-free, the cover 
product term introduces further complications in the testability of 
this circuit. This design does not have the additional circuitry to 
address the testability requirements. Details on designing for 
testability are discussed on page 3-108. 



TITLE 


CLEAN OCTAL LATCH 




PATTERN 


P7096 




REVISION 


A 




AUTHOR 


VINCENT COLI 




COMPANY 


MONOLITHIC MEMORIES, 


INC. 


DATE 


03/10/83 





CHIP OCT LAT PAL20L10 



G NC DO Dl D2 D3 D4 D5 D6 D7 NC GND 
/OC NC Q7 Q6 Q5 Q4 Q3 Q2 Ql QO NC VCC 



;DESCRIPTION 

THIS PAL16L8 IMPLEMENTS AN 8-BIT LATCH FUNCTION WITH THREE-STATE OUTPUTS. 
THE LATCH PASSES EIGHT BITS OF DATA (D7-D0) TO THE EIGHT OUTPUTS (Q7-Q0) 
WHEN LATCH ENABLE IS TRUE (G=HIGH). THE DATA IS LATCHED WHEN LATCH ENABLE 
IS FALSE (G=LOW). THE OUTPUTS WILL BE DISABLED (HI-Z) WHEN OUTPUT ENABLE 
IS TRUE (/OC=TRUE) REGARDLESS OF ANY OTHER INPUTS. 



/OC G D7-D0 Q7-Q0 



H 


X 


X 


Z 


HI-Z 


L 


L 


X 


Q 


LATCH OUTPUT 


L 


H 


D 


D 


LOAD LATCH 



;THIS DESIGN SHOWS HOW TO IMPLEMENT A "CLEAN" LATCH SINCE THERE ARE NO 
; OUTPUT GLITCHES AS THE DEVICE CHANGES STATE. 



Figure 34. Clean Octal Latch Design 
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EQUATIONS 




/QO = G*/DO 
+ /G*/QO 
+ /DO*/Q0 


;LOAD LATCH (QO) 

; LATCH OUTPUT 

; COVER ALWAYS HIGH 


QO.TRST = OC 


; OUTPUT ENABLE 


/Ql = G*/D1 
+ /GVQl 
+ /D1*/Q1 


;LOAD LATCH (Ql) 

; LATCH OUTPUT 

; COVER ALWAYS HIGH 


Ql.TRST = OC 


; OUTPUT ENABLE 


/Q2 = G*/D2 
+ /GVQ2 
+ /D2*/Q2 


;LOAD LATCH (Q2) 

; LATCH OUTPUT 

; COVER ALWAYS HIGH 


Q2.TRST = OC 


; OUTPUT ENABLE 


/Q3 = G*/D3 
+ /G*/Q3 


;LOAD LATCH (Q3) 
; LATCH OUTPUT 


+ /D3*/Q3 


; COVER ALWAYS HIGH 


Q3.TRST = OC 


; OUTPUT ENABLE 


/Q4 = G*/D4 
+ /G*/Q4 
+ /D4*/Q4 


;LOAD LATCH (Q4) 

; LATCH OUTPUT 

; COVER ALWAYS HIGH 


Q4,TRST = OC 


; OUTPUT ENABLE 


/Q5 = G*/D5 
+ /GVQb 
+ /D5*/Q5 


;LOAD LATCH (Q5) 

; LATCH OUTPUT 

; COVER ALWAYS HIGH 


Q5.TRST = OC 


; OUTPUT ENABLE 


/Q6 = G*/D6 
+ /GVQ5 
+ /D6*/Q6 


;LOAD LATCH (Q6) 

; LATCH OUTPUT 

; COVER ALWAYS HIGH 


Q6.TRST = OC 


; OUTPUT ENABLE 


/Q7 = G*/D7 
+ /G*/Q7 
+ /D7*/Q7 


;LOAD LATCH (Q7) 

; LATCH OUTPUT 

; COVER ALWAYS HIGH 


Q7.TRST = OC 


; OUTPUT ENABLE 



SIMULATION NOT INCLUDED HERE 



Figure 34. Clean Octal Latch Design (Cont'd.) 
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In the previous section we discussed combinatorial designs, 
circuits whose outputs are totally independent of any system 
clock. In this section we will discuss sequential circuits, where 
outputs store their previous values until a new clock is applied. 
The storage elements which retain the previous output values are 



called flip-flops. A bank of these flip-flops forms a register, 
although individualflip-flops are often called registers. Monolithic 
Memories and Advanced Micro Devices offer a number of PLDs 
which provide registered outputs. The table in Figure 1 lists all of 
the PLDs which provide registered outputs. 



DEVICE 


TOTAL 


DEDICATED 


DEDICATED 


I/O 


LOGICAL 


SPEED GRADES 




NUMBER 


INPUT 


REGISTERED 


PINS 


PRODUCT 


FREQUENCY IN 




OF PINS 


PINS 


OUTPUT 
PINS 




TERMS/ 
OUTPUT 


MHz 
'max 


TTL 


16R8 


20 


8 


8 


. 


8 


55.5,37.25,16 


16R6 


20 


8 


6 


2 


8 


55.5.37,25,16 


16R4 


20 


8 


4 


4 


8 


55.5,37,25, 16 


16RP8 


20 


8 


8 


- 


8 


22.2 


16RP6 


20 


8 


6 


2 


8 


22.2 


16RP4 


20 


8 


4 


4 


8 


22.2 


16RA8 


20 


8 


0-8 


8-0 


4 


20 


16X4 


20 


8 


4 


4 


8 


14 


23S8 


20 


9 


4 


4 


8-12 


33.3. 28.5 


20R8 


24 


12 


8 


. 


8 


37.25.16 


20R6 


24 


12 


6 


2 


8 


37.25,16 


20R4 


24 


12 


4 


4 


8 


37,25,16 


20X10 


24 


10 


10 


- 


4 


22.2 


20X8 


24 


10 


8 


2 


4 


22.2 


20X4 


24 


10 


4 


6 


4 


22.2 


20RP10 


24 


10 


10 


. 


8 


37,25 


20RP8 


24 


10 


8 


2 


8 


37,25 


20RP6 


24 


10 


6 


4 


8 


37,25 


20RP4 


24 


10 


4 


6 


8 


37,25 


20XRP10 


24 


10 


10 


- 


8 


30,22.2,14 


20XRP8 


24 


10 


8 


2 


8 


30,22.2,14 


20XRP6 


24 


10 


6 


4 


8 


30,22.2,14 


20XRP4 


24 


10 


4 


6 


8 


30,22.2,14 


20RS10 


24 


10 


10 


. 


8-16 


19.2 


20RS8 


24 


10 


8 


2 


8-16 


19.2 


20RS4 


24 


10 


4 


6 


8-16 


19.2 


20RA10 


24 


10 


0-10 


10-0 


4 


33.20 


22RX8 


24 


14 


0-8 


8-0 


8 


28.5 


22V10 


24 


12 


0-10 


10-0 


8-16 


40.28.5.18 


32VX10 


24 


11 


0-10 


10-0 


8-16 


25. 22.2 


32R16 


40 


16 


16 


- 


8-16 


16 


PLS167 


24 


14 


6 


. 


Total 48 


33 


PLS168 


24 


12 


8 


- 


Total 48 


'33 


PLS105 


28 


16 


8 


- 


Total 48 


37 



Figure 1. Registered PLDs 
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DEVICE 


TOTAL 


DEDICATED 


DEDICATED 


I/O 


LOGICAL 


SPEED GRADES 




NUMBER 


INPUT 


REGISTERED 


PINS 


PRODUCT 


FREQUENCY IN 




OF PINS 


PINS 


OUTPUT 
PINS 




TERMS/ 
OUTPUT 


MHz 

MAX 


CMOS 


C16R8 


20 


8 


8 


- 


8 


28.5 


016R6 


20 


8 


6 


2 


8 


28.5 


G16R4 


20 


8 


4 


4 


8 


28.5 


C20R8 


24 


12 


8 


- 


8 


20, 15.3 


C20R6 


24 


12 


6 


2 


8 


20. 15.3 


G20R4 


24 


12 


4 


4 


8 


20, 15.3 


C22\/10 


24 


12 


0-10 


10-0 


8-16 


33.3, 20 


C29M16 


24 


5 


- 


16 


8-16 


20.15 


G29MA16 


24 


5 


- 


16 


4-12 


20,15 


ECL 


10H20EV/EG8 


24 


12 


0-8 


8-0 


8-12 


125 


10020EV/EG8 


24 


12 


0-8 


8-0 


8-12 


125 



Figure 1. Registered PLDs (Cont'd.) 



Before we discuss purely registered designs, let us take a look at 
designs which combine both registered and combinatorial por- 
tions. Registered and combinatorial outputs are often mixed on 
a single device. There can be two distinct designs, one registered 
and one combinatorial (often glue logic) combined on a single 
device for higher integration. There may also be a design 
requirement where registered outputs need to be decoded using 
combinatorial logic. 

There are a number of devices which provide both registered and 
combinatorial outputs. Some devices provide programmable 



register bypass, which allows outputs to be programmed as 
registered or combinatorial. Figure 2 shows a list of PLDs with 
both registered and combinatorial outputs. Figure 3 shows a list 
of PLDs with programmable register bypass. 

In most design software packages the output registers are 
signified by the ":=" assignment symbol, as opposed to the "=" 
sign for acombinatorial output. This helps easily identify registers 
in each equation. In devices which provide outputs configurable 
as either registered or combinatorial, this sign is also used by the 
software to configure the outputs. 
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DEVICE 


TOTAL 


DEDICATED 


DEDICATED 


I/O 


LOGICAL 


SPEED GRADES 




NUMBER 


INPUT 


REGISTERED 


PINS 


PRODUCT 


FREQUENCY IN 




OF PINS 


PINS 


OUTPUT 
PINS 




TERMS/ 
OUTPUT 


MHz 

'max 


TTL 


16R6 


20 


8 


6 


2 


8 


55.5.37,25,16 


16R4 


20 


8 


4 


4 


8 


55.5,37,25,16 


16RP6 


20 


8 


6 


2 


8 


22.2 


16RP4 


20 


8 


4 


4 


8 


22.2 


16RA8 


20 


8 


0-8 


8-0 


4 


20 


16X4 


20 


8 


4 


4 


8 


14 


23S8 


20 


g 


^ 


4 


8-12 


33.3. 28.5 


20R6 


24 


12 


6 


2 


8 


37.25.16 


20R4 


24 


12 


4 


4 


8 


37.25.16 


20X8 


24 


10 


8 


2 


4 


22.2 


20X4 


24 


10 


4 


6 


4 


22.2 


20RP8 


24 


10 


8 


2 


8 


37.25 


20RP6 


24 


10 


6 


4 


8 


37,25 


20RP4 


24 


10 


4 


6 


8 


37,25 


20XRP8 


24 


10 


8 


2 


8 


30,22.2,14 


20XRP6 


24 


10 


6 


4 


8 


30,22.2,14 


20XRP4 


24 


10 


4 


6 


8 


30,22.2.14 


20RS8 


24 


10 


8 


2 


8-16 


19.2 


20RS4 


24 


10 


4 


6 


8-16 


19.2 


20RA10 


24 


10 


0-10 


10-0 


4 


33.20 


22RX8 


24 


14 


0-8 


8-0 


8 


28.5 


22V10 


24 


12 


0-10 


10-0 


8-16 


40.28.5.18 


32VX10 


24 


12 


0-10 


10-0 


8-16 


25, 22.2 


32R16 


40 


16 


16 


- 


8-16 


16 


CMOS 


C16R6 


20 


8 


6 


2 


8 


28.5 


C16R4 


20 


8 


4 


4 


8 


28.5 


C20R6 


24 


12 


6 


2 


8 


20, 15.3 


C20R4 


24 


12 


4 


4 


8 


20, 15.3 


C22V10 


24 


12 


0-10 


10-0 


8-16 


33.3, 20 


C29M16 


24 


5 


. 


16 


8-16 


20.15 


C29MA16 


24 


5 


- 


16 


4-12 


20,15 


ECL 


Latched 














10H20G8 


24 


12 


0-8 


8-0 


4-8 


N.A. (latch) 


10H/020EG8 


24 


12 


0-8 


8-0 


8-12 


N.A. (latch) 


10H/020EV8 


24 


12 


0-8 


8-0 


8-12 
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Figure 2. PLDs with Both Registered and Combinatorial Outputs 
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DEVICE 


TOTAL 
NUMBER 
OF PINS 


DEDICATED 
INPUT 
PINS 


DEDICATED 

REGISTERED 

OUTPUT 

PINS 


I/O 
PINS 


LOGICAL 

PRODUCT 

TERMS/ 

OUTPUT 


SPEED GRADES 

FREQUENCY IN 

MHz 

MAX 


TTL 


16RA8 
23S8 


20 
20 


8 
9 


0-8 
4 


8-0 
4 


4 
8-12 


20 
33.3, 28.5 


20RA10 

22RX8 

22V10 

32VX10 

32R16 


24 
24 
24 
24 
40 


10 
14 
12 
12 
16 


0-10 
0-8 
0-10 
0-10 
16 


10-0 
8-0 
10-0 
10-0 


4 

8 
8-16 
8-16 
8-16 


33,20 

28.5 

40,28.5,18 

25, 22.2 

16 


CMOS 


C22V10 


24 


12 


0-10 


10-0 


8-16 


33.3. 20 


C29M16 
C29MA16 


24 
24 


5 
5 


- 


16 
16 


8-16 
4-12 


20.15 
20.15 


ECL 


Latched 
10H20G8 
10H/020EG8 
10H/020EV8 


24 
24 
24 


12 
12 
12 


0-8 
0-8 
0-8 


8-0 
8-0 
8-0 


4-8 
8-12 
8-12 


N.A. (latch) 

N.A. (latch) 
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Figure 3. PLDs with Programmable Register Bypass 



General Device Selection 
Considerations 

The same set of general device selection considerations dis- 
cussed in the PLD design methodology section (page 2-21 ) apply 
to registered designs. The list of items which must be considered 
is repeated in Figure 4 for convenience. A device can be 
conveniently selected from the table of registered PLDs 
(Figure 1 ) based upon the specific input and output requirements. 

1. Number of input pins 

2. Numberof output pins 

3. Numberof I/O pins 

4. Device speed 

5. Device power requirements 

6. Numberof registers 

7. Numberof product terms 

8. Output polarity control 

Figure 4. General Device Selection Considerations 

Maximum Frequency 

Forregistered designs, speed is aparameterwhichneedscareful 
consideration. Most combinatorial designs use the propagation 
delay (tpp) for ensuring that enough time is allowed for the data 
from the inputs to appear at the outputs. In registered designs the 
effects of the ciocl< must be tal<en into account. This is reflected 



in the maximum frequency (f^,^,,) parameter. The flexibility inher- 
ent in PLD design provides a choice of configurations from which 
different f^^^ parameters can be calculated. 

In the first type of design, the PLD is used for a stand-alone 
registered design. In order to decide the next logic level of the 
registers, the present logic level needs to be available at the 
inputs of the registers before they are clocked (Figure 5). Under 
these conditions the clock period is limited by the internal delay 
from the flip-flop outputs through the internal feedback and logic 
to the flip-flops inputs (^cn+*cF^' ^°'' ^'^'^ configuration: 

f«Ax°l/(t.>i + ^cr) = designated the internal f„^. 

CLK 
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'max INTERNAL: 1/(ts,+ tcF) 

Figure 5. Internal f„^ 
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The second type of system configuration is when a number of 
logic devices with registers, including PLDs, are clocked with a 
common clock. This is probably the most prevalent configuration. 
In this case the registered outputs are sent off-chip back to the 
device inputs or to the inputs of a second device. The slowest 
path required to set up an input (Figure 6) is the sum of the clock- 
to-output delay and the input setup time for the external signals 
(*$u + ^CLK^- ^^® same speed grade devices are typically clocked 
together, which means that the t^^ 
usually be the same. 

Thus for this configuration: 

f„„ = 1/ (t.„ + t„,) 



Flip-flop Types 

There are four basic typesofflip-f lops; S-R,J-K,Tand the popular 
D-type. Details on the various types of flip-flops are provided in 
the logic reference (page 6-8). In this section we will assume that 
the reader is familiar with these flip-flops. 

Almost all registered PLDs provide the basic D-type flip-flops. 
D-type flip-flops are the simplest to design with and will be used 
throughout this section. Some PLDs provide the capability of 
configuring output registers as either D, T, J-K or S-R. Configur- 
able flip-flops in some cases can reduce the number of product 
terms required for certain designs. The effect of the configurable 
flip-flops will be discussed wherever relevant. 



This f|^^j( is the maximum frequency when external feedback is Synchronous vs Asynchronous 

used or when several equivalent-speed devices are used. This 
f^^^j. is designated the external i^^^. The tables for registered 



devices show this f^^^ parameter. 

The third type of design is a simple data path application. In this 
case, input data is presented to the flip-flop and clocked; no 
feedback is employed (Figure 7). In this case, the period is limited 
by the sum of data setup time and data hold time (tj^+tf,). 
However, the minimum clock period (t^^ + 1^^^) 's usually a stricter 
limit. Thus the third f^^j^, designated "f^^, no feedbaclC will be 
the lesser of: 



Registered designs can be easily classified into two categories; 
synchronous and asynchronous. In synchronous designs the 
clock inputs of ail the registers are tied together to a common 
clock. This is the most commonly used type of registered design. 
With asynchronous designs, theflip-flops' clock inputs may not be 
tied together, and the clocks may be gated or even driven by other 
flip-flops. We will first discuss synchronous registered designs 
and then asynchronous registered designs. 



1/ (t „ + t ) 



1/ (t., + t„) 
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f MAX EXTERNAL: 1(15^+ Iclk) 

Figure 6. External f„^ 
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Figure 7. f^^^ with No Feedback 
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Synchronous Registered Designs 



Counters 



Synchronous registered designs are used for two major func- 
tions: data handling and control. Registered synchronous de- 
signsfordatahandlingincludecountersandshiftregisters. There 
are various types of counters, among them binary counters, 
modulo counters, Johnson counters, and Gray-code counters. 
These counters are differentiated by the sequence of values 
through which the counter travels. A binary counter is the 
simplest form of a counter, and is used most often for data 
functions. Any system requiring a regular count uses a binary 
counter. Modulo, Gray-code and Johnson counters are also used 
for control. 

All counters are actually subsets of a larger class of digital designs 
called state machines. State machines are discussed in detail in 
the next chapter of this handbook. 



Counters are the most commonly used sequential circuits. A set 
of registers which cycles through a predetermined, unvarying 
sequence is called a counter. Ageneral model of a synchronous 
counter is illustrated in Figure 8. This shows a common clock to 
all the flip-flops, whose outputs are fed back to a combinatorial 
logic array called the next-state (count) decoder. The next count 
is generated by this logic based upon the present count and 
control inputs. Most PLDs use the standard sum-of-products 
form of array for this logic. 

The relationship between a four-bit counter and its signal timing 
diagram is illustrated in Figure 9. The counters can also be 
represented by state diagrams (Figure 10). The state diagrams 
are bubble-and-arrow diagrams, with each bubble representing a 
count value and each arrow a transition from one count to the 
next. More detail on state diagrams is given in the next chapter 
on state machine design. For counters, the state diagrams are a 
convenient representation tool and will be used in the discussion 
when necessary. 
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Figures. General Model of a Counter 
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STATE 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 1 2 3 
(COUNT) 

Figure 9. Timing Diagram of a Four-bit Binary Counter 
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Figure 10. State Diagram of a Four-bit Binary Counter 



Binary Counters 

Let us examine a four-bit binary counter. The truth table (also 
called the transition table) for such a counter is given in Figure 1 1 . 
The table lists the next state values of all the output registers 
based upon their present values. 



PRESENT STATE 




NEXT STATE 




X3 X2 XI XO 


X3 


X2 X1 


XO 













1 





1 





10 





1 




11 





1 





10 





1 




10 1 





1 1 





110 





1 1 




111 










10 









10 1 




1 





10 10 




1 




10 11 




1 





110 




1 




110 1 




1 1 





1110 




1 1 




1111 












Figure 11. Thie Truth Table for a Four-bit Binary Counter 
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We derive Boolean equations for each bit directly from the above 
truth table by collecting all the product terms where outputs are 
asserted HIGH (ones). This yields: 



X3 



X2 



XI 



XO : = 



/X3 


* 


X2 


* 


XI 


* XO 


+ X3 


* 


/X2 


* 


/Xl 


* /XO 


+ X3 


* 


/X2 


* 


/Xl 


* XO 


+ X3 


* 


/X2 


* 


XI 


* /XO 


+ X3 


* 


/X2 


* 


XI 


* XO 


+ X3 


* 


X2 


* 


/Xl 


* /XO 


+ X3 


* 


X2 


* 


/Xl 


* XO 


+ X3 


* 


X2 


* 


XI 


* /XO 


/X3 


* 


/X2 


* 


XI 


* XO 


+ /X3 


* 


X2 


* 


/Xl 


* /XO 


+ /X3 


* 


X2 


* 


/Xl 


* XO 


+ /X3 


* 


X2 


* 


XI 


* /XO 


+ X3 


* 


/X2 


* 


XI 


* XO 


+ X3 


* 


X2 


* 


/Xl 


* /XO 


+ X3 


* 


X2 


* 


/Xl 


* XO 


+ X3 


* 


X2 


* 


XI 


* /XO 


/X3 


* 


/X2 


* 


/Xl 


* XO 


+ /X3 


* 


/X2 


* 


XI 


* /XO 


+ /X3 


* 


X2 


* 


/Xl 


* XO 


+ /X3 


* 


X2 


* 


XI 


* /XO 


+ X3 


* 


/X2 


* 


/Xl 


* XO 


+ X3 


* 


/X2 


* 


XI 


* /XO 


+ X3 


* 


X2 


* 


/Xl 


* XO 


+ X3 


* 


X2 


* 


XI 


* /XO 


/X3 


* 


/X2 


* 


/Xl 


* /XO 


+ /X3 


* 


/X2 


* 


XI 


* /XO 


+ /X3 


* 


X2 


* 


/Xl 


* /XO 


+ /X3 


* 


X2 


* 


XI 


* /XO 


+ X3 


* 


/X2 


* 


/Xl 


* /XO 


+ X3 


* 


/X2 


* 


XI 


* /XO 


+ X3 


* 


X2 


* 


/Xl 


* /XO 


+ X3 


* 


X2 


* 


XI 


* /XO 



These Boolean equations are for devices with active-HIGH 
outputs. These equations can be inverted for devices with active- 
LOW outputs. The Boolean equations for active-LOW devices 
can also be directly derived from the truth table by collecting all the 
product terms where the active-LOW outputs (zeros) are 
asserted. 

Manipulating the equations with Boolean algebra we obtain the 
Boolean logic equations: 



XO 


= /XO 






XI 


= XI 


: + : 


XO 


X2 


= X2 


:+: 


(XI * XO) 


X3 


= X3 


: + : 


(X2 * XI 



XO) 

Similarly for active-LOW output devices 
(since /(A :+: B) = /A :+: B): 



/XO 


= XO 


/Xl 


= /Xl :+: XO 


/X2 


= /X2 :+: (XI * XO) 


/X3 


= /X3 :+: (X2 * XI 



These equations could also be obtained from the Boolean equa- 
tions developed for an adder in the combinatorial design section 
on page 2-54. 

Rewriting the equations for an adder: 

XO ° AO :+: BO :+: Cin 



XI 


= Al :+: Bl :+ 
; where 


CO 




CO 


= AO * BO +(A0 


+ BO) * Cin 




X2 


= A2 :+: B2 :+ 
; where 


CI 




CI 


•= Al * Bl + (Al 


+ B1)*(A0 * 


BO) 




-(-(Al + B1)*(A0 


+ BO) * Cin 




X3 


= A3 :+: B3 :+ 
; where 


C2 




C2 


■= A2 * B2 + (A2 


+ B2)*(A1 * 


Bl) 




-l-(A2 + B2)*(A1 


+ B1)*(A0 * 


BO) 




-1-(A2 + B2)*(A1 


+ B1)*(A0 + 


BO) * Cin 



Assuming one of the operands in the adder is the number itself 
and the second operand is one (X3-X0 = A3-A0, B3-B0 = 0001 
and Cin = 0) we get the following equations for a counter: 



XO 


= /XO 






XI 


° XI 


: + : 


XO 


X2 


= X2 


: + : 


(XI * XO) 


X3 


= X3 


: + : 


(X2 * XI 



XO) 

These are of course the same equations as the ones derived 
directly from the truth table. The equations for a binary counter 
are very regular. The general equation for an n-bit binary counter 
can be directly expressed: 

Xn := Xn :+: (Xn-1 * Xn-2 ... XO) 

For devices with active-LOW outputs the general Boolean equa- 
tions can be derived by inverting both sides of the equation: 

/Xn := /Xn :+: (Xn-1 * Xn-2 ... XO) 

These equations represent a binary UP counter. Counting 
backwards for a DOWN counter, the Boolean equations can be 
similarly generated, either from the truth table or from the adder 
Boolean equations. The general equation for a DOWN 
counter is: 

Xn := Xn :+: (/Xn-1 * /Xn-2 ... /XO) 

This equation is for active-HIGH outputs. For active-LOW output 
devices the Boolean equation for a DOWN counter is; 

/Xn := /Xn :+: (/Xn-1 * /Xn-2 ... /XO) 

A four-bit binary counter which combines both UP and DOWN 
capability is illustrated in Figure 12. This design is implemented 
in a registered PLD and requires at least six product terms for the 
most significant output bit. The design logic has been imple- 
mented in sum-of-products form. It uses a control input (UP) 
which selects between the "count up" and "count down" product 
terms, allowing the counter to count up or down. 
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Further control functions can be added to these counter equa- 
tions directly either at the truth-table stageor in the equations. For 
example, a load data function is required in most counters. This 
allows registers to be loaded with a count under the control of 



another input signal (LOAD). When the LOAD signal is HIGH the 
counter is loaded with the input data, and when the LOAD signal 
is LOW the counting is resumed. 



Title 4Bit_Counter 

Pattern 4cnt.pds 

Revision B 

Author Mehrnaz Hada, Bill Hollon, Ali Sebt 

Company Monolithic Memories Inc. Santa Clara, CA 

Date 1/14/85 

CHIP 4BitCoun PAL16RP4 



CLK UP AI BI CI DI CLR LOAD NC GND 
/OC NC NC D C B A NC NC VCC 

The 4-bit counter counts up or down and has the clear and 
load capability. The clear operation overrides count and 
load. The counter counts up when CLR=low, LOAD=low, and 
UP=high. It counts down whenever CLR=low, LOAD=low, and 
UP=low 



EQUATIONS 



;= /A*/B*/C*/D*/UP*/LOAD*/CLR 
+ /A* B* C* D* UP*/LOAD*/CLR 
+ A* B* /UP*/LOAD*/CLR 
+ A*/B* D* /LOAD*/CLR 
+ A* /C* UP*/LOAD*/CLR 
+ A* C*/D* /LOAD*/CLR 
+ LOAD*/CLR* AI 



;When CLR=1, A=0. 
;Else it will count 
;UP or DOWN. 



;New value is loaded 
;when L0AD=1, CLR=0. 



/B*/C*/D*/UP*/LOAD*/CLR 

/B* C* D* UP*/LOAD*/CLR 

B* C*/D* /LOAD*/CLR 

B*/C* UP*/LOAD*/CLR 

B* D*/UP*/LOAD*/CLR 

LOAD*/CLR* 



/C*/D*/UP*/LOAD*/CLR 

/C* D* UP*/LOAD*/CLR 

C*/D* UP*/LOAD*/CLR 

C* D*/UP*/LOAD*/CLR 

LOAD*/CLR* CI 



/D* 



/LOAD*/CLR 
LOAD*/CLR* DI 



;When CLR=1, 8=0. 
;Else it will count, 



;New value is loaded 
;when L0AD=1, CLR=0. 



;When CLR=1, C=0. 
;Else it will count, 



;New value is loaded 
;when L0AD=1, CLR=0. 

rCount 

;New value is loaded 

;when L0AD=1, CLR=0. 



Figure 12. Design File for a Four-bit Counter 
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SIMULATION 

TRACE_ON AI BI CI DI LOAD CLR UP A B 

SETF LOAD /CLR AI BI CI DI OC 
CLOCKF CLK 

SETF CLR 
CLOCKF CLK 

SETF /CLR UP /LOAD 

FOR I := 1 TO 16 DO 
BEGIN 

CLOCKF CLK 

END 

SETF LOAD /CLR /UP AI BI CI DI 

CLOCKF 

SETF /LOAD 

FOR I := 1 TO 16 DO 
BEGIN 

CLOCKF CLK 
END 

SETF LOAD CLR AI /BI CI /DI 
CLOCKF CLK 

SETF /OC 

TRACE OFF 



;Load all registers 
; to HIGH and count up 

; Clear all registers 



;Start Counting up 

;Count up 15 clock 
jcycles 



;Load all registers 

rto HIGH and count 

;down 

;Count down 16 clock 

; cycles 



;Test setting LOAD 
;and CLR on at the 
;same time. 



Figure 12. Design File for a Four-bit Counter (Cont'd.) 



Binary Counter Device Selection Considerations 

One major device selection consideration is the logic require- 
ment. The binary counter Boolean equations make use of 
exclusive-OR functions in the output. In most of the registered 
PLDs shown in Figure 1, the XOR functions are implemented in 
their sum-of-products logic form. This usually requires a large 
number of product terms. We have seen a design example for a 
four-bit UP/DOWN binary counter which requires up to six prod- 
uct terms. Most standard PAL devices provide eight product 
terms per output. However, for larger counters, a large number 
of product terms is required. Figure 1 3 shows a list of PLDs which 
provide more than eight product terms per output. 

Some PLDs provide a dedicated XOR gate on the outputs (Figure 
14). This allows an AND-OR-XOR implementation of the Boolean 
logic, and consequently requires fewer product terms. A nine-bit 
binary counter design is illustrated in Figure 1 5. This design has 
been implemented in the PAL20X10, which provides an XOR 
gate. This design cannot be implemented in standard PAL 
devices. 



Cascading Binary Counters 

Situations are occasionally encountered in digital system designs 
where very long counters are required. Few PLDs provide 
greater than ten outputs to implement such counters. Binary 
counters can be easily cascaded into two or more devices to 
construct such large counters. The design of long counters is very 
simple. These are designed as simple binary counters with a 
count enable control. The less significant counters generate an 
extra output signal at the penultimate count. These signals are 
ANDed together to form the count enable signal for the higher- 
order counter. For a down counter the reverse scheme is 
implemented. 

Cascading counters is a lot simpler than cascading adders 
because the carry-look-ahead circuitry is not required. The only 
thing to remember is thatthe more significant countertoggles only 
when the penultimate count of all of the less significant counters 
is reached. 

Notice that the nine-bit counter has a built-in mechanism for cas- 
cading. This count enable output signal is represented by CO and 
is easily implemented in the PLD. This is just another example 
of how such modifications are easy to handle in a PLD design, 
unlike standard SSI/MSi designs. 
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DEVICE 


TOTAL 
NUMBER 
OF PINS 


DEDICATED 
INPUT 
PINS 


DEDICATED 

REGISTERED 

OUTPUT 

PINS 


I/O 
PINS 


LOGICAL 

PRODUCT 

TERMS/ 

OUTPUT 


SPEED GRADES 

FREQUENCY IN 

MHz 

^MAX 


TTL 


2388 


20 


9 


4 


4 


8-12 


33.3. 28.5 


20RS10 

20RS8 

20RS4 


24 
24 
24 


10 
10 
10 


10 
8 
4 


2 

6 


8-16 
8-16 
8-16 


19.2 
19.2 
19.2 


22V10 
32VX10 


24 
24 


12 
12 


0-10 
0-10 


10-0 
10-0 


8-16 
8-16 


40.28.5,18 
25. 22.2 


PLS167 
PLS168 


24 
24 


14 
12 


6 
8 


- 


Total 48 
Total 48 


33 
33 


PLS105 


28 


16 


8 


- 


Total 48 


37 


CMOS 


C22V10 


24 


12 


0-10 


10-0 


8-16 


33.3. 20 


C29M16 
C29MA16 


24 
24 


5 
5 


- 


16 
16 


8-16 
4-12 


20.15 
20.15 


ECL 


10H20EV/EG8 
10020EV/EG8 


24 
24 


12 
12 


0-8 
0-8 


8-0 
8-0 


8-12 
8-12 


125 
125 



Figure 13. Registered PLDs with More Than Eight Product Terms Per Output 



DEVICE 


TOTAL 
NUMBER 
OF PINS 


DEDICATED 
INPUT 
PINS 


DEDICATED 

REGISTERED 

OUTPUT 

PINS 


I/O 
PINS 


LOGICAL 

PRODUCT 

TERMS/ 

OUTPUT 


SPEED GRADES 

FREQUENCY IN 

MHz 

*MAX 


TTL 


16X4 


20 


8 


4 


4 


8 


14 


20X10 

20X8 

20X4 


24 
24 
24 


10 
10 
10 


10 
8 
4 


2 

6 


4 
4 
4 


22.2 
22.2 
22.2 


20XRP10 
20XRP8 
20XRP6 
20XRP4 


24 
24 
24 
24 


10 
10 
10 
10 


10 
8 
6 
4 


2 

4 
6 


8 
8 
8 
8 


30. 22.2. 14 
30.22.2. 14 
30. 22.2. 14 
30.22.2,14 


22RX8 
32VX10 


24 
24 


14 
12 


0-8 
0-10 


8-0 
10-0 


8 
8-16 


28.5 
25. 22.2 



Figure 14. Registered PLDs for Binary Counter Design with XOR Gate Output 
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Title 9BitCounter 

Pattern 9BitCnt.pds 

Revision A 

Author Mehrnaz Hada 

Company Monolithic Memories Inc., Santa Clara, CA 

Date 1/28/85 



;The 9-bit synchronous counter has parallel load, increment, 
;and hold capabilities. The carry out pin (/CO) shows how to 
; implement a carry out using a register by anticipating one 
;count before the terminal count if counting and the terminal 
;count if loading. 



Operations Table 

/OC CLK /LD DB-DO Q8-Q0 



CHIP 9BitCoun PAL20X10 



CLK DO Dl D2 D3 D4 D5 D6 07 D8 /LD GND 
/OC /CO 08 07 06 05 Q4 Q3 Q2 Ql QO VCC 



EQUATIONS 
/QO 



/Ql 



/Q2 



/Q3 



/Q4 



/Q5 



/Q6 



/Q7 



/Q8 



/LDVQO 
LD*/DO 
/LD 

/LD*/Q1 

LD*/D1 

/LD* QO 

/LD*/Q2 

LD*/D2 

/LD* QO' 

/LD*/Q3 

LD*/D3 

/LD* QO' 



Ql 



Ql* Q2 



/LD*/Q4 
LD*/D4 
/LD* QO* Ql* Q2* Q3 

/LD*/Q5 
LD*/D5 
/LD* QO* Ql* Q2* Q3* Q4 

/LD*/Q6 
LD*/D6 
/LD* QO* Ql* Q2* Q3* Q4* Q5 



/LD*/Q7 
LD*/D7 
/LD* QO* Ql* Q2* Q3' 



Q4* Q5* Q6 



Operation 



H 


X 


X 


X 


z 


HI-Z 


L 


L 


X 


X 





Hold 


L 


C 


L 


D 


D 


Load 


L 


c 


H 


X 


Q PLUS 1 


Increment 



;Hold QO 
;Load DO (LSB) 
;Count 

;Hold Ql 
;Load Dl 
;Count 

;Hold Q2 
;Load D2 
; Count 

;Hold Q3 
;Load D3 
;Count 

;Hold 04 
;Load D4 
(•Count 

.-Hold Q5 
;Load D5 
; Count 

;Hold Q6 
;Load D6 
;Count 

;Hold Q7 
;Load D7 
: Count 



/LD*/QB 

LD*/D8 1 

/LD* QO* Ql* Q2* Q3* Q4* Q5* Q6* Q7 l 

/LD*/QO* Ql* Q2* Q3* Q4* Q5* Q6* Q7* Q8 

LD* DO* Dl* D2* D3* D4* D5* D6* D7* D8 

Figure 15. Design Fiie for a Nine-bit Counter 



;Hold Q8 
;Load D8 (MSB) 
; Count 

;Carry out 

; (Anticipate Count) 

;Carry out 

; (Anticipate Load) 
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SIMULATION 

TRACE_ON /OC /LD D8 D7 D6 D5 D4 D3 D2 Dl DO 
/CO Q8 Q7 Q6 Q5 Q4 Q3 Q2 Ql QO 

SETF OC LD /D8 /D7 /D6 /D5 /D4 /D3 /D2 /Dl /DO ;Load 
CLOCKF CLK 

SETF /LD ; Increment 

CLOCKF CLK 

SETF LD /D8 /D7 /D6 /D5 /D4 /D3 /D2 /Dl DO ;Load 
CLOCKF CLK 

SETF /LD ; Increment 

CLOCKF CLK 

SETF LD /D8 /D7 /D6 /D5 /D4 /D3 D2 Dl DO ;Load 

CLOCKF CLK 

SETF /LD ; Increment 

CLOCKF CLK 

SETF LD /D8 /D7 /D6 /D5 /D4 D3 D2 Dl DO ; Load 

CLOCKF CLK 

SETF /LD ; Increment 

CLOCKF CLK 

SETF LD /D8 /D7 /D6 /D5 D4 D3 D2 Dl DO ;Load 

CLOCKF CLK 

SETF /LD ; Increment 

CLOCKF CLK 

SETF LD /D8 D7 D6 D5 D4 D3 D2 Dl DO ;Load 

CLOCKF CLK 



SETF /LD 
CLOCKF CLK 



; Increment 



TRACE_OFF 

Figure 15. Design File for a Nine-bit Counter (Cont'd.) 



£1 



MonolHhiG 



m 



Memories 



2-73 



Registered Logic Design 



Flip-flop Selection 

Until now, all of the designs have been implemented in devices 
with D-type flip-flops. What happens if the counter design is 
implemented in a device which allows ixsth J-K and T-type 
registers? The Boolean logic equations for such a design can be 
derived from the truth table. This requires advance knowledge of 



the functionality of the J-K and T-type registers. For the J-K 
register the output is asserted when the J input goes HIGI-i, and 
the output is unasserted when the K input goes HIGH. Toggle 
type registers require the T input to be asserted for every change 
in the output level. 



PRESENT STATE 


NEXT STATE 


X3 


X2 


XI 


XO 


X3 X2 XI XO 


D 


J 


K 


T 


D 


J 


K 


T 


D 


J 


K 


T 


D 


J 


K T 













































1 


1 




























1 














10 










































11 














1 


1 





1 








1 











10 














1 


























1 


10 1 














1 













1 














110 














1 


























1 


111 




1 





1 








1 


1 








1 












10 









































1 


10 1 



























1 















10 10 








































1 


10 11 













1 


1 





1 








1 












110 













1 



























1 


110 1 













1 













1 















1110 













1 


























1 


1111 








1 


1 








1 


1 








1 













Figure 16. Truth Table for D, J-K and T-type Flip-flops 



Figure 16 shows the truth tablefor both a J-K and aT-type register 
implementation for a binary counter. Deriving and optimizing the 
equations from the table, we get the following results: 



X3-J := /X3 * X2 * XI * XO 
X3-K := X3 * X2 * XI * XO 



X2-J 


= 


/X2 * XI * XO 


X2-K 


= 


X2 * XI * XO 


Xl-J 


= 


/Xl * XO 


Xl-K 


= 


XI * XO 


XO-J 


= 


/XO 


XO-K 


= 


XO 


X3-T 


c. 


X2 * XI * XO 


X2-T 


= 


XI * XO 


Xl-T 


= 


XO 


XO-T 


= 


1 



As we can see from these equations, the number of product terms 
used for J-K and T-type implementations is smaller than the 
number of product terms required for a D-type implementation 
(Figure 12). 



Which flip-flop is most efficient depends on the relative number of 
transitions or holds required. As a counter traverses from one 
count (state) to another, every output either makes a "transition" 
(changes logic level) or "holds" (stays at the same logic level). 
Small counters in general require more transitions and fewer 
holds. As the designs get larger, the higher-order bits require 
fewer transitions and more holds. 

D-type flip-flops use up product terms only for active transitions 
from logic LOWIevelto HIGH level, andforlogicHIGH level holds 
only. J-K and T-type flip-flops use up product terms for both LOW- 
to-HIGH and HIGH-to-LOWtransitions, but eliminate hold terms. 
Generally, the requirements of transition and hold terms depends 
upon the count sequence selection. D-type flip-flops are more 
efficient for small designs. Conversely J-K and T-type flip-flops 
can be more efficient for large designs, which require more hold 
terms. 

A comparison of product term requirements of 2-, 3-, 4- and 5-bit 
binary counters can be representative for other types of counters 
and state machines. The table in Figure 1 7 shows the transition 
terms and the hold terms required for these counters. For a J-K 
type flip-flop implementation, after optimizing, total product terms 
required are 4, 6, 8, and 10 respectively. The D-type implemen- 
tation requires 3, 6, 10, and 15 respectively, and is relatively less 
efficient for large counters. 
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BINARY COUNTER 


TRANSITIONS 


HOLDS 


D PRODUCT 
TERMS 


J-K PRODUCT 
TERMS 


T PRODUCT 
TERMS 


2-Bit 
3-Bit 
4-Bit 
5-Bit 


6 

14 
30 
62 


2 

10 
34 
98 


3 
6 

10 
15 


4 
6 
8 
10 


1 

1 
1 
1 



Figure 17. Product Term Requirements for Configurable Flip-flops 



Ttiere are a number of PLDs which off er the capability of program- 
mable D, J-K ancJ T-type registers. Figure 18 shows a list of all 
PLDs which offer configurable flip-flops. The J-K and T-type flip- 



flops are constructed from a basic D-type flip-flop using an extra 
XOR gate per flip-flop. 



DEVICE 


TOTAL 
NUMBER 
OF PINS 


DEDICATED 
INPUT 
PINS 


DEDICATED 

REGISTERED 

OUTPUT 

PINS 


I/O 
PINS 


LOGICAL 

PRODUCT 

TERMS/ 

OUTPUT 


SPEED GRADES 

FREQUENCY IN 

MHz 

MAX 


TTL 


22RX8 
32VX10 


24 
24 


14 
12 


0-8 
0-10 


8-0 
10-0 


8 
8-16 


28.5 
25, 22.2 



Figure 18. Registered PLDs with Configurable Flip-flops 



Modulo Counters 



The number of unique states a counter traverses is generally 
referred to as the modulus. A typical n-bit binary counter has a 
maximum modulus of 2" . It is often necessary to introduce signal 
delays into the logic design to meet timing requirements. This 
makes it possible to allow for bus-skew, access time or differential 
propagation delays between devices along two different signal 
paths. Atypical example of this is the introduction of wait states 
to allow for access times of different memory elements. Counters 
and delay lines are commonly used to introduce the delay. 
Counters in PLDs have the added advantage of programmability 
to select the required delay. Such applications where precise 
timing duration control is required usually use modulo counters 
with a non-power-of-two modulus. Other applications of modulo 
counters include waveform generators and arbiters. 



A good example of a modulo counter is a BCD counter. Such a 
counter is useful in applications where the computer's outputs are 
generated using a decimal system. While a four-bit binary 
counter can count up to sixteen, the BCD counter terminates the 
count at the modulus of 10. 

Modulo counters can be designed in a variety of ways. One direct 
way is to use the truth table to implement acount to a modulus and 
directly derive the equations from it. The truth table for a BCD 
count (from zero to nine) is shown in Figure 19. 

Now let us consider what happens if the device accidentally 
powers up in one of the count values from ten to fifteen. These 
are illegal counts (states) and, for a good design, a mechanism 
must be built into the equations to allow it to recover back into a 
legal state. What we actually need is to consider the truth table 
in Figure 20 in conjunction with the one in Figure 1 9 for deriving 
the Boolean equations. 



PRESENT STATE 




NEXT STATE 


03 02 01 QO 




03 02 01 QO 





0->1 


1 


1 


1 ->2 


10 


10 


2->3 


11 


11 


3->4 


10 


10 


4->5 


10 1 


10 1 


5->6 


110 


110 


6->7 


111 


111 


7->8 


10 


10 


8->9 


10 1 


10 1 


9->0 






PRESENT STATE 




NEXT STATE 


03 Q2 01 QO 




03 02 01 QO 


10 10 
10 11 
110 
110 1 
1110 

1111 


10 ->0 
11 ->0 
12 ->0 
13 ->0 
14 ->0 
15 ->0 











Figure 19. Truth Table for a BCD Counter 



Figure 20. Truth Table for Illegal State Recovery to 
Count Zero 
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A state diagram for the BCD counter is shown in Figure 21 . For 
active-LOW outputs, the Boolean equations can be derived 
directly from the truth table and optimized using Karnaugh maps 
or the PALASM 2 software minimizer. 

The Boolean equation for Q3 is: 



/Q3 



The equation can be reduced to the following: 



/Q3 : = 



= /Q3 


* 


/Q2 


* 


/Ql * /QO 


+ /Q3 


* 


/Q2 


* 


/Ql * QO 


+ /Q3 


* 


/Q2 


* 


Ql * /QO 


+ /Q3 


* 


/Q2 


* 


Ql * QO 


+ /Q3 


• 


Q2 


* 


/Ql * /QO 


+ /Q3 


* 


Q2 


* 


/Ql * QO 


+ /Q3 


* 


Q2 


* 


Ql * /QO 


+ Q3 


* 


/Q2 


* 


/Ql * QO 


+ Q3 


* 


/Q2 


* 


Ql * /QO 


+ Q3 


* 


/Q2 


* 


Ql * QO 


+ Q3 


* 


Q2 


* 


/Ql * /QO 


+ Q3 


* 


Q2 


* 


/Ql * QO 


+ Q3 


* 


Q2 


* 


Ql * /QO 


+ Q3 


* 


Q2 


* 


Ql * QO 



= 


/Q3 * /Q2 


+ 


/Q3 * /Ql 


+ 


/Q2 * QO 


+ 


Q3 * Ql 


+ 


Q3 * Q2 



Similar Boolean equations can be generated for Q2, Q1 and QO. 
Figure 22 shows the circuit diagram of a loadable dual BCD 
counter implemented in a PALI 6R8. Figure 23 shows the design 
file for such a counter. 




Figure 21. State Sequence of a BCD Counter Showing Illegal State Recovery 



CEB- 
LDB- 



CEA - 
LDA- 



INPUT 
BUFFERS 



FOUR- 
BIT 
BCD 
|_J COUNT 



FOUR- 
BIT 
BCD 

COUNT 



Ql 
Q2 
Q3 



Figure 22. Circuit of a Dual BCD Counter 
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TITLE 


DUAL BCD COUNTER. 


PATTERN 


00 


REVISION 


01. 


AUTHOR 


CHRIS JAY. 


COMPANY 


MMI SANTA CLARA, CA. 


DATE 


17 JULY 1987 



CHIP BCD PAL16R8 

THE PAL16R8 HAS BEEN PROGRAMMED AS A DUAL BINARY 
CODED DECIMAL COUNTER, REGISTERS MAY BE LOADED FROM 
DATA INPUTS DO - D3 , THE LDA ENABLES DATA ON 
DO - D3 TO BE SYNCHRONOUSLY LOADED INTO Q4 - Q7 . 
LDB ENABLES DATA ON DO - D3 TO BE LOADED INTO 
REGISTERS QO - Q3. TWO COUNT ENABLE INPUTS CEA 
AND CEB ENABLE A BINARY CODED DECIMAL COUNT IN 
BOTH COUNTERS. THE COUNTERS RESET IF POWERED UP 
IN A HEXADECIMAL STATE, SO IF USED IN A STAND 
ALONE MODE THE COUNTERS WILL NOT LOCK UP INTO 
AN INVALID STATE. 



PIN 1 

CLK 



2 

/LDA 



3 
CEA 



4 
DO 



5 
Dl 



:PIN 6 
D2 



;PIN 11 
/OE 



7 
D3 



12 
QO 



13 
Ql 



9 
/LDB 



14 
Q2 



10 
GND 



15 
Q3 



;PIN 16 
Q4 



17 
Q5 



18 
Q6 



19 
Q7 



20 
VCC 



STRING 
STRING 



RESA 
RESB 



'Q5*Q7 + Q6*Q7' 
'Q1*Q3 + Q2*Q3' 



RESET FROM HEX 
STATES 



EQUATIONS 
/QO : = 



/Ql 



/02 



/Q3 



/Q4 



QO*/LDB*CEB 
RESB 

/DO*LDB*/CEB 
/QO*/CEB*/LDB 

Q3*/LDB*CEB 

Q0*Q1*/LDB*CEB 

/QO*/ Q1*/LDB*CEB 

RESB 

/D1*LDB*/CEB 

/Q1*/CEB*/LDB 

Q3*/LDB*CEB 

/Q0*/Q2*/LDB*CEB 

/Q1*/Q2*/LDB*CEB 

QO* Q1*Q2*/LDB*CEB 

RESB 

/D2*LDB*/CEB 

/Q2*/CEB*/LDB 

/Q3*/Q0*/LDB*CEB 

Q0*/Q2*/LDB*CEB 

/Q1*/Q3*/LDB*CEB 

RESB 

/D3*LDB*/CEB 

/Q3*/CEB*/LDB 

Q4*/LDA*CEA 
RESA 

/DO*LDA*/CEA 
/Q4*/CEA*/LDA 



COUNT EQUATION 

RESET FROM ILLEGAL STATE 

LOAD DO 

HOLD QO 

COUNT EQUATION 



RESET FROM 
LOAD Dl 
HOLD Ql 



ILLEGAL STATE 



COUNT EQUATION 



RESET FROM ILLEGAL STATE 
LOAD D2 
HOLD Q2 



COUNT EQUATION 



RESET FROM ILLEGAL STATE 
LOAD D3 
HOLD Q3 

COUNT EQUATION 

RESET FROM ILLEGAL STATE 

LOAD DO 

HOLD Q4 



Figure 23. Design Fiie for a Dual BCD Counter 
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/Qb 



/Q6 



/07 



Q7*/LDA*CEA 

Q4*Q5*/LDA*CEA 

/Q4*/Q5*/LDA*CEA 

RESA 

/D1*LDA*/CEA 

/Q5*/CEA*/LDA 

Q7*/LDA*CEA 

/Q4*/Q6*/LDA*CEA 

/Q5*/Q6*/LDA*CEA 

Q4*Q5*06*/LDA*CEA 

RESA 

/D2*LDA*/CEA 

/Q6*/CEA*/LDA 

/Q7*/Q4*/LDA*CEA 

Q4*/Q6*/LDA*CEA 

/Q5VQ7*/LDA*CEA 

RESA 

/D3*LDA*/CEA 

/Q7*/CEA*/LDA 



COUNT EQUATION 



RESET FROM ILLEGAL STATE 
LOAD Dl 
HOLD Q5 



COUNT EQUATION 



RESET FROM ILLEGAL STATE 
LOAD D2 
HOLD Q6 



COUNT EQUATION 



RESET FROM ILLEGAL STATE 
LOAD D3 
HOLD Q7 



SIMULATION 

TRACE_ON CLK Q4 Q5 Q6 Q7 
QO Ql Q2 Q3 LDA 
LDB DO Dl D2 D3 
CEA CEB OE 

;PRLDF QO Ql Q2 Q3 Q4 Q5 Q6 Q7 

SETF /CLK OE 

CLOCKF CLK 

SETF /CLK OE DO Dl D2 /D3 

SETF LDA LDB /CEA /CEB 

CLOCKF CLK 

SETF /LDB /LDA 

CLOCKF CLK 

SETF CEA /LDA 

FOR I := 1 TO 10 DO 

BEGIN CLOCKF CLK 

END 

SETF /CEA CEB /LDB 

FOR I := 1 TO 10 DO 

BEGIN CLOCKF CLK 

END 

SETF /CEB LDB /DO Dl /D2 D3 

CLOCKF CLK 

SETF /LDB 

CLOCKF CLK 

SETF LDB DO Dl /D2 D3 

CLOCKF CLK 

SETF /LDB 

CLOCKF CLK 

SETF LDB /DO /Dl D2 D3 

CLOCKF CLK 

SETF /LDB 

CLOCKF CLK 

SETF LDB DO /Dl D2 D3 

CLOCKF CLK 

SETF /LDB 

CLOCKF CLK 

SETF LDB /DO Dl D2 D3 

CLOCKF CLK 

SETF /LDB 

CLOCKF CLK 

SETF LDB DO Dl D2 D3 

CLOCKF CLK 

SETF /LDB 

CLOCKF CLK 

TRACE OFF 



TRACE ALL SIGNALS 



SET DATA INPUTS SELECT 
REGISTERS QO - Q3 PERFORM 
SYNCHRONOUS LOAD. 
SELECT REGISTERS Q4 - Q7 
PERFORM SYNCHRONOUS LOAD. 
ENABLE Q4 - Q7 AS A BCD 
COUNTER, CLOCK FOR 10 
CLOCK CYCLES 

ENABLE QO - Q3 AS A BCD 
COUNTER, CLOCK FOR 10 
CYCLES 

LOAD ILLEGAL STATES 
TO TEST COUNTER RESET 



Figure 23. Design File for a Dual BCD Counter (Cont'd.) 
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Modulo Counter Device Selection Considerations 

We have illustrated a counter whiich counts from zero to a fixed 
modulus. The same technique can be applied for a counter which 
counts down from a maximum power-of-two number to a fixed 
modulus, or even a counter which counts from one modulus to 
another. The important considerations will be the number of 
product terms used. 

The registered PLDs used for modulo counters are similar to the 
ones selected for other counters. Since the counts used are 
binary, devices with J-K or T-type flip-flops or XOR gates will help 
optimize the number of product terms used. The product term 
usage also depends upon the modulus selected. Generally a 
power-of-two or a multiple-of-two modulus will require fewer 
product terms. 



Another factor for flip-flop selection is the illegal states. D-type 
flip-flops are generally better suited for illegal state recovery than 
the J-l< or T-type flip-flops. This is because when no product term 
is asserted, the D-type flip-flops resetto zero. Designers using J- 
K or T-type flip-flops must design-in illegal state recovery. 

Certain devices allow the use of a synchronous RESET product 
term for modulo counters. The idea is to use a minimal number 
of product terms to build a binary counter which counts up to a 
power-of-two number. However, this counter is RESET to zero 
using the synchronous RESET product term when the desired 
modulus is reached. It then begins counting afreshfrom zero, and 
the procedure is repeated. Similar operation can also be 
achieved with a synchronous PRESET product term for a down 
counter. Figure 24 shows a table of all the PLDs that provide the 
RESET and PRESET product terms. 



DEVICE 


TOTAL 
NUMBER 
OF PiNS 


DEDICATED 
INPUT 
PINS 


DEDICATED 

REGISTERED 

OUTPUT 

PINS 


I/O 
PINS 


LOGICAL 

PRODUCT 

TERMS/ 

OUTPUT 


SPEED GRADES 
FREQUENCY IN 

MHz 

'max 


TTL 


16RA8 
2338 


20 
20 


8 
9 


0-8 

4 


8-0 
4 


4 
8-12 


20 
33.3, 28.5 


20RA10 
22RX8 
22V10 
32VX10 


24 
24 
24 
24 


10 
14 
12 
12 


0-10 
0-8 
0-10 
0-10 


10-0 
8-0 
10-0 
10-0 


4 

8 
8-16 
8-16 


33,20 

28.5 

40,28.5,18 

25, 22.2 


CMOS 


C22V10 


24 


12 


0-10 


10-0 


8-16 


33.3, 20 


G29M16 
C29MA16 


24 
24 


5 
5 


- 


16 
16 


8-16 
4-12 


20, 15 
20, 15 


ECL 


10H20EV/EG8 
10020EV/EG8 


24 
24 


12 
12 


0-8 
0-8 


8-0 
8-0 


8-12 125 
8-12 125 



Figure 24. PLDs Which Provide RESET or PRESET Product Terms 
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Using synchronous RESET and PRESET product terms allows 
the counter to recover from illegal states. Notice that the logic 
product terms in the counter are designed for a complete binary 
count. If the counter powers up in any illegal state (as shown in 
Figure 25), it will continue the count until the terminal count and 
return to zero, where the correct modulo count will begin. This 
illegal state recovery will take an unpredictable number of clock 
cycles, and you may wish to design a more systematic recovery 
system. 

Cascading Modulo Counters 

For large modulo counters, the technique of generating Boolean 
equations from the truth tables is very tedious and time consum- 
ing. Another approach for designing modulo counters is to divide 
it into two smaller modulo counters. In addition to simplifying the 
design, this approach usually helps optimize the number of 
product terms. 



As an example, a modulo-360 counter can be directly imple- 
mented with nine register bits. However, instead of implementing 
this as a straight 9-bit counter, we can implement this as two 
counters: one four-bit counter (counting from zero to 14) and 
another five-bit counter (counting from zero to 23). Together the 
two counters count up to 360. Shown in Figure 26 the terminal 
count output MOUT is asserted when the count reaches 360. 

The design requires nine inputs, nine outputs, one clock pin, one 
LOAD pin, one RESET and one MOUT (module output signal) 
pin. Note that no extra flip-flops or pins were needed. Obviously, 
the count values of this counter are not the same as a straight 
modulo-360 counter. Actually, this is what contributes to the 
optimization of the number of product terms used. The PLPL 
design file of this counter is shown in Figure 27. 



SYNCHRONOUS 
RESET PRODUCT 
TERM ASSERTED 




Figure 25. A BCD Counter Using Synchronous RESET Product Term 



Q24 (4) (3) (2) (1) (0) 
i A i A A 



A (8:4) 

LOAD 

RST 

CLK 



-r'-^ 



Q15 (3) (2) (1) (0) 
4 A A i 



MODULO 24 
5 BfTS 



A (3:0) — ■/■ — ► 



MODULO 15 
4 BITS 



Figure 26. A Modulo-360 Counter in PAL22V10 
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DEVICE MODULO 360 COUNTER (PAL22V10) 



CLK = 1 RST = 2 

LOAD = 3 

A[8:01 = 4:11,13 

Q24[4:0] = 18:14 

Q15[3:0] = 22:19 

M OUT = 23; "ASSERTED WHEN COUNT REACHES 360" 



BEGIN 



IF (RST) THEN ARESET( ) ; "COUNTER INITIALIZATION" 
IF (LOAD) THEN BEGIN 

Q24[4:0] := A[8:4] ; "INITIALIZATION COUNT" 
Q15(3:0] := A[3:0] ; "INITIALIZATION COUNT" 
END ; 
ELSE BEGIN 

IF (Q24[4]*/Q24[3]* Q24[2]* 024[1]* Q24[0]* 

Q15[3]* Q15[2]* Q15[l]*/Q15[0)) THEN M_OUT = 1; 
CASE (Q24[4:0]) BEGIN "MODULO 24" 
#B00000) BEGIN 

Q15[3:0] := Q15[3:01 ; 
Q24[4:0] := #B00001 ; 
END ; 

#800001) BEGIN 

Q15[3:0] := Q15[3:0] ; 

Q24[4:0] := #800010 ; 
END ; 

#800010) BEGIN 

Q15[3:0] := Q15[3:0] ; 

Q24[4:0] := #800011 ; 
END ; 

#800011) BEGIN 

O15[3:0] := Q15[3:0] ; 

Q24[4:0] := #800100 ; 
END ; 

#800100) BEGIN 

Q15[3:0] := Q15[3:0] ; 

Q24[4:0] := #800101 ; 
END ; 

#800101) BEGIN 

Q15[3:0] := Q15[3:0] ; 

Q24[4:0] := #800110 ; 
END ,- 

#800110) BEGIN 

Q15[3:0] := Q15[3:0) ; 

024(4:0] := #800111 ; 
END ; 

#800111) BEGIN 

Q15[3:0] := Q15[3:0] ; 

Q24[4:0] := #801000 ; 
END ; 

#801000) BEGIN 

Q15[3:0] := Q15[3:0] ; 

Q24[4:0] := #801001 ; 
END ; 

#801001) BEGIN 

Q15[3:0] := Q15[3:0] ; 

Q24[4:0] := #801010 ; 
END ; 

#801010) BEGIN 

Q15[3:0] := Q15[3:0] ; 

Q24[4:0] := #801011 ; 
END ; 

#801011) BEGIN 

Q15[3:0] := Q15[3:0] ; 

Q24[4:0] := #801100 ; 
END ; 

Figure 27. Design Fiie of a IVIoduio-360 Counter 
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#B01100) BEGIN 

Q15[3;0] := Q15[3:0] 
Q24[4:0] := #801101 ; 

END ; 



#801101) 8EGIN 

Q15[3:0] 
Q24[4:0] 

END : 



015(3:0] 
#801110 i 



#801110) 8EGIN 

Q15[3:0] := Q15[3:0] 
Q24[4:0) := #801111 ; 

END ; 

#801111) BEGIN 

Q15[3:0] := Q15[3:0] 
Q24[4:0] := #810000 ; 

END ; 

#810000) BEGIN 

Q15[3:0] := Q15[3:0] 
Q24[4:0] := #810001 ; 

END ; 



#810001) BEGIN 

Q15[3:0] 
Q24[4:0] 

END ; 



!= Q15[3:0] 
!= #810010 



#810010) BEGIN 

Q15[3:0] 
Q24[4:0] 

END ; 



:= Q15[3:01 
:= #810011 ; 



#810011) BEGIN 

Q15[3:0] 
Q24[4:0] 

END ; 



:= Q15[3:0] 
!= #810100 , 



#810100) BEGIN 

Q15[3:0] 
Q24[4:0] 

END ; 



Q15[3:0] 
#810101 ; 



#810101) BEGIN 

Q15[3:0] := Q15[3:0] 
Q24[4:0] := #810110 ; 

END ; 

#810110) BEGIN 

Q15[3:0] := Q15[3:0] 
Q24[4:0] := #810111 ; 

END ; 



#810111) BEGIN 

Q24[3:0) := #800000 ; 

CASE (Q15[3:0] BEGIN "MODULO 15" 

#80000) Q15[3:0] := #80001 

#80001) Q15[3:0] := #80010 

#80010) Q15[3:0] := #80011 

#80011) Q15[3:0] := #80100 

#80100) Q15[3:01 := #80101 

#80101) Q15[3:0] := #80110 

#80110) Q15[3:0) := #80111 

#80111) Q15[3:0] := #81000 

#81000) Q15[3:0] := #81001 

#81001) Q15[3:01 := #81010 

#81010) Q15[3:0] := #81011 

#81011) Q15[3:0] := #81100 

#81100) Q15[3:0] := #81111 

#81101) Q15[3:0] := #81110 

#81110) Q15[3:0] := #80000 



Figure 27. Design File of a Modulo-360 Counter (Cont'd.) 
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TEST_VECTORS 

IN CLK RST LOAD A[8:0] ; 



OUT 
BEGIN 



Q24[4:0] Q15[3:0] M OUT ; 



"CLK RST LOAD 



X 
C 
C 
C 
C 
C 
C 
C 
C 
C 
C 
C 
C 
C 
C 
C 
C 
END. 



A[f 
X X 
1 
X X 
X X 
X X 



:0] 
X X 



X X 

X X 



Q24[4 
L L L 
L H 
L H 
L H 
L L 



H H 
L L 



:0] 
L L 
L H 
H L 
H H 
L L 
H H 
L L 
L H 
H L 
H H 
L L 
L H 
H L 
H H 
L L 
L H 
H L 



Q15[3:0] 
L L L L 
H H L 
H H L 
H H L 
H H L 
H H H 
H H H 
H H H 
H H H 
H H H 
H H H 
H H H 
H H H 
H H H 
L L L 
L L L 
L L L 



M_OUT" 
L; 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
H 
L 
L 
L 



Figure 27. Design File of a Modulo-360 Counter (Cont'd.) 



PLS Devices for Modulo Counters 



Monolithic Memories offers a family of PLS devices. PLS devices 
provide programmable AND and programmable OR arrays. 
These devices offer S-R type output and buried flip-flops. They 



also provide a complement array term which is very useful for 
designing modulo counters. The available PLS devices are listed 
in Figure 28. 



DEVICE 


TOTAL 

NUMBER 
OF PINS 


DEDICATED 
INPUT 
PINS 


DEDICATED 

REGISTERED 

OUTPUT 

PINS 


I/O 
PINS 


LOGICAL 

PRODUCT 

TERMS/ 

OUTPUT 


SPEED GRADES 

FREQUENCY IN 

MHz 

MAX 


TTL 


PLS105 
PLS167 
PLS168 


28 
24 
24 


16 
14 
12 


8 
6 
8 


" 


Total 48 
Total 48 
Total 48 


37 
33 

33 



Figure 28. PLS Devices 
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The complement array is an OR array term, the complement of 
which is fed back as an input to the AND array of the device 
(Figure 29). The complement term can be used for both illegal 
state recovery and programmable modulo counter designs. 

Illegal State Recovery 

For any counter going through a sequence of legal counts 
(stales), at least one product term is always active as long as the 
counter never needs to hold a count; such terms determine the 
next legal state. When the counter powers up in any invalid state, 
none of the product terms for deciding the next state will be active. 
When none of the product terms connected to the complement 
array is active, the complement array is asserted. 

The complement array is usually connected to all of the product 
terms of the counter. Thus when the count is legal, the comple- 
ment array output is always LOW (Figure 30). When the counter 
is in an illegal state, the complement array detects it and is 



asserted HIGH. This signal can then be used to reset the registers 
to some legal state. One advantage of the complement array is 
that it allows recovery to any state predetermined by the designer. 

Programmable Modulus Counter 

The complement array term can instead be used for designing 
programmable modulus counters. Acounter is implemented with 
the device registers. Different product terms are programmed 
with different modulus values. These product terms are under the 
control of input signals. At any one time only one of the product 
terms is kept active. The complement array is connected to all 
such terminal count product terms. Since all of these product 
terms are LOW the complement array is initially HIGH and is used 
as a count enable for the counter. When the count reaches the 
terminal value the complement array term goes LOW disabling 
the normal count product terms. Simultaneously the terminal 
count product term becomes active, resetting the counter to 
begin counting afresh. 



DEVICE 
INPUTS 
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Figure 29. The Architecture of the Complement Array Term 
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Figure 30. Complement Array for Illegal State Recovery 



DEVICE 
INPUTS 






STATE (COUNT) 



PRODUCT TERMS 
WITH DIFFERENT 
COUNT VALUES' 
PROGRAMMED 



fcrhTTTOO^ 



7" 



7" 



NORMAL COUNT 
PRODUCT TERMS 



COUNT ENABLE GOES LOW 
WHEN MODULUS IS REACHED 



A 



A 



LOW WHEN COUNTING, 
HIGH AT MODULUS 



■o 



- S 
R 



H>- 



DEVICE 
OUTPUTS 



S 

R Q 



STATE 
REGISTERS 



Figure 31. Programmable Modulo Counter Using Complement Array Term 
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Counters witli Encoding 



Until now we have discussed counters which generate binary 
output sequences. Most peripherals require a predetermined 
sequence of control signals. Custom control sequences can be 
generated by decoding the binary sequence with combinatorial 
logic. Figure 32 shows a general model of a counter with 



combinatorial output decoding circuitry. This combinatorial circuit 
modifies the counter bits and generates output signals in the 
manner required for peripheral timing and control. Since these 
circuits require extra combinatorial logic, they are not very effi- 
cient. They are also more susceptible to hazards and output 
glitches. 



LOGIC 
/ 




CLOCK 
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• 
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• 
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Figure 32. Counter with an Output Decoder 



It is possible to have a different output coding for a four-bit 
counter, as shown in Figure 33. This code, called Gray code, 
allows only one output bit to toggle for each new count value. This 
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code can be easily derived from a four-bit binary counter code 
(also shown in Figure 33) using an output decoder. 

Wecan derive the Boolean equationsforthe combinatorial output 
decoder from the truth table. The equations are: 

G3 ° X3 

G2 = X3 :+: X2 

Gl = X2 :+: XI 

GO = XI :+: XO 

A more efficient and easier technique for generating control 
signals is to implement the decode circuitry before the registers. 
This alternative is shown in Figure 34. This essentially generates 
a non-standard counter with state values which are not a binary 
progression. It can also be considered to be a counter where the 
product terms for a binary count and encoding the outputs have 
been combined. 

Many different codes can be generated using such techniques. 
We will limit ourselves to the ones which are most commonly 
used: Gray-code counters and Johnson counters. 



Figure 33. Generating Gray Code from a Binary Code 
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Figure 34. Counter with Combined Next State Generation and Output Encoding Circuit 
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Gray-Code Counters 



The Boolean logic equations for a Gray-code counter are: 



Gray-code counters are often used in digital designs for control 
timing functions. The primary advantage of Gray-code counters 
stems from the characteristic that only one output bit changes 
value for every clock cycle. These output signals can then easily 
be decoded using a combinatorial decoder without any risk of 
hazards. Gray-code counters are used extensively as system 
clocks, since the different output bits provide different clock 
phases without the risks of hazards. Gray-code is also used in 
high-speed data communication applications, where data is 
transmitted from one part of the system to another, and where the 
error susceptibility increases with the number of bit changes 
between adjacent numbers in a sequence. These are also used 
for such specialized applications as shaft encoders and real-time 
process control. 

The implementation of a Gray-code counter is very simple. A 
truth table can be derived from the transition table as is done for 
a binary counter. The Boolean equations can then be directly 
derived from the truth table. The truth table for the Gray-code 
counter is shown in Figure 35. 
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Figure 35. Truth Table for a Four-bit Gray-code Counter 
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Johnson Counters 

A Johnson counter is part of a family of counters known as "ring 
counters." These counters are used for speciai appiications 
where code symmetry is desired. Ring counters are also often 
used for timing purposes, since all of the outputs are essentially 
a series of pulses. This code symmetry also allows use of the 
fewest possible product terms with a D-type register. Devices 
such as the Logic Cell Array (LCA), which provide a small amount 
of logic per cell, can implement Johnson counters very easily. 

Johnson counters are also known as circular-shift counters. The 
sequence for af ive-stage Johnson counter is shown in Figure 36. 
As can be seen in the truth table, the counter first fills up with 1 's 
from left to right and then it fills up with zeros again. Note from the 
output sequence that only one of the Johnson counter bits 
changes for every clock period, like the Gray-code counter. One 
major advantage of the Johnson counter is that it can be readily 
decoded with small two-input NAND gates and hence is suitable 
for high-speed applications. 

Note that the five-stage sequence has a table of 10 legal states 
and 22 illegal states (Figure 37). In general, an n-bit Johnson 
counter will produce a modulus of 2n. Figure 38 shows the state 
diagram of the five-bit counter. 

Legal States 



PRESENT STATE 


NEXT STATE 


Q4 Q3 Q2 Q1 
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Figure 36. Five-bit Johnson Counter Truth Table 



Illegal States 
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Figure 37. Illegal States for a Five-bit Johnson Counter 




Figure 38. State Diagram of a Five-bit Johnson Counter 



The implementation of a Johnson counter is relatively straight- 
forward, and is the same regardless of the number of stages. 
When D-type flip-flops are used, the Q output of each flip-flop is 
connected to tlie D input of the following stage. The single 
exception is theQ output of the last stage, which is complemented 
and connected to the D input of the first stage. 



One disadvantage of the counter is the number of invalid (or 
illegal) states. The invalid states increase exponentially with the 
length of the counter. The bigger the counter becomes, the 
greater are its chances of entering an illegal state. Johnson 
counters are very susceptible to illegal states, and can "hang up" 
very easily. Noise or improper use can cause this counter to end 
up in an illegal state. Therefore a design with illegal state recovery 
circuitry is always recommended. 
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Figure 39 shows a nine-bit Johnson counter which can be derived 
by directly extending the design of afive-bit Johnson counter. The 
design file adds a LOAD function; it requires nine input pins, nine 
output pins, one RESETpin, one LOAD pin, one invalid status pin 



and one clock pin. The design incorporates the product terms 
required for illegal state recovery. The PLPL design file for this 
counter is shown in Figure 40. 




DEVICE 



Figure 39. Block Diagram of a NIno-bIt Johnson Counter 

NINE_BIT_JOHNSON_COUNTER ( AniPAL22V10) 



PIN CLK = 


1 


RST = 


2 


A[8:0] 


= 3:11 


Q[8:0] 


= 14:21,23 


VF 


2 2 "VALID STATE FLAG' 


LOAD = 


13 ; 



IF (RST) THEN ARESET(Q[ 8 :0 ] 


; 


"COUNTER IN 


IF (LOAD) THEN Q[8:0] := 


A(8:0] 


"LOAD NEW V 


ELSE BEGIN 






CASE {Q[8:0]) 






BEGIN 






#BOO00000OO) 


Q[8 


01 


= #8100000000 


#B100000000) 


Q18 


01 


= #8110000000 


#B110000000) 


Q18 


0) 


= #8111000000 


#B111000000) 


Q18 


OJ 


= #8111100000 


#6111100000) 


Q18 


01 


= #8111110000 


#B111110000) 


Q18 


01 


= #8111111000 


#B111111000) 


Q[8 


0] 


= #8111111100 


#8111111100) 


Q18 


01 


= #8111111110 


#B111111110) 


Q18 


01 


= #8111111111 


#B111111111) 


Q18 


01 


= #8011111111 


#8011111111) 


Q18 


01 


= #8001111111 


#8001111111) 


Q[8 


01 


= #8000111111 


#8000111111) 


Q[8 


01 


= #8000011111 


#B000011111) 


Q18 


01 


= #8000001111 


#8000001111) 


Q18 


01 


= #6000000111 


#8000000111) 


Q18 


01 


= #8000000011 


#8000000011) 


Q18 


01 


= #8000000001 


#8000000001) 


Q[B 


01 


= #6000000000 


END ; 






END ; 






CASE (Q[8:0]) 






BEGIN 






#8000000000) 


VF = 1 




#8100000000) 


VF = 1 




#B110000000) 


VF = 1 




#8111000000) 


VF = 1 




#8111100000) 


VF = 1 




#8111110000) 


VF = 1 




#8111111000) 


VF = 1 




#8111111100) 


VF = 1 




#8111111110) 


VF = 1 




#8111111111) 


VF = 1 




#8011111111) 


VF = 1 




#8001111111) 


VF = 1 




#8000111111) 


VF = 1 




#8000011111) 


VF = 1 




#8000001111) 


VF = 1 




#8000000111) 


VF = 1 




#8000000011) 


VF = 1 




#8000000001) 


VF = 1 




END ; 









END. 



Figure 40. Design Fllo for a NIno-bIt Johnson Counter 
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TEST VECTORS 
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CLK RST LOAD A[8:01; 
Q[8:0] VF; 



AAAAAAAAA 
876543210 



X X 
X X 



X X 

X X 



X X 

1 1 

X X 

X X 

X X 

1 

X X 

X X 



1 1 

X X 

X X 



XXX 



QQQQQQQQO 
876543210 



L L 
H H 



H H 

H H 

L H 

L L 

L L 



L L L 

H H H 

H H H 

H H H 

L H H 

H L H 

L L L 

L L L 

H H H 

H H H 

H H H 

L L L 

L L L 



L L L L 

H H H H 

H H H H 

H H H H 

H H H H 

L H L H 

L L L L 

L L L L 

H H H L 

H H H H 

H H H H 

L L L H 

L L L L 



V" 
F" 

H 
H 
H 
H 
H 
L 
H 
H 
H 
H 
H 
H 
H 



"RESET" 

"LOAD" 

"COUNT" 

"COUNT" 

"LD INVALID ST" 

"INVALID ST" 

"RECOVERY" 

"LOAD" 

"COUNT" 



Figure 40. Design File for a Nine-bit Johnson Counter (Cont'd.) 



Shift Registers 



A Shift Register is a special digital circuit often used as a primary 
building block in digital computer systems, it is closely related to 
a ring counter, its fundamental usage is for temporary data 
storage and bit-wise data manipulation for advanced arithmetic 
and multiplication operations. Shift registers are also frequently 
used in communications, for converting parallel byte-wide data 
from the microprocessor to a serial data bit-stream for transmis- 
sion. An example of a serial data link controller on page 2-453 
shows such a shift register design. Shift registers are also used 
in graphics systems for serializing parallel data for use by the 
display monitor. A number of examples of video shift registers are 
included in the graphics section (page 2-257). 

The fundamental purpose of a shift register (Figure 41 ) is to shift 
data from one flip-flop to another. There are several types of shift 
registers. They are classified by the way in which incoming data 
is received (parallel or serial), and how outgoing data is transmit- 
ted (parallel or serial). 



In the following example, we will discuss a simple universal shifter 
which provides both serial and parallel input and output functions. 
Depending upon the control signals 10 and 11, the data is shifted 
from one flip-flop to another in the left or the right direction. These 
inputs also control when the new parallel data is loaded onto the 
registers. When shifting left or right, serial data can be received 
and transmitted on serial pins LIRO and RILO. Since the flip-flop 
outputs appear on the output pins at all times, the parallel output 
data is always available. The truth table is shown in Figure 42. 

The Boolean logic equations can be directly derived from the truth 
table, and are shown in the PAL device design file in Figure 43. 

Shift registers can be modified to suit various system design 
requirements. This universal shift register can be used for serial 
in/serial out, parallel in/parallel out, serial in/parallel out and 
parallel in/serial out functions. 



CLOCK 



CONTROL 
SIGNALS ■ 



LEFT SERIAL DATA 
IN AND OUT (LIRO) 



PARALLEL DATA IN 
Do D, Dj D3 D4 D5 Dg D7 

U-Lii-LLUi 




TTTTTTTT 

Qo Qi O2 Q3 Q4 Q5 Qs Q7 
PARALLEL DATA OUT 



RIGHT SERIAL DATA 
OUT AND IN (RILO) 



Figure 41. A Shift Register Biocl< Diagram 
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Q7 


Q6 


Q5 


Q4 


Q3 


Q2 


Q1 


QO 


11 


10 




Q7 


Q6 


Q5 


Q4 


Q3 


Q2 


Q1 


QO 








jRetain Data 


RILO 


Q7 


Q6 


Q5 


Q4 


Q3 


Q2 


Q1 





1 


;Shifl Right 


Q6 


Q5 


Q4 


Q3 


Q2 


Q1 


QO 


LIRO 


1 





;Shift Left 


D7 


D6 


D5 


D4 


D3 


D2 


D1 


DO 


1 


1 


;Load Data 



Figure 42. The Truth Table for a Universal Shift Register 

TITLE UNIVERSAL_SHIFT_REGISTER 

PATTERN SHIFT. PDS 

REVISION 01 

AUTHOR JOE ENGINEER 

COMPANY MONOLITHIC MEMORIES, INC. 

DATE 09/12/87 

CHIP UNIV_SHI PAL20X8 

CLK 10 DO Dl 02 D3 D4 D5 06 07 II GND 
/OC RILO Q7 Q6 Q5 Q4 Q3 Q2 Ql QO LIRO VCC 



;DESCRIPTION 

THIS PAL DEVICE IS AN 8-BIT SHIFT REGISTER WITH PARALLEL LOAD AND 
HOLD CAPABILITY. TWO FUNCTION SELECT INPUTS (10,11) PROVIDE ONE OF 
FOUR OPERATIONS WHICH OCCUR SYNCHRONOUSLY ON THE RISING EDGE OF THE 
CLOCK (CLK). THESE OPERATIONS ARE: 



/OC CLK II 10 07-DO Q7-Q0 



OPERATION 



H 


X 


X 


X 


X 


Z 


HI-Z 


L 


C 


L 


L 


X 


L 


HOLD 


L 


c 


L 


H 


X 


SR(Q) 


SHIFT RIGHT 


L 


c 


H 


L 


X 


SL(Q) 


SHIFT LEFT 


L 


c 


H 


H 


D 





LOAD 



TWO OR MORE OCTAL SHIFT REGISTERS MAY BE CASCADED TO PROVIDE LARGER 
SHIFT REGISTERS. RILO AND LIRO ARE LOCATED ON PINS 14 AND 23 
RESPECTIVELY, WHICH PROVIDES FOR CONVENIENT INTERCONNECTIONS WHEN 
TWO OR MORE OCTAL SHIFT REGISTERS ARE CASCADED TO IMPLEMENT LARGER 
SHIFT REGISTERS. 



EQUATIONS 

/QO := /IIVIOVQO 

+ /II* lOVQl 
:+: I1*/IO*/LIRO 
+ II* IO*/DO 

/Ql := /IIVIOVQI 

+ /II* I0*/Q2 

:+: I1*/I0*/Q0 

+ II* I0*/D1 

/Q2 := /Il*/I0*/Q2 

+ /II* I0*/Q3 

:+: I1*/I0*/Q1 

+ II* I0*/D2 

/Q3 := /Il*/I0*/Q3 

+ /II* I0*/Q4 

:+: Il*/I0*/Q2 

+ II* I0*/D3 

/Q4 := /Il*/I0*/Q4 

+ /II* I0*/Q5 

:+: Il*/I0*/O3 

+ II* I0*/D4 



/Qb 



/Il*/I0*/Q5 

/II* I0*/Q6 

Il*/I0*/Q4 

II* I0*/D5 



HOLD QO 
SHIFT RIGHT 
SHIFT LEFT 
LOAD DO 

HOLD Ql 
SHIFT RIGHT 
SHIFT LEFT 
LOAD 01 

HOLD Q2 
SHIFT RIGHT 
SHIFT LEFT 
LOAD 02 

HOLD Q3 
SHIFT RIGHT 
SHIFT LEFT 
LOAD 03 

HOLD Q4 
SHIFT RIGHT 
SHIFT LEFT 
LOAD 04 

HOLD Q5 
SHIFT RIGHT 
SHIFT LEFT 
LOAD 05 



Figure 43. Octal Shift Register Design File 
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/Q6 := /11*/I0*/Q6 

+ /II* I0*/Q7 

:+: Il*/I0*/Q5 

+ II* I0*/D6 

/Q7 := /I1VI0VQ7 

+ /II* I0*/RILO 
:+: Il*/I0*/Q6 
+ II* I0*/D7 



/LIRO 
LIRO.TRST 



/QO 
/I1*I0 



/RILO = /Q7 
RILO.TRST = I1*/I0 



HOLD Q6 
SHIFT RIGHT 
SHIFT LEFT 
LOAD D6 

HOLD Q7 
SHIFT RIGHT 
SHIFT LEFT 
LOAD 07 

iLEFT IN RIGHT OUT 



; RIGHT IN LEFT OUT 



SIMULATION NOT SHOWN HERE 

Figure 43. Octal Shift Register Design File (Cont'd.) 



Barrel Shifters 

In most data processing systems, some form of data sliifting or 
rotation is necessary. In typical computer systems, \he shifter is 
located at the output of the ALU, and usually requires a single- 
cycle shift and add function (Figure 44). For such applications as 
floating-point arithmetic or string manipulation, ordinary shift 
registers are inefficient, since they require n clock cycles for an n- 
bit shift. 



INPUT DATA BUS 




OUTPUT DATA BUS 

Figure 44. Typical ALU Architecture 



> 



A specialized shift register, called a "barrel shifter," is used to shift 
(or rotate) data by any number of bits in a single clock cycle. The 
name "barrel shifter" is used because of the circular nature of the 
shift operation. The storage registers on the output of the shifter 
are used in this architecture to pipeline the data operation, 
increasing throughput. The three-state buffer on the output 
registers is also useful for providing an interface to the data bus. 

The design of a barrel shifter proceeds in the same manner as a 
regular shift register. The truth table is drawn, and the Boolean 
equations are then written based upon the truth tables. An eight- 
bit barrel shifter requires at least eight data inputs, eight regis- 
tered data outputs, three control lines to specify the shift distance, 
a clock input and an output enable that controls the three-state 
buffer on the register output. 

Figure 45 shows the block diagram for an eight-bit registered 
barrel shifter, while Figure 46 shows the truth table. The regis- 
tered barrel shifter requires a total of 1 4 inputs and 8 outputs. This 
function can be easily implemented in an AmPAL22V10. Figure 
47 shows the PLPL design file for this eight-bit barrel shifter 
function. 



D7 De D5 D4 D3 Dj Di Do 

i 1 i i 1 i i 1 



So ► 

Si — * 

82— ► 

CLK ► 


AmPAL22V10 


4 RST 

OE 




i i i i i i i i 
Q7 Qe Q5 O4 Q3 Q2 Qi Oo 





Figure 45. Block Diagram of an Eight-bit Barrel Shifter 
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S2 


SI 


so 


Q7 


Q5 


Q5 


Q4 


Q3 


Q2 


Q1 


QO 











D7 


D6 


D5 


D4 


D3 


D2 


D1 


DO 








1 


D6 


D5 


D4 


D3 


D2 


D1 


DO 


D7 





1 





D5 


D4 


03 


D2 


D1 


DO 


D7 


D6 





1 


1 


D4 


D3 


D2 


D1 


DO 


D7 


D6 


D5 


1 








D3 


D2 


D1 


DO 


D7 


D6 


D5 


D4 


1 





1 


D2 


D1 


DO 


D7 


D6 


D5 


D4 


D3 


1 


1 





D1 


DO 


D7 


D6 


D5 


D4 


D3 


D2 


1 


1 


1 


DO 


D7 


D6 


D5 


D4 


D3 


D2 


D1 



Figure 46. Truth Table for an Eight-bit Barrel Shifter 

DEVICE EIGHT_BIT_BARREL_SHIFTER (PAL22V10) 



CLK 

D [7:0] 

SEL2 

OE 



1 

2:9 
13 
14 



/Q[7:0] = 23:16 
SELl = 11 SELO 
RSY = 15; 



BEGIN 

IF (RST) THEN ARESET( ) ; "SHIFTER INITIAL 

IF (OE) THEN ENABLE( 0(7:0] ) ; "ENABLE OUTPUTS" 

/SELO) THEN Q[7:0] 

SELO) THEN Q[7:0] 

/SELO) THEN Q[7:0] 

SELO) THEN Q[7:0] 

/SELO) THEN Q[7:0] 

SELO) THEN Q[7:0] 

/SELO) THEN Q[7:0] 

SELO) THEN Q[7:0] 

END. 



IF 


(/SEL2 


* 


/SELl 


IF 


(/SEL2 


* 


/SELl 


IF 


(/SEL2 


* 


SELl 


IF 


(/SEL2 


* 


SELl 


IF 


( SEL2 


* 


/SELl 


IF 


( SEL2 


* 


/SELl 


IF 


( SEL2 


* 


SELl 


IF 


( SEL2 


A 


SELl 



TIALIZATION' 




D[7 


0]; 






D[6 


0], 


D[7 




D[5 


0], 


D[7 


6] 


D[4 


0], 


Dt7 


5] 


D(3 


0), 


D(7: 


4] 


D[2 


0], 


D[7: 


31 


Dll 


0], 


D[7: 


21 


D[0 


, D[7:l] 


; 



"FUNCTION TABLE SPECIFICATION" 

TEST_VECTORS 

IN CLK RST D[7:0] SEL2 SELl SELO; 

I_0 OE; 

OUT Q [ 7 : ] ; 

BEGIN 

"CLK RST D[7:0] SEL2 SELl SELO OE Q[7:0] 



C 


1 


00000000 


1 





1 





zzzzzzzz 


C 





11111111 





1 





1 


HHHHHHHH 


c 


1 


xxxxxxxx 


X 


X 


X 


1 


LLLLLLLL 


c 





xxxxxxxx 


X 


X 


X 





ZZZZZZZZ 


c 





00000000 











1 


LLLLLLLL 


c 





11111111 











1 


HHHHHHHH 


c 





01111111 


1 








1 


HHHHLHHH 


c 





01111111 





1 





1 


HHHHHHLH 


c 





01111111 


1 


1 





1 


HHLHHHHH 



Figure 47. Design Fiie for an Eight-bit Barrel Shifter 



Gray-code, Johnson Counter and Shift Register Device 
Selection Considerations 

Gray-code counters, Johnson counters and shift registers are not 
very logic-intensive; the number of product terms required is 
minimal. The D-type flip-flops provide the most efficient imple- 
mentations, allowing these designs to be easily implemented in 
most PAL devices. 



Since Gray-code counters are often used as system clocks, very 
high speed PAL devices provide the highest resolution clocks. 

Barrel shifters are very logic-intensive and require many product 
terms, since data from all the inputs needs to be accessible at any 
output. Registered PLDs with a large number of product terms 
are ideal for barrel shifters. Large barrel shifters can also be 
partitioned into a number of PLDs. 
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Asynchronous Registered Designs 

Until now we have discussed strictly synchronous registered 
designs, where a common system clock is used. In asynchro- 
nous registered designs, a common clock is not used. The 
register clock may be generated by the output of another register, 
or by a logical combination of various other signals. Such designs 
are usually slow for such applications as timing generation, 
because when the output of one register is used to clock another, 
multiple delays are encountered before all the register outputs 
stabilize. On the other hand, designs can be very fast for 
asynchronous applications such as bus arbitration and control, 
where a fast response to a bus signal can be provided without 
waiting for a common system clock. 

Although asynchronous designs are easier to visualize, they 
present larger problems in implementation. Combinatorial haz- 



ard conditions can cause false clocking of registers, destroying 
the logic intended by the designer. The designer also needs to 
worry about race conditions when clocking a number of registers 
simultaneously. Careful design analysis is strongly recom- 
mended before implementing any asynchronous design. 

Ripple counters are probably the easiest examples of such 
asynchronous designs. Figure 48 shows the logic diagram of a 
five-bit binary ripple counter. These counters clearly have the 
advantage of design simplicity. The output from one stage is fed 
as the clock to the next stage. However, this results in a slower 
counting rate, since the clock signals need to propagate through 
all five registers before the next count is reached. 

The ripple counter design illustrated below is implemented in a 
PAL20RA1 0. Figure 49 shows the design file for this counter. 



CK SET 




EXTRA CIRCUIT REQUIRED 
FOR MODULO 20 COUNTER 



Figure 48. A Five-bit Ripple Counter 



TITLE Dual 5-Bit Counter 

PATTERN Upcount . pds 

REVISION 01 

AUTHOR Joe Engineer 

COMPANY Monolithic Memories, Inc. 

DATE 11/12/87 

CHIP Upcount PAL20RA10 

PL CKA CKB ENA ENB NC NC NC NC NC INIT GND 
OE QB4 QB3 QB2 QBl QBO QA4 QA3 QA2 QAl QAO VCC 



EQUATIONS 






; The First 


Counter 




/QA4 


:= QA4 * /ENA 


; Toggle if 


IS zero 








+ /QA4 * ENA 


; Hold count 


QA4.CLKF 


= /QA3 


; Toggle 


QA4.RSTF 


= INIT 


; Initialize 


QA4.TRST 


= VCC 





/QA3 :■= QA3 * /ENA ; Toggle when QA2 zero 

+ /QA3 * ENA ; Hold count 

QA3.CLKF = /QA2 ; Toggle 

QA3.RSTF = INIT ; Initialize 
QA3.TRST = VCC 
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Figure 49. Design File for a Five-bit Ripple Counter 
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/QA2 


:- QA2 


* 


/ENA 


; Toggle when 


QAl zero 




+ /QA2 


* 


ENA 


; Hold count 




QA2.CLKF 


- /QAl 






; Toggle 




QA2.RSTF 


- INIT 






; Initialize 




QA2.TRST 


- VCC 










/QAl 


:- QAl 


* 


/ENA , 


; Toggle when 


QAO zero 




+ /QAl 


* 


ENA , 


; Hold count 




QAl.CLKF 


- /QAO 






; Toggle 




QAl .RSTF 


- INIT 






; Initialize 




QAl.TRST 


= VCC 










/QAO 


:= QAO 


* 


/ENA , 


; Toggle LSB 






+ /QAO 


* 


ENA 


; Hold count 




QAO.CLKF 


- CKA 






; Toggle 




QAO. RSTF 


= INIT 






; Initialize 




QAO. TEST 


= VCC 











; The Second Cotjnter 



/QB4 


:>= QB4 


* 


/ENB 


; Toggle if lower MSB is zero 




+ /QB4 


* 


ENB 


; Hold count 


QB4.CLKF 


= /QB3 






; Toggle 


QB4.RSTF 


- INIT 






; Initialize 


QB4.TRST 


- VCC 








/QB3 


:= QB3 


* 


/ENB 


; Toggle when QB2 zero 




+ /QB3 


* 


ENB 


; Hold count 


QB3.CLKF 


= /QB2 






; Toggle 


QB3.RSTF 


= INIT 






; Initialize 


QB3 . TRST 


=■ VCC 








/QB2 


:= QB2 


* 


/ENB 


; Toggle when QBl zero 




+ /QB2 


* 


ENB 


; Hold count 


QB2.CLKF 


= /QBl 






; Toggle 


QB2.RSTF 


= INIT 






; Initialize 


QB2.TRST 


= VCC 








/QBl 


:= QBl 


* 


/ENB , 


; Toggle when QBO zero 




+ /QBl 


* 


ENB 


; Hold count 


QBl.CLKF 


= /QBO 






; Toggle 


QBl. RSTF 


= INIT 






; Initialize 


QBl. TRST 


= VCC 








/QBO 


:° QBO 


* 


/ENB , 


; Toggle LSB 




+ /QBO 


* 


ENB , 


; Hold count 


QBO.CLKF 


= CKB 






; Toggle 


QBO. RSTF 


= INIT 






; Initialize 


QBO. TRST 


= VCC 








SIMULATION 











TRACE_ON /PL QA4 QA3 QA2 QAl QAO QB4 QB3 QB2 QBl QBO 

SETF PL INIT /CKA /CKB /OE 

SETF INIT CKA CKB 

SETF /INIT /CKA /CKB 

CHECK /QA4 /QA3 /QA2 /QAl /QAO 

CHECK /QB4 /QB3 /QB2 /QBl /QBO 

Figure 49. Design File for a Five-bit Ripple Counter (Cont'd.) 
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SETF CKA CKB 








CHECK /QA4 /QA3 


/QA2 


/QAl 


QAO 


CHECK /QB4 /QB3 


/QB2 


/QBl 


QBO 


SETF /CKA /CKB 








SETF CKA CKB 








CHECK /QA4 /QA3 


/QA2 


QAl 


/QAO 


CHECK /QB4 /QB3 


/QB2 


QBl 


/QBO 


SETF /CKA /CKB 








SETF CKA CKB 








CHECK /QA4 /QA3 


/QA2 


QAl 


QAO 


CHECK /QB4 /QB3 


/QB2 


QBl 


QBO 


SETF /CKA /CKB 









SETF CKA CKB 

CHECK /QA4 /QA3 QA2 /QAl /QAO 
CHECK /QB4 /QB3 QB2 /QBl /QBO 
SETF /CKA /CKB 



SETF CKA CKB 

CHECK /QA4 /QA3 QA2 /QAl 
CHECK /QB4 /QB3 QB2 /QBl 
SETF /CKA /CKB 



QAO 
QBO 



TRACE OFF 



Figure 49. Design File for a Five-bit Ripple Counter (Cont'd.) 



There are additional independent SET and RESET controls in the 
PAL20RA1 which can be used to restart the count. If a modulo 
counter is desired, these SET and RESET signals can be used as 
described above. Figure 48 shows the implementation of a 
moduio-20 counter which is RESET when output bits Q4 and Q2 
are both HIGH. Since the RESET is implemented with a product 
term, the extra AND gate shown can be implemented directly 
within the PAL device. 



Asynchronous Designs Device Selection Considerations 

The device selection for asynchronous designs is easy. As the 
clock signals require logic, only PLDs which allow implementation 
of Boolean logic on the clock signals are useful. Figure 50 shows 
a list of devices which provide programmable clock signals. They 
also provide programmable SET and RESET product terms, and 
the capability of bypassing the register. 



DEVICE 


TOTAL 
NUMBER 
OF PINS 


DEDICATED 
INPUT 
PINS 


DEDICATED 

REGISTERED 

OUTPUT 

PINS 


I/O 
PINS 


LOGICAL 

PRODUCT 

TERMS/ 

OUTPUT 


SPEED GRADES 

FREQUENCY IN 

MHz 

'max 


TTL 


16RA8 
20RA10 


20 
24 


8 
10 


0-8 
0-10 


8-0 
10-0 


4 
4 


20 
33,20 


CMOS 


C29MA16 


24 


5 


- 


16 


4-12 


20,15 



Figure 50. Programmable Logic Devices for Asynchronous Designs 
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Other Applications of 
Registered PLDs 

Registered PLDs are used for a number of miscellaneous appli- 
cations which are not covered by the synchronous and asynchro- 
nous design applications discussed up to now. In the subsequent 
discussion we will cover a few of these applications: 

• Frequency dividers 

• Addressable registers 

Frequency Dividers 

Standard synchronous counters provide the basic capability of 
dividing an input frequency. A single register of a PAL device will 
let us divide by two. If we stack these registers, a binary counter 
provides symmetrical division by 2,4,8,16, etc. This divider has 
been a standard for years, and the PAL device has always been 
an excellent choice for such applications. 

One unique application of PAL devices is for dividing input 
frequencies by odd numbers. This has been done historically by 
designing a counter which cycles modulo an odd number, and 
decoding the specific states of the counter. The disadvantage of 
this approach is that the output is not symmetrical and the duty 
cycle is not 50%. 

Let's examine a simple divide-by-five counter. This counter can 
be implemented using three flip-flops that start at zero and reset 
at four, resulting in a five-state counter. The table in Figure 51 
shows the outputs of the three individual flip-flops. 



Present State 


Next State 


Q2 Q1 QO 


Q2 


Q1 


QO 



1 
1 

1 1 

1 






1 






1 
1 





1 State zero to one. 

State one to two. 

1 State two to three. 
State three to four. 
State four to zero. 



Figure 51. Truth Table for a Five-bit Counter 

The Boolean equations are: 

Q2 := /Q2 * Ql * QO ;MSB bit 

Ql := /Ql * QO + Ql * /QO 

QO := /Q2 * /QO ;LSB bit 



The waveforms for this divider are shown in Figure 52. Notice that 
the Q2 output goes HIGH for one state and that this output is one- 
fifth of the input frequency, but it is a 20% duty cycle. Ql is active 
for two states; it provides the same frequency, but with a 40% duty 
cycle. If we want a 50% duty cycle we are going to have to divide 
a state in half. 

To provide the 50% duty cycle, the two edges should be evenly 
spaced in the count sequence, one edge in the middle of state two 
and one at the beginning of state zero. The first edge can be 
formed by logically "ANDing" state_2 with the falling edge of the 
clock. The second edge can be formed by decoding state zero. 



edge 1 = /clock * /Q2 * Ql * /QO 



edge_2 - /Q2 * /Ql * /QO 



;edge between 
; states two and 
; three 

;edge at state 
; zero 



The logical "OR" of these two equations will provide the needed 
rising edges. To provide a clean output, this signal should clock 
another output register. 

The next step in the design is to pick the appropriate PAL device 
to fit this design. Our biggest concern is that we need the 
capability of clocking the counter at one speed and the output flip- 
flop at another. To do this, we cannot use a PAL device that has 
a dedicated clock pin; we need an architecture that allows pro- 
grammable clocks. The PAL20RA10, PAL16RA8 and 
AmPALC29MA16 have this feature. This design uses the 
PAL20RA10 device. 

Since the clock signal requires two product terms (one for each 
edge), and the PAL20RA10 clock signal has only one product 
term, this implementation is not possible. Anothertechnique is to 
use the independent asynchronous SET and asynchronous 
RESETproducttermsofthe PAL20RA10 output register. AHIGH 
on the SET product term asserts the register output, and a HIGH 
on the RESET productterm unasserts the register output. Due to 
the asynchronous nature of the product terms some adjustment 
in timing is required. The SET product term is asserted when in 
state (Q2=0, 01=0 and 00=0), and the RESET product term is 
asserted when between states two and three. 

OUTPUT. SET = /clock * /Q2 * Ql * /QO ; set between 

; states 2 & 3 

OUTPUT. RESET - /Q2 * /Ql * /QO ; reset at 

; state zero 
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Figure 52. Waveform for a Frequency Divider 
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TITLE 

PATTERN 

REVISION 

AUTHOR 

COMPANY 

DATE 



DIVIDE BY FIVE EXAMPLE 

DIVBY5.PDS 

A 

BARRY SEIDNER 

MONOLITHIC MEMORIES INC. 

7/25/1987 



SANTA CLARA, CA 



CHIP DIVIDER PAL20RA10 



PL CLK 13 14 15 16 17 18 
OE Q14 015 Q16 Q17 Q18 Q19 Q20 



EQUATIONS 



19 110 111 GND 
/Q21 /Q22 /Q23 VCC 



;The following equations delineate the three bit counter of our 
;state machine driver. It has a total count of five starting 
;at zero and counting thru four, then resetting to zero. 



Q23:= Q21 * Q22 

Q23.CLKF=CLK 

Q23.TRST=VCC 



/Q23 



MSB OF STATE MACHINE 

CLOCK ON RISING EDGE OF CLK 

OUTPUT ALWAYS ON 



Q22:=/Q22 * Q21 

Q22.CLKF=CLK 

Q22.TRST=VCC 



+ Q22 * /Q21 



BIT 1 OF STATE MACHINE 
CLOCK ON RISING EDGE OF CLK 
OUTPUT ALWAYS ON 



Q21:=/Q21 * /Q23 

Q21.CLKF=CLK 

Q21.TRST=VCC 



;LSB OF STATE MACHINE 

;CLOCK ON RISING EDGE OF CLK 

/OUTPUT ALWAYS ON 



;The last cell of our divide by five counter is the actual 
;output of divider. It divides by setting and resetting at the 
.•appropriate time in the count sequence. The output will stay high 
;for count zero, one, and the first half of two-then go low for 
;the second half of two, thru three and four. 



Q20 


SETF = /Q23 


* Q22 * /Q21 


Q20 


RSTF = /Q23 


* /Q22 * /Q21 


020 


CLKF = GND 




Q20 


:= GND 




Q20 


TRST = VCC 





/CLK 



SET ON COUT THREE AND 
RESET ON COUNT ZERO 
CLOCK INPUT NOT USED 
DATA INPUT NOT USED 
OUTPUT ALWAYS ON 



1/2 



SIMULATION 



;This simulation will run the divider through 
.-several passes of the count to verify operation. 



TRACE ON 



OE PL CLK Q23 Q22 Q21 Q20 



.•Initialization of all inputs. Before the simulator can be 
.•executed, all of the inputs to the PAL device should be 
;set to a specific value. This section of simulator 
/declaration sets three input variables inactive. 



SETF 



PL /OE /CLK 



;SET PRELOAD FALSE 
;SET OUTPUT ENABLE 
;SET CLOCK LOW 



.•The PRLDF statement will set the registers to a known 
.•state at the beginning of our simulation. This line sets 
;us to state zero. 

PRLDF /Q23 /Q22 /Q21 Q20 

;The FOR loop statement exemplifies an easy way to perform 
/repetitive functions. In this example, the "RA" cell does 
;not have a dedicated clock input so this routine will 
.•generate 30 repetitive clocks to drive our design. 



FOR I:=l TO 30 DO 

BEGIN 

SETF CLK 
SETF /CLK 

END 



;SET CLOCK HIGH 
;SET CLOCK LOW 



;End of simulation 
TRACE OFF 



Figure 53. Design File for Frequency Divider 



^ 



Monoliihio 



m 



Memories 



^ 



2-99 



Registered Logic Design 



Addressable Registers 

Addressable registers are commonly-usGd MSI functions, often 
implemented in PAL devices. Addressable registers are used as 
building blocks for digital computers. Depending upon the ad- 



TITLE 


16-BIT ADDRESSABLE REGISTER 


PATTERN 


ADREG16.PDS 


REVISION 


A 


AUTHOR 


John Birlcn«r 


COMPANY 


Monolithic Meiiiori«« Inc. Santa Clara 


DATE 


2/11/85 



dress input one of the many flip-flops in a register is loaded with 
the input data. All other flip-flops in the register retain their 
previous values. An example of a 16-bit addressable register is 
shown in Figure 54. 



; Th« 16-blt addressabl* r«giater loads one of 16 registers 
selected by ADDR[0..3] with data input, DATA. 

CHIP ADREG16 PAL32R16 

QO Ql (52 Q3 /El NC NC AO Al VCC A2 A3 DATA NC /PRLD2 CLK2 

Q4 Q5 Q6 Q7 Q8 Q9 QIO Qll /E2 NC NC NC NC GND NC NC NC NC /PRLDl CLKl 

Q12 Q13 Q14 Q15 



Q14 :- AO *Q1« 
+ /Al 'QU 
* /A2 *Q1« 


;hold 
;hold 
!hold 


+ /A3*Q14 
+ /AO* Al« A2« A3«DATA 


;hold 
;load 


Q15 :- /AO 'QIS 
■>■ /Al •Q15 
» /A2 'QIS 
+ /A3»Q15 
+ AO* Al* A2* A3*DATA 


;hold 
;hold 
;hold 
;hold 
; load 


SIMULATION 





SETF El E2 /DATA /PRLDl /PRLD2 



:. 


AO *ao 


;hold 


+ 


Al 'QO 


;hold 


+ 


A2 '00 


;hold 


+ 


A3*(20 


;hold 


* 


/A0*/A1*/A2*/A3* DATA 


;load 


:. 


/AO 'Ql 


;hold 


+ 


Al -Ql 


;hold 


+ 


A2 *Q1 


;hold 


■♦■ 


A3*Q1 


;hold 


* 


A0*/A1*/A2*/A3*DATA 


;load 


:. 


AO *Q2 


;hold 


■♦■ 


/Al *Q2 


;hold 


■♦■ 


A2 *Q2 


.-hold 


+ 


A3*Q2 


;hold 


" 


/AO* A1*/A2*/A3*DATA 


;load 


:. 


/AO *Q3 


;hold 


+ 


/Al •03 


;hold 


■f 


A2 •03 


.-hold 


-f 


A3«Q3 


;hold 


+ 


AO* A1*/A2*/A3«DATA 


.-load 


:. 


AO *Q4 


;hold 


-f 


Al *Q4 


;hold 


-*■ 


/A2 *Q4 


!hold 


•f 


A3*Q4 


;hold 


♦ 


/A0*/A1* A2*/A3*DATA 


;load 


:. 


/AO *05 


;hold 


•f 


Al •OS 


.-hold 


+ 


/A2 *Q5 


;hold 


+ 


A3*Q5 


;hold 


+ 


A0«/A1* A2*/A3*DATA 


;load 


.. 


AO *Q6 


;hold 


+ 


/Al *06 


;hold 


+ 


/A2 •QS 


;hold 


+ 


A3*Q6 


;hold 


+ 


/AO* Al* A2*/A3^DATA 


;load 


._ 


/AO •Q? 


;hold 


+ 


/Al *Q7 


.-hold 


+ 


/A2 *Q7 


;hold 


+ 


A3*Q7 


;hold 


+ 


AO* Al* A2*/A3*DATA 


;load 


._ 


AO 'OS 


;hold 


+ 


Al *Q3 


Ihold 


+ 


A2 •Q8 


;hold 


■f 


/A3*Q8 


;hold 


+ 


/A0*/A1*/A2* A3*DATA 


;load 


:. 


/AO 'OS 


;hold 


■f 


Al *Q9 


;hold 


+ 


A2 *a9 


;hold 


+ 


/A3*Q9 


;hold 


■f 


A0*/A1*/A2* A3*DATA 


;load 


:. 


' AO *Q10 


;hold 


+ 


/Al *Q10 


:hold 


+ 


A2 *Q10 


;hold 


+ 


/A3*Q10 


;hold 


+ 


/AO* Al*/A2* A3*DATA 


;load 


:■ 


■ /AO *Q11 


;hold 


+ 


/Al 'Qll 


;hold 


+ 


A2 *Q11 


ihold 


+ 


/A3*Q11 


;hold 


+ 


AO* Al*/A2* A3*DATA 


.•load 


:■ 


■ AO *Q12 


;hold 


+ 


Al •012 


;hold 


■♦■ 


/A2 •012 


Ihold 


+ 


/A3^Q12 


;hold 


+ 


/A0*/A1* A2* A3*DATA 


;load 


:■ 


■ /AO *Q13 


.-hold 


+ 


Al *Q13 


;hold 


+ 


/A2 *Q13 


;hold 


+ 


/A3*013 


;hold 


+ 


A0*/A1* A2* A3*DATA 


;load 

Figure 54. 



010 
Qll 
Q12 
Q13 
Q14 
Q15 



Al A2 A3 



AO Al A2 A3 



SETF DATA 



SETF /AO /Al /A2 /A3 
CLOCKF CLXl CLX2 



g g cgcgcg 
XXXXLLLLLL 
XXXXXXLLLL 
XXXXXXXXLL 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXLLLKHLLH 
XXLLLLLHHH 
XXIXLLLLLL 

yV TTTTTTTT. 

L T.TiT.T.r.r.T.riT.T, 



cgcgcgcgcg 

LLLLLLLLLL 
LLLLLLLLLL 
LLLLLLLLLL 



LLLLLLLLLL LLLLLLLLLL 
XXLLLLLLLL LLLLLLLLLL 



XXXXLLLLLL 
XXXXXXLLLL 
XXXXXXXXLL 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
HLLHHLLHHL 
HLIXLHHHHL 
LHHHHHHHHL 

LLLLLLLLLL 



LLLLLLLLLL 
LLLLLLLLLL 
LLLLLLLLLL 
LLLLLLLLLL 
XXLLLLLLLL 
XXXXLLLLLL 
XXXXXXLLLL 
XXXXXXXXLL 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
LHHLLHHLLH 
LLLHHHHLLL 
LLLLLLLHHH 
HHHHHHHHHH 
Tl.TlTrTrTiT.T.T, 



cgcgcgc 

LLLLLLH 
LLLLLLL 
LLLLLLL 
LLLLLLL 
LLLLLLL 
LLLLLLL 
LLLLLLL 
LLLLLLL 
LLLLLLL 
LLLLLLL 
LLLLLLL 
LLLLLLL 
LLLLLLL 
LLLLLLL 
XXLLLLL 
XXXXLLL 
HLLHHLL 
LHHHHLL 
HHHHHLL 
HHHHHLL 
LLLLLHH 
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Introduction 



State machine designs are widely used for sequential control 
logic, which forms the core of many digital systems. State ma- 
chines are required in a variety of applications covering a broad 
range of performance and complexity; low-level control of micro- 
processor-to-VLSI-peripheral interfaces, bus arbitration and 
timing generation in conventional microprocessors, custom bit- 
slice microprocessors, data encryption and decryption and trans- 
mission protocols are but a few examples. 

Typically, the details of control logic are the last to be settled in the 
design cycle, since they are continuously affected by changing 
system requirements and feature enhancements. Program- 
mable logic is a forgiving solution for control logic design because 
it allows easy modifications to be made without disturbing PC 
board layout, its flexibility provides an escape valve that permits 
design changes without impacting time-to-market. 

A majority of registered PAL device applications are sequential 
control designs where state machine design techniques are 
employed. As technology advances, new high-speed and high- 
functionality devices are being introduced which simplify the task 
of state machine design. A broad spectrum of different function- 
ality-and-performance solutions is available for state machine 
design. In this discussion we will examine the functions per- 
formed by state machines, their implementation on various de- 
vices, and their selection. Finally, we will implement a state 



machine design and go through all of the stages involved in a 
design tutorial. 

What Is a State Machine? 

A state machine is a digital device which traverses through a 
predetermined sequence of states in an orderly fashion. A state 
is a set of values measured at different parts of the circuit. A 
simple state machine can consist of PAL-device-based combina- 
torial logic, output registers, and buried (state) registers. The 
state in such a sequencer is determined by the values stored in 
the buried and/or output registers. 

A general form of a state machine can be depicted as a device 
shown in Figure 1 . In addition to the device inputs and outputs, 
a state machine consists of two essential elements: combinatorial 
logic and memory (registers). This is similar to the registered 
counter designs discussed on page 2-66, which are essentially 
simple state machines. The memory is used to store the state of 
the machine. The combinatorial logic can be viewed as two 
distinct functional blocks: the next state decoder and the output 
decoder (Figure 2). The next state decoder determines the next 
state of the state machine while the output decoder generates the 
actual outputs. Although they perform two distinct functions, 
these are usually combined into one combinatorial logic array as 
in Figure 1. 
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Figure 1. Block Diagram of a Simple State Machine 



COMBINATORIAL 
LOGIC 



COMBINATORIAL 
LOGIC 



NEXT 

-*| STATE 

DECODER 

(TRANSITION 

FUNCTION) 



MEMORY 
(REGISTERS) 



OUTPUT 
DECODER 
(OUTPUT 
DECODE 
FUNCTION) 



Figure 2. State Machine, with Separate Output & Next State Decoders 
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The basic operation of a state machine is twofold: 

1 . Ittraverses through a sequence of states, where the next state 
is determined by next state decoder, depending upon the 
present state and input conditions. 

2. It provides sequences of output signals based upon state 
transitions. The outputs are generated by the output decoder 
based upon present state and input conditions. 

Using input signals for deciding the next state is also known as 
branching. In addition to branching, complex sequencers provide 
the capability of repeating sequences (looping) and subroutines. 
The transitions from one state to another are called control 
sequencing and the logic required for deciding the next states is 
called the transition function (Figure 2). 

The use of input signals in the decision-making process for output 
g0/7eraf/b/7 determines the type of a state machine. There are two 
widely-known types of state machines: Mealy and Moore 
(Figure 3). Moore state machine outputs are a function of the 
present state only. In the more general Mealy-type state ma- 
chine, the outputs are functions of both the state and the input 
signals. The logic required is known as the output function. For 
either type, the control sequencing depends upon both states and 
input signals. 

Most practical state machines are synchronous sequential cir- 
cuits which rely on clock signals to triggerthe state transitions. A 
single clock is connected to all of the state and output edge- 
triggered flip-flops, which allows a state change to occur on the 
rising edge of the clock. Asynchronous state machines are also 
possible, which utilize the propagation delay in combinatorial 
logic for the memory function of the state machine. Such 
machines are highly susceptible to hazards, hard to design and 
are seldom used. In our discussion we will focus solely on 
sequential state machines. 



State Machine Applications 

State machines are used in a number of system control applica- 
tions. A sampling of a few of the applications, and how state 
machines are applied, is described below. 

As sequencers for digital signal processing (DSP) applications, 
state machines offerspeed and sufficient functionality withoutthe 
overkill of complex microprocessors. For simple algorithms, such 
as those involved in performing a Fast Fourier Transform (FFT), 
a state machine can control the set of vectors that are multiplied 
and added in the process. For complex DSP operations, a pro- 
grammable DSP may be better. On the other hand, the program- 
mable DSP solution is not likely to be as fast as the dedicated 
hardware approach. 

Consider the case of a video controller. It generates addresses 
for scanning purposes, using counters with various sequences 
and lengths. But instead of implementing these as actual count- 
ers, the sequences involved can be "unlocked"and implemented, 
instead, as state machine transitions. And there is an advantage 
beyond mere economy of parts. A count can be set or initiated, 
then left to take care of itself, freeing the microprocessor for other 
operations. 

In peripheral control the simple state machine approach can be 
very efficient. Considerthecase of run-length-limited (RLL)code. 
Both encoding and decoding can be translated into state ma- 
chines, which examine the serial data stream as it is read, and 
generate the output data. 

Industrial control and robotics offer further areas where simple 
control functions are required. Such tasks as mechanical posi- 
tioning of a robot arm, simple decision making, and calculation of 
a trigonometric function, usually do not require the high-power 
solution of microprocessors with stacks and pointers. Rather, 
what is required is a device that is capable of storing a limited 
number of states and allows simple branching upon conditions. 
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Figure 3. The Two Standard State Machine Models 
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Data encryption and decryption present similar problems to those 
encountered in encoding and decoding for mass media, only here 
it is desirable to make the scheme not so obvious. A program- 
mable state machine device with a security fuse is ideal for this 
because memory is internally programmed and cannot be ac- 
cessed by someone tampering with the system. 



Control sequencing is represented in the state transition diagram 
as shown in Figure 5. Direct control sequencing requires an 
unconditional transition from state A to state B. Similarly condi- 
tional control sequencing shows a conditional transition from 
state C to either state D or state E, depending upon input 
signal 11. 



Functions Performed 

Allof the system design functions performed by controllers can be 
categorized as one of the following state machine functions: 

• Arbitration 

• Event monitoring 

• Multiple condition testing 

• Timing delays 

• Control signal generation 

Later we will take a design example and illustrate how these 
functions can be used when designing a state machine. 

State Machine Theory 

Let us take a brief look at the underlying theory for all sequential 
logic systems, the finite state machine (FSM), or simply state 
machine. 

Those parts of digital systems whose outputs depend on their 
past inputs as well as their current ones can be modeled as finite 
state machines. The "history" of the machine is summed up in the 
value of its internal state. When a new input is presented to the 
FSM, an output is generated which depends on this input and the 
present state of the FSM, and the machine is caused to move into 
a new state, referred to as the next state. This new state also 
depends on both the input and present state. The structure of an 
FSM is shown pictorially in Figure 2. The internal state is stored 
in a block labelled "memory". As discussed-earlier, two combina- 
torial functions are required: the transition function, which gener- 
ates the value of the next state, and the output function, which 
generates the state machine output. 

State Diagram Representation 

The behavior of an FSM may be specified in graphical form as 
shown in Figure 4. This is called a state diagram, or state 
transition diagram. Each bubble represents a state, and each 
arrow represents atransition between states. Inputs which cause 
the transitions are shown next to each transition arrow. 
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Figure 5. Control Sequencing 

For Moore machines the output generation is represented by 
assigning outputs with states (bubbles) as shown in Figure 6. 
Similarly, for Mealy machines conditional output generation is 
represented by assigning outputs to transitions (arrows), as was 
shown in Figure 4. More detail on Mealy and Moore output 
generation is given later. 





a Moore Machine 



b. Mealy Machine 



Figure 6. Output Generation 

For this notation, there is a specification uncertainty as to which 
signals are outputs or inputs, as they both occur on the drawing 
next to the arrow in whicii they are active. This is usually resolved 
by separating the input and output signals names with a line 
(Figures 4 & 6). Sometimes an auxiliary pin list detailing the logic 
polarity and input or output designations is also used. 

State transition diagrams can be made more compact by writing 
on the transitions not the input values which cause the transition, 
as in Figure 4, but a Boolean expression defining the input 
combination or combinations whicli cause this transition. For 
example, in Figure 7, some transitions have been shown for a 
machine with inputs "START", "Xl" and "X2". In the transition 
between states 1 and 2, the inputs XI and X2 are ignored (that is, 
they are "don't cares") and thus do not appear on the diagram. 
This saves space and makes the function more obvious. 




Figure 4. State lUIachine Representation 



Figure 7. State Transition Diagram with Mnemonics 
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There can be a problem with this method if one is careless. The 
state transitions in Figure 8 show what can happen. There are 
three input combinations, {10, 11, 12, 13} = {1011}, {1101} and 
{1111}, which make both (/I0VI2 + 13) and (10*11 + 10*12) true. 
Since a transition to two next states is impossible, this is an error 
in the specification. It must either be guaranteed that these input 
combinations never occur, or the transition conditions must be 
modified. In this example, changing (10*11 + 10*12) to (10*11 + 
i0*l2)*/l3 would solve the problem. 



ALL OTHER 
COMBINATIONS 




Figure 8. State Diagrarii with Conflicting 
Branch Conditions 

State Transition Table Representation 

A second method for state machine representation is the tabular 
form known as the state transition table, which has the format 
shown in Figure 9. Along the top are listed all of the possible input 
bit combinations and internal states. Each row gives the next 
state and the next output; the table thus specifies the transition 
and output functions. This type of table, however, is not suitable 
for specifying practical machines in which there is a large number 
of inputs, since each input combination defines a row of the table. 
With 10 inputs for example, there would have to be 1024 rowsl 
A modified version of this table is often used directly for program- 
mable logic sequencer (PLS) device design. 



PRESENT 
STATE 


INPUTS 


NEXT STATE 


OUTPUTS 
GENERATED 


So-s„ 


'o-l. 


So-s„ 


Oo-Op 



Figure 9. A State Transition Table 



Flowcharts 



The use of these symbols is shown in Figure 10. Each path 
through the decision boxes from one state to another defines a 
particular combination or set of combinations of the input vari- 
ables. A path does not have to include all input variables; thus it 
accommodates "don't cares". These decision trees take more 
space than the expressions would, but in many practical cases, 
state machine controllers only test a small subset of the input 
variables in each state and the trees are quite manageable. Also, 
the chain of decisions often mirrors the designer's way of thinking 
about the actions of the controller, it is important to note thatthese 
tests are not performed sequentially in the FSM; all are performed 
in parallel by the FSM's state transition logic. 

A benefit of this method of specifying transitions is that the 
problem of Figure 8 can be avoided. Such a conflict would be 
impossible as one path cannot diverge to define paths to two 
states. 

When there is no danger of conflicts due to multiple next states 
being defined, this flowchart notation can be compacted by 
allowing more complex decisions. Expressions can be tested, as 
shown in Figure 11a, or multiple branches can extend from a 
decoding box, as in Figure 11b. in the second case it is 
convenient to group the set of binary inputs into a vector, and 
branch on different values of this vector. 

The three methods of state machine representation — state dia- 
grams, state tables, and flowcharts — are all equivalent and 
interchangeable, since they all describe the same hardware 
structure. Each style has its own particular advantages. Although 
most popular, the state transition diagrams are more complex for 
problems where state transitions depend on many inputs, since 
the transition conditions are written directly on the transition 
arrows. Although cumbersome, the state tables allow the de- 
signer tight control over signal logic. Flowcharts are convenient 
for small problems where there are not more than about ten states 
and where up to two or three inputs or input expressions are 
tested in each state. For larger problems, they can become 
ungainly. 

Once a state machine is defined, it must be implemented on a 
device. Software packages are then used to implement the 
design on a device. The task is to convert the state machine 
description into transition and output functions. Software pack- 
ages also account for device-specific architectural variations and 
limitations, to provide a uniform user interface. 



Another popular notation is based on flowcharts, in this notation, 
states are represented by rectangular boxes, and alternative 
state transitions are determined by strings of diamond-shaped 
boxes. The elements may have multiple entry points, but in 
general have only one exit. The state name is written as the first 
entry in the rectangular state box. Any Moore outputs present are 
written next in the state box, with a caret {""") following those that 
are unregistered. The state code assignment, if it is known, is 
written next to the upper right corner of the state box. Decision 
boxes are diamond or hexagonal shaped boxes containing either 
an input signal or a logic expression. Two exits labelled "0" and 
"1" lead to either another decision box, a state box, or a Mealy 
output. The rounded oval is used for Mealy machine outputs. 
Once again, a caret ("'^") follows those outputs that are unregis- 
tered. All of the boxes may need to be expanded to accommodate 
a number of output signals or a larger expression. 



NN STATE CODE 



( (A«. B». C*. ■■■) ) 




( (AB,C, ...) ) 



Figure 10. Flowchart Notation 
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Some software packages accept all of the three different state 
machine representations directly as design inputs. However, the 
most prevalent design methodology is to convert the three state 
machine design representations to a simple textual representa- 
tion. Textual representations are accepted by most software 
packages although the syntax varies. The PALASM 2 software 
package offers one such simple and easy-to-use state machine 
textual representation. The task of converting from a state 
transition diagram and flowchart representation to PALASM 2 
software state machine syntax is demonstrated in a design 
tutorial on page 2-122. 

Since the most common of ail state machine representations is 
the state transition diagram representation, we will use it in all 
subsequent discussions. Transition table and flowchart repre- 
sentation implementations will be very similar. 




a. Testing Expressions 



State Machine Types: Mealy & Moore 

With the state machine representation clarified, we can now 
return to the generic sequencer model of Figure 1, which has 
been labelled (Figure 12) to show the present state (PS), next 
state (NS) and output (OB, OA). This will illustrate how Mealy and 
Moore machines are implemented with most sequencer devices 
which provide a single combinatorial logic array for both next state 
and output decodef unctions. There are four ways of using the se- 
quencer, two of which implement Moore machines and two Mealy. 
First, let us look at the fy^lealy forms. 

The standard Mealy form is shown in Figure 1 3, where the signals 
are labelled as in Figure 1 2 to indicate which registers and outputs 
are used. The register outputs PS are fed back into the array and 
define the present state. The combinatorial logic implements the 
transition function, which produces the next state flip-flop inputs 
NS, and the output function, which produces the machine output 
OB. This is the asynchronous Mealy form. 




b. MulSway Branch 



407 11b 

Figure 11. Using Flowcharts 
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Figure 12. Generic Mode! of an FSIVI 



TRANSITION 
FUNCTION 



NEXT 

STATE 

NS 



REGISTER 

(STATE 
MEMORY) 



OUTPUT 
FUNCTION 



■+-OB 

OUTPUTS 



PRESENT STATE 
PS 



CLOCK 

Figure 13. Asynchronous Mealy Form 
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An alternative Mealy form is shown in Figure 1 4. Here tlie outputs 
are passed through an extra output register (OA) and thus do not 
respond immediately to input changes. This is the synchronous 
Mealy form. 

The standard Moore form is given in Figure 1 5. Here the outputs 
OB depend only on the present state PS. This is the asynchro- 
nous Moore form. The synchronous Moore form is shown in 
Figure 1 6. In this case the combinatorial logic can be assumed 
to be the unity function. The outputs (OB) can be generated 
directly along with the present state (PS). Although these forms 
have been described separately, a single sequencer is able to 
realize a machine which combines them, provided that the 
required paths exist in the device. 



In the synchronous Moore form, the outputs occur in the state in 
which they are named in the state transition diagram. Similarly in 
the asynchronous Mealy and Moore forms the outputs occur in 
the state in which they are named, although delayed a little by the 
propagation delay of the output decoder. This is becausethey are 
combinatorial functions of the state (and inputs in the Mealy 
case). 

However, the synchronous Mealy machine is different. Here an 
output does not appear in the state in which it is named, since it 
goes into another register first. It appears when the machine is 
in the next state, and is thus delayed by one clock cycle. The state 
diagram in Figure 17 illustrates ail of the possibilities on a state 
transition diagram. 
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Figure 1 4. Synchronous Mealy Form 
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Figure 15. Asynchronous Moore Form 
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Figure 17. State Diagram Labelling for 
Different Output Types 

As a matter of notation, Moore outputs are often placed within the 
state bubble and Mealy outputs are placed next to the path or 
arrow which activates them. 

The relationship of Mealy and Moore, synchronous and asyn- 
chronous outputs to the states is shown in Figure 18. 



Logic-based devices include the PAL and PLS devices. These 
devices use the sum-of-products logic array to implement the 
transition and output functions. The memory-based devices like 
PROSE implement the transition and output functions using a 
PROM or RAM array. Conceptually this is no different from the 
logic-based devices, since the memory can be viewed as special 
logic. The instruction-based sequencers (Am29PL141) offer 
hardwired instructions and fixed logic blocks to provide the 
transition function logic with enhanced capabilities like subrou- 
tines. Functionally all three types of devices work similarly per- 
forming two basic functions: control sequencing and output 
generation. 

There are three major criteria for selecting the correct state 
machine device for a design: 

• Number of inputs/outputs 

- I/O flexibility 

- Number of output registers 

• Speed 

• Intelligence/functionality 

- Number of product terms 

- Type of flip-flops 

- Number of state registers 

- Number of PROM locations (memory-based sequencers) 

Number of l/Os 



Device Selection Considerations 

Architecturally, the state machine devices can be divided into 
three categories: 

• Logic-based devices 

• Memory-based devices 

• Instruction-based devices 



REGISTERED MOORE 
OUTPUT 



REGISTERED MEALY 
OUTPUT 



ASYNCHRONOUS 
MOORE 
OUTPUT 



ASYNCHRONOUS 

MEALY 

OUTPUT 



The numberof inputs, outputs and I/O pinsdeterminesthe signals 
which can be sampled or generated by a state machine. Figure 
1 9 lists the devices offered for state machine designs, and shows 
the number of inputs, outputs and I/O pins available on each 
device. 

Timing and Speed 

The timing considerations for sequencer design are similar to 
those for registered logic design (page 2-64). A system clock 




Figure 18. State Machine Timing Diagram 
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DEVICE 


TOTAL 


DEDICATED 


DEDICATED 


I/O 


LOGICAL 


SPEED GRADES 




NUMBER 


INPUT 


REGISTERED 


PINS 


PRODUCT 


FREQUENCY IN 




OF PINS 


PINS 


OUTPUT 
PINS 




TERMS/ 
OUTPUT 


MHz 
'max 


TTL 


16R8 


20 


8 


8 


. 


8 


55.5, 37, 25, 16 


16R6 


20 


8 


6 


2 


8 


55.5,37,25,16 


16R4 


20 


8 


4 


4 


8 


55.5,37,25,16 


16RP8 


20 


8 


8 


- 


8 


22.2 


16RP6 


20 


8 


6 


2 


8 


22.2 


16RP4 


20 


8 


4 


4 


8 


22.2 


16RA8 


20 


8 


0-8 


8-0 


4 


20 


16X4 


20 


8 


4 


4 


8 


14 


23S8 


20 


9 


4 


4 


8-12 


33.3, 28.5 


20R8 


24 


12 


8 


. 


8 


37,25,16 


20R6 


24 


12 


6 


2 


8 


37.25,16 


20R4 


24 


12 


4 


4 


8 


37.25,16 


20X10 


24 


10 


10 


- 


4 


22.2 


20X8 


24 


10 


8 


2 


4 


22.2 


20X4 


24 


10 


4 


6 


4 


22.2 


20RP10 


24 


10 


10 


. 


8 


37,25 


20RP8 


24 


10 


8 


2 


8 


37.25 


20RP6 


24 


10 


6 


4 


8 


37,25 


20RP4 


24 


10 


4 


6 


8 


37,25 


20XRP10 


24 


10 


10' 


- 


8 


30,22.2,14 


20XRP8 


24 


10 


8 


2 


8 


30,22.2,14 


20XRP6 


24 


10 


6 


4 


8 


30,22.2,14 


20XRP4 


24 


10 


4 


6 


8 


30, 22.2, 14 


20RS10 


24 


10 


10 


. 


8-16 


19.2 


20RS8 


24 


10 


8 


2 


8-16 


19.2 


20RS4 


24 


10 


4 


6 


8-16 


19.2 


20RA10 


24 


10 


0-10 


10-0 


4 


33,20 


22RX8 


24 


14 


0-8 


8-0 


8 


28.5 


22V10 


24 


12 


0-10 


10-0 


8-16 


40.28.5,18 


32VX10 


24 


12 


0-10 


10-0 


8-16 


25, 22.2 


32R16 


40 


16 


16 




8-16 


16 


PLS167 


24 


14 


6 




Total 48 


33 


PLS168 


24 


12 


8 


- 


Total 48 


33 


PLS105 


28 


16 


8 


- 


Total 48 


37 


PMS14R21 


24 


8 


8 


- 


128 states 


25 


29PL141 


28 


8 


16 


- 


64 states 


20 


CMOS 


G16R8 


20 


8 


8 


. 


8 


28.5 


C16R6 


20 


8 


6 


2 


8 


28.5 


C16R4 


20 


8 


4 


4 


8 


28.5 


C20R8 


24 


12 


8 


- 


8 


20, 15.3 


C20R6 


24 


12 


6 


2 


8 


20, 15.3 


C20R4 


24 


12 


4 


4 


8 


20, 15.3 



Figure 19. Devices for State Machine Design 
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DEVICE 


TOTAL 
NUMBER 
OF PINS 


DEDICATED 
INPUT 
PINS 


DEDICATED 

REGISTERED 

OUTPUT 

PINS 


I/O 
PINS 


LOGICAL 

PRODUCT 

TERMS/ 

OUTPUT 


SPEED GRADES 
FREQUENCY IN 

MHz 

'max 


C22V10 


24 


12 


0-10 


10-0 


8-16 


33.3, 20 


C29M16 
C29MA16 


24 
24 


5 
5 


- 


16 
16 


8-16 
4-12 


20,15 
20.15 


ECL 


10H20EV/EG8 
10020EV/EG8 


24 
24 


12 
12 


0-8 
0-8 


8-0 
8-0 


8-12 
8-12 


125 
125 



Figure 19. Devices for State Machine Design (Cont'd.) 



cycle forms the basic l<ernel for evaluating control function 
behavior. For the most part, all input and output functions are 
specified in relationship to the positive edge. Registered outputs 
are available after a period of time X^^^^, the clock-to-output 
propagation delay. Asynchronous outputs require an additional 
propagation delay (tpp) before they are valid. 

For the circuit to operate reliably, all of the flip-flop inputs must be 
stable at the flip-flop no later than the minimum set-up time (t J of 
the flip-flops before the next active clock edge. If one of the inputs 
changes after this threshold, then the next state or synchronous 
output could be stored incorrectly; the circuit may even malfunc- 
tion. To avoid this, the clock period (t ) must be greater than the 
sum of the set-up time of the flip-flops and the clock to output time 
(L + *clk)- "'"'^'^ determines the minimum clock period and hence 
the maximum clock frequency, f„^^, of the circuit. Metastability 
and erroneous system operation may occur if these specifica- 
tions are violated. 

The timing relationships are shown in Figure 20. In each cycle 
there are two regions: the stable region, when all signals are 
steady, and the transition region, when the machine is changing 
state and signals are unstable. The active clock edge causes the 
flip-flops to load the value of the new state which has been set up 



at their inputs. At atime afterthis, the present state and output flip- 
flop outputs will start to change to their new values. After a time 
has elapsed, the slowest flip-flop output will be stable at its new 
value. Ignoring input changes for the moment, the changes in the 
state register cause the combinatorial logic to start generating 
new valuesforthe asynchronous outputs and the inputs to theflip- 
flops. If the propagation delay of the logic is tp^, then the stable 
period will start at atime equal to the sum of the maximum values 
o^Wandtp,. 

Figure 19 also shows the maximum operating frequency of the 
devices. ECL-based PAL sequencers can implement simple 
state machines from 60 MHz to above 100 MHz. Conventional 
TTL PAL sequencers can now reach speeds of 55 MHz. The new 
PLS devices operate at 37 MHz. Finally, PROSE sequencers 
provide 25 MHz operation, and the Am29PL141 can run at 20 
MHz. The design engineer usually selects the simplest device 
that provides the desired level of performance. 

Asynchronous Inputs 

The timing of the inputs to a synchronous state machine are often 
beyond the control of the designer and may be random, such as 
sensor or keyboard inputs, or they may come from another 



CLOCK 


X^ 














- 






4 tp_ 


» 


* — 


•su »j'cLK 


^_ 




INPUT 




X^ 


( 




n 


f 

V 


X)X 














REGISTERED 
OUTPUT 


J( 


x_ 




) 


/ 


X 






♦-tpD->| 






ASYNCHRONOUS 
OUTPUT 




X. 




JL 


X. 




_x 


STATE 




n 


> 


L 


n + 1 


) 


{ n + 2 


X 


n + 3 



Figure 20. Timing Diagram for Maximum Operating Frequency 
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Figure 21. Asynchronous Input Causing Race 



synchronous system that has an unrelated clock. In either case 
no assumptions can be made about the times when inputs can or 
cannot arrive. This fact causes reliability problems that cannot be 
completely eliminated, but only reduced to acceptable levels. 

Figure 21 shows two possible transitions from state "S1" (code 
00) either back to itself, orto state "S2" (code 1 1 ). Which transition 
is taken depends on input variable "A" which is asynchronous to 
the clock. The transition function logic for both state bits B1 and 
B2 includes this input. The input A can appear in any part of the 
clock cycle. For the flip-flops to function correctly, the logic for B1 
and 82 must stabilize correctly before the clock. The input should 
be stable in a window t^ (setup time) before the clock and t^ (hold 
time) after the clock. If the input changes within this window, both 
the flip-flops may not switch, causing the sequence to jump to 
states 01 or 1 0, which are both undefined transitions. This type 
of erroneous behavior is called an input race. 

A solution to this problem is to change the state assignment so 
that only one state variable depends on the asynchronous input. 
Thus the 11 code must be changed to 01 or 10. Now, with only 
one unsynchronized flip-flop input, eitherthe input occurs in time 
to cause the transition, or it does not, in which case no transition 
occurs. In the case of a late input, the machine will respond to it 
one cycle later, provided that the input is of sufficient duration. 

There is still the possibility of an input change violating the setup 
time of the internal flip-flop, driving it into a metastable state. This 
can produce system failures which can be minimized, but never 
eliminated (see Metastability, page 3-164). The same problem 
arises when outputs depend on an asynchronous input. 

Very little can be done to handle asynchronous inputs without 
severely constraining the design of the state machine. The only 
way to have complete freedom in the use of inputs is to convert 
them into synchronous inputs. This can be done by allocating a 
flip-flop to each input as shown in Figure 22. These synchronizing 
flip-flops are clocked by the sequencer clock, and may even be 
the sequencer's own internal flip-flops. This method is not 
foolproof, but significantly reduces the chance of metastability 
occurring. 



Functionality 

The functionality of different devices is difficult to compare since 
different device architectures are available. The number of 
registers in a device determines the number of stale combina- 
tions possible. However, all of the possible state combinations 
are not necessarily usable, since other device constraints may be 
reached. The number of registers do give an idea of the 
functionality achievable in a device. Otherfunctionality measures 
include the number of product terms, type of flip-flop, or the 
number of PROM locations in a memory-based sequencer. One 
device may be stronger than another in one of these measures, 
but overall may be less useful due to other shortcomings. Choos- 
ing the best device involves both skill and experience. 

A designer has a complete spectrum of devices with different 
architectures to choose from for a state machine design. These 
range from the PAL1 6R8D family of very high speed devices to 
the new PROSE PMS14R21 and instruction-based Am29PL1 41 
high-functionality devices. The spectrum is completed by mid- 
range PAL devices including the PAL23S8, PAL22RX8A and 
PAL32VX10/A, with architectural features specifically designed 
for state machine designs, and the PLS devices designed exclu- 
sively for state machine implementation. 

In order to give an idea of device functionality, we will consider 
each of the architecture options available to the designer and 
evaluate its functionality. 
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Figure 22. Input Synchronizing Register 
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PAL Devices as Sequencers 

A vast majority of state machine designs are implemented with 
PAL devices. Early versions of software required the user to 
manually write the sum-of-products Boolean equations for using 
PAL devices. Second generation software allows one to specify 
the design in "state machine syntax," and handles the translation 
to sum-of-products logic automatically. PAL devices implement 
the output and transition functions in sum-of-products form via a 
user-programmable AND array and a fixed OR array. 

PAL devices deliver the fastest speed of any sequencer and are 
ideally suited for simple control applications characterized by few 
input and output signals interacting within a dedicated controller 
in a sequential manner. The number of flip-flops in a PAL device 
range from 8 to 12, which offer potentially more than one thou- 
sand state values. Since some of the flip-flops are used for 
outputs, and the number of product terms is limited, the usable 
numberof states is reduced drastically. Generally, up to about 35 
states can be utilized. 

PAL Device Flip-flops 

PAL device based sequencers implement small state machine 
designs, which have a relatively large number of output transi- 
tions. Since the output registers change with most state transi- 
tions, they can be used simultaneously as state registers, once 
the state values are carefully selected. Most PAL devices are 
used for small state machines, and efficiently share the same 
register for output and state functions. High-functionality PAL 
device based sequencers provide dedicated buried state regis- 
ters when sharing is difficult. 



As a state machine traverses from one state to another, every 
output either makes a transition (changes logic level) or holds 
(stays at the same logic level). Small state machine designs 
require relatively more transitions and fewer holds. As designs 
get larger, state machines statistically require relatively fewer 
transitions and more holds. 

Most PAL devices provide D-type output registers. D-type flip- 
flops use up product terms only for active transitions from logic 
LOW to HIGH level, and for holds for logic HIGH level only. J-K, 
S-R, and T-type flip-flops use up product terms for both LOW-to- 
HIGH and HIGH-to-LOW transitions, but eliminate hold terms. 
Thus D-type flip-flops are more efficient for small state machine 
designs. Some high-end PAL devices offer the capability of 
configuring the flip-flops as J-K, S-R orT-types, which are more 
efficient for large state machine designs since they require no 
hold terms. 

Many examples of PAL-device-based sequencers can be found 
in system time base functions, special counters, interrupt control- 
lers, and certain types of video display hardware. 

PALdevices are produced in a variety of technologies for multiple 
applications, and provide a broad range of speed-power options. 
PAL devices which can be used for sequencer designs are listed 
in Figure 23. We will consider the following PALdevices in detail. 

PAL10H/10020EV/EG8 
PAL16R8D family 
PAL23S8 
PAL22RX8 
PAL32VX10 



DEVICE 


TOTAL 


DEDICATED 


DEDICATED 


I/O 


LOGICAL 


SPEED GRADES 




NUMBER 


INPUT 


REGISTERED 


PINS 


PRODUCT 


FREQUENCY IN 




OF PINS 


PINS 


OUTPUT 
PINS 




TERMS/ 
OUTPUT 


MHz 
'max 


TTL 


16R8 


20 


8 


8 


. 


8 


55.5,37,25,16 


16R6 


20 


8 


6 


2 


8 


55.5,37.25, 16 


16R4 


20 


8 


4 


4 


8 


55.5,37,25, 16 


16RP8 


20 


8 


8 


- 


8 


22.2 


16RP6 


20 


8 


6 


2 


8 


22.2 


16RP4 


20 


8 


4 


4 


8 


22.2 


16RA8 


20 


8 


0-8 


8-0 


4 


20 


16X4 


20 


8 


4 


4 


8 


14 


23S8 


20 


g 


4 


4 


8-12 


33.3, 28.5 


20R8 


24 


12 


8 


. 


8 


37,25, 16 


20R6 


24 


12 


6 


2 


8 


37.25,16 


20R4 


24 


12 


4 


4 


8 


37,25,16 


20X10 


24 


10 


10 


- 


4 


22.2 


20X8 


24 


10 


8 


2 


4 


22.2 


20X4 


24 


10 


4 


6 


4 


22.2 



Figure 23. Table of PAL Devices for Sequencer Applications 
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DEVICE 


TOTAL 


DEDICATED 


DEDICATED 


I/O 


LOGICAL 


SPEED GRADES 




NUMBER 


INPUT 


REGISTERED 


PINS 


PRODUCT 


FREQUENCY IN 




OF PINS 


PINS 


OUTPUT 
PINS 




TERMS/ 
OUTPUT 


MHz 
'max 


20RP10 


24 


10 


10 


. 


8 


37,25 


20RP8 


24 


10 


8 


2 


8 


37,25 


20RP6 


24 


10 


6 


4 


8 


37,25 


20RP4 


24 


10 


4 


6 


8 


37,25 


20XRP10 


24 


10 


10 


- 


8 


30, 22.2, 14 


20XRP8 


24 


10 


8 


2 


8 


30, 22.2, 14 


20XRP6 


24 


10 


6 


4 


8 


30,22.2,14 


20XRP4 


24 


10 


4 


6 


8 


30, 22.2, 14 


20RS10 


24 


10 


10 


. 


8-16 


19.2 


20RS8 


24 


10 


8 


2 


8-16 


19.2 


20RS4 


24 


10 


4 


6 


8-16 


19.2 


20XRP10 


24 ' 


10 


10 


. 


8 


30, 22.2, 14 


20XRP8 


24 


10 


8 


2 


8 


30, 22.2, 14 


20XRP6 


24 


10 


6 


4 


8 


30,22.2,14 


20XRP4 


24 


10 


4 


6 


8 


30, 22.2, 14 


20RS10 


24 


10 


10 


. 


8-16 


19.2 


20RS8 


24 


10 


8 


2 


8-16 


19.2 


20RS4 


24 


10 


4 


6 


8-16 


19.2 


20RA10 


24 


10 


0-10 


10-0 


4 


33,20 


22RX8 


24 


14 


0-8 


8-0 


8 


28.5 


22V10 


24 


12 


0-10 


10-0 


8-16 


40,28.5,18 


32VX10 


24 


12 


0-10 


10-0 


8-16 


25, 22.2 


32R16 


40 


16 


16 


- 


8-16 


16 


CMOS 


C16R8 


20 


9 


8 


. 


8 


28.5 


C16R6 


20 


9 


6 


2 


8 


28.5 


C16R4 


20 


9 


4 


4 


8 


28.5 


C20R8 


24 


13 


8 


- 


8 


20, 15.3 


C20R6 


24 


13 


6 


2 


8 


20, 15.3 


C20R4 


24 


13 


4 


4 


8 


20. 15.3 


C22V10 


24 


12 


0-10 


10-0 


8-16 


33.3, 20 


C29M16 


24 


5 


. 


16 


8-16 


20,15 


C29MA16 


24 


5 


- 


16 


4-12 


20,15 


ECL 


10H20EV/EG8 


24 


12 


0-8 


8-0 


8-12 


125 


10020EV/EG8 


24 


12 


0-8 


8-0 


8-12 


125 



Figure 23. Table of PAL Devices for Sequencer Applications (Cont'd.) 
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The ECL PAL10H/10020EV/EG8 

At 125 MHz, the ECL PAL10H/10020EV/EG8 provides 
the highest speed for a state machine design. The 
PAL1 OH/1 0020EV/EG8 has eight outputs and 20 inputs. Half of 
the outputs have 8 product terms and half have 1 2 product terms. 
All of the 20EV8 outputs use D-type flip-flops, while the 20EG8 
outputs are transparent latches. Two architectural fuses per 
output control the polarity of the logic and bypass the flip-flop for 
combinatorial asynchronous outputs. Two global product terms 
are present to Set and Reset all flip-flops. Maximum frequency 
of operation as a state machine is 125 MHz. 

Figure 24 details the macrocell for the PAL1 OH/1 0020EV8. 

The PAL16R8D Family 

This is the high-speed end (55 MHz) of the TTL state machine 
design solutions. PALI 6R8D family devices are available in three 
different versions (PAL16R8/6/4) with varying numbers of regis- 
tered and combinatorial outputs and I/O pins for convenient 
design fitting. The PAL16R8 family provides 64 product terms 
(eight per output). 



The software design options available for the PAL16R8D family 
are the traditional Boolean equations and the state machine 
syntax provided by PALASM 2 software. 

The PAL23S8 

At 33.3 MHz and in a 20-pin DIP, this device (Figure 25) provides 
the highest performance-to-board-space ratio. Designed for 
high-functionality state machines, this device offers eight output 
and six dedicated buried registers. It also offers four output 
macrocells with register bypass for asynchronous outputs. It has 
nine dedicated device inputs and four I/O pins giving a total of 23 
array inputs. For improved state machine initialization, it also 
offers dedicated synchronous Preset and asynchronous Reset 
product terms. Pin-compatible with the PAL16R8 family, this 
device offers varied distribution of 145 product terms for large 
designs and optimal fit. 

The software design options include Boolean equations and state 
machine syntax. 
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Figure 24. The PAL10H/10020EV8 Output Macrocell 
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The PAL22RX8A 

The 24-pin PAL22RX8A (Figure 26) provides eight configurable 
input/output macrocells with register bypass for a user-program- 
mable number of registered or combinatorial outputs and I/O 
pins. It also provides dedicated asynchronous reset and preset 
product terms for state machine initialization. The PAL22RX8A 
provides a product-term-controlled XOR gate with its sum-of- 
products logic. This allows users to configure individual macro- 
cells with D, J-K, S-R or T-type flip-flops. 

The requirements of transition and hold terms are dependent 
upon state selection, which varies for different applications. The 
PAL22RX8A provides both D-type and J-K type flip-flops, allow- 
ing the designer to select the one requiring fewer product terms. 

ThePAL22V10 

The PAL22V10 provides output macros similar to the 
PAL22RX8A. It provides both programmable polarity and regis- 
ter bypass. In addition, it has varied product term distribution. 

The PAL22\/10 has an advantage of a large number of product 
terms. It provides a varied distribution of eight to sixteen product 
terms per output. It also provides ten output macrocells, allowing 
for larger designs. 

The PAL32VX10/A 

The PAL32VX10/A (Figure 27) provides high functionality and 
density for PAL device designs. The PAL32VX10/A has config- 
urable flip-flops and initialization product terms for state machine 
designs. It offers unprecedented design density because of its 1 
input/output macros and a large number (8-1 6/output) of product 
terms. 

The PAL32\/X10/Aalso provides dual feedback paths from each 
output macrocell. This allows the macrocell register to be used 
as a buried state register in case the I/O pin is used as a dedicated 
input. The PAL32\/X10/A is designed for large state machine 
designs which have relatively few output transitions and require 
separate output and buried registers. The PAL32VX10/A pro- 
vides up to 10 buried registers for state machine design without 
any I/O penalty. The PAL32\/X1 0/Aalso allows trading-off output 
registers for state registers and vice-versa. This optimizes the 
use of device resources. 
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Figure 25. Block Diagram of PAL23S8 
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Figure 26. PAL22RX8A Output Macrocell 
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Figure 27. PAL32VX10/A Output Macrocell 
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Figure 28. Architecture of a PLS105 Device 
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Programmable Logic Sequencers (PLS) 

Another alternative for control logic is the PLS (Programmable 
Logic Sequencer) device (Figure 28). PLS devices have both 
programmable AND and programmable OR arrays. Forty-eight 
product terms are available, which can be shared between 
transition functions and output functions for all of the outputs, as 
dictated by the application needs. PLS devices also offer S-R flip- 
flops with buried registers for control sequencing. The flip-flops 
have a common clock line. The S-R type flip-flops are similar to 
the J-K type flip-flops discussed earlier, and require no hold 
terms. Consequently S-R type flip-flops are very efficient for large 
state machine designs. PLS devices can function as either as 
Moore machines or Mealy machines with a register on the output 
path. 

With the improvements in design tools, it is possible to specify 
designs directly from the state diagram. However, one format 
which has been used often in the past is the PLS table. This table 
is primarily used where the design is entered manually. The table 
is also useful for learning about and experimenting with se- 
quencer designs. Often, it is possible to manipulate sequencer 
programs using the table in ways which are not possible with 
some of the more rigid CAD languages. 

The table has provision for input and output signal pin allocation 
and naming, and a line for each of the 48 device product terms. 
The entries used in each line are shown in Figure 29. 



VALUE 


AND ARRAY TERMS 


OR ARRAY TERMS 


(input, present state) 


(next state, output) 


H 

L 


true input/state bit 
complement 
don't care 


Set Flip-flop 
Reset Flip-flop 
no change 



The table defines the output function and transition function. Both 
functions depend on the present sfafe and inputs. Each possible 
link path between two states uses a product term and becomes 
a line in the PLS table. The collection of all link path lines is a 
complete definition of the combinatorial logic required. The 
following example illustrates the relationship between a state 
diagram and the PLS table. 

Let us look at an arbitrary function as shown in Figure 30. The 
inputs are A,B,C, and D; the state names are SO, S1 , S2 and S3; 
and the outputs are W, X, Y and Z. To construct the PLS table, a 
state assignment is necessary. Selecting an arbitrary binary code 
for each state: 

50 = 00 

51 =01 

52 = 10 

53 = 11 

With the two state variables P and 0, the PLS table can be 
derived, and is shown in Figure 31 . 



PATH 

# 


INPUTS 


PRESENT 
STATE 


NEXT 
STATE 


OUTPUTS 




A B C D 


P 


P Q 


W X Y Z 


1 
2 
3 
4 
5 
6 
7 
8 
9 


H - - - 
L - - - 

- H H - 

- L - - 

- H L - 

- H - - 

- L - - 

- - - H 

- - - L 


L L 
L L 
L H 
L H 
L H 
H L 
H L 
H H 
H H 


H L 
L H 
H H 
L L 
L L 
H H 
L L 
L L 
L L 


L H L L 
H L L L 
L L L L 
L L L L 
L L L L 
L L L L 
L L L L 
L L L H 
L L H L 



Figure 29. Symbols Used In a PLS Table 



Figure 31. Example PLS Table 





Figure 30. Example State Machine 



Figure 32. "HOLD" Transitions 
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This is now a complete description of the combinational logic of 
the state machine. Every line in the table is a product term which 
is active when conditions for a transition to another state are 
satisfied. For this reason, the product terms in a PLS device are 
also known as transition terms. As the circuit moves from state 
to state, new transition terms become active as others become 
inactive. 

A comment may be appended to each term to aid documentation 
of the design. 

"Hold" Terms 

The flip-flops used in PLS devices are the S-R type. As discussed 
earlier, S-R flip-flops do not requij-e any product terms for holds 
and require product terms only for state transition or output 
generation (Figure 32). 

This may be seen in the two lines of the PLS table corresponding 
to the two link paths shown, in Figure 33. 



INPUT 
X1 


PRESENT 
STATE 


NEXT 
STATE 


L 
H 


LHLH 
LHLH 


- -HL 



Figure 33. Hold Transitions in a PLS Table 

The first path, corresponding to the first line in Figure 33, is a 
transition back to the same state. No change is thus required in 
any of the state variables, '-'entriescan thus be used. The first 
line has only '-' entries in the OR Array part of the table; the 
transition term is thus not connected and this line can be elimi- 
nated altogether. This is a logic minimization which can be done 
immediately when converting the flowchart or state diagram into 
a PLS table. The second path isto a different state, "0110". This 
does require a term since some state variables have to change. 
Also, note that "hold" state transitions are only free if no output 
changes are required, 

Complement Array Term 

An important addition to the PLS device's programmable AND- 
OR array is the complement array term (Figure 34). This is an 



extra OR line which is first complemented and then fed back as 
an additional input into the AN D array. Figure 35 shows the three 
possible configurations of the array at each transition term. The 
symbols shown are used in an extra truth table column for config- 
uring the complement array. 
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Figure 34. Connections of the Complement Array Term 

Proper usage of the complement array can lead to considerable 
savings in product terms. This can be seen in the typical use of 
the complement array in the implementation of default or "ELSE" 
transitions. Figure 36 shows a set of four transitions from state 
SA, three of which are enabled by one of three combinations of 
inputs A, B, and C, causing transitions to states SO, SI and S2. 
All other combinations cause a transition to state SX. 
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Figure 35. Complement Array Configurations 
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Figure 36. Transitions with a Default 
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Three product terms are required for transition to states SO, 81 
and S2 which encode the three different conditions. For the 
default transition to state SX, the remaining five condition combi- 
nations of A, B and C would have to be encoded, requiring five 
more product terms (Figure 37). But these terms can be thought 
of as "not the transitions to SO or 81 or 82". By using the comple- 
ment array (which provides this NOT signal), only one more 
product term is required for the default transition (Figure 38). 

The lastf ive terms have been compressed into one. If none of the 
three defined input combinations is true in state SA, the inputs to 
the complement array will all be zeros, and the complement array 
output will be a one. This activates the fourth transition term, 
causing a transition to 8X (Figure 39). 

The same complement array term can be used in other states too, 
since the propagated signal is ANDed with the present state code. 
Therefore all defaults can be realized with one term per state. 
PLS devices can be an effective solution in state- and branch- 
intensive applications, typically found in a variety of protocol con- 
trollers, waveform generators, and sequence detectors. In such 
applications, complex arrangements of multiple branches may be 
present. Multi-step control functions may include many loops of 
dozens of steps to count packets of data or input events as they 
occur. 

PLS devices can be characterized by the number of inputs, 
number of product terms, number of flip-flops and number of 
outputs. The fvlMl PLS family currently has three members; these 
are listed in Figure 40. 



Cn 


A 


B 


C 


PRESENT 
STATE 


NEXT 
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COMMENT 
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L 
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to 80 
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L 
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LHL 
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to 81 
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LHL 
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to 82 
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H 


LHL 
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Figure 37. Product Terms Required for Transition to 
State SX 
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Figure 38. Default Transition Using Compiement 
Array Term 



Software for PLS devices can usually accept input descriptions in 
three formats: truth tables, Boolean equations and state machine 
syntax. The first two require the user to work out the explicit logic 
assignment for all terms and code the detailed fuse states 
programmed or unprogrammed. The new state machine syntax 
allows design in a high-level syntax, and is easier to use. 
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Figure 39. Transitions Using Complement Array Term 
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DEVICE 


TOTAL 
NUMBER 
OF PINS 


DEDICATED 
INPUT 
PINS 


DEDICATED 

REGISTERED 

OUTPUT 

PINS 


I/O 
PINS 


LOGICAL 

PRODUCT 

TERMS/ 

OUTPUT 


SPEED GRADES 

FREQUENCY IN 

MHz 

MAX 


TTL 


PLS167 
PLS168 


24 
24 


14 
12 


6 
8 


- 


Total 48 
Total 48 


33 
33 


PLS105 


28 


16 


8 


- 


Total 48 


37 



Figure 40. Table of PLS Devices 



PROSE Sequencer (PMS14R21) 

The PROSE (PMS14R21 PROgrammable SEquencer) is a revo- 
lutionary architecture optimized for very high functionality state 
machine designs. It combines a PROM and a PAL array on a 
single device, utilizing the efficiencies of both for state machine 
design. 

The PROSE is a high-speed, 1 4-input, 8-output state machine. It 
consists of a 1 28x21 PROM array preceded by a 1 4H2 PAL array 
(see Figure 41). The PAL array is efficient for performing logic 
functions on a large number of input conditions, while the PROM 
array is optimal for implementing a large number of product terms 
(decision branches) and states. The combination allows a very 
efficient implementation of a state machine. 




Figure 41. PROSE PMS14R21 Device Architecture 

PROM Operation 

The PROM consists of 128, 21 -bit word locations. Each of these 
locations in the PROM can be viewed as a state. Eight bits 
Q(7-0) at each location define the outputs. The other 1 3 bits are 
used as feedback: six as condition select bits CS(5-0) to the PAL 
array, and seven to the PROM array itself for determining the next 
location to be addressed. Five of these seven constitute the low- 
order address bits A(4-0) for the next state. The remaining 2 bits 
are XF(I-O), inputs to the exclusive-OR gates which determine 
address bitsA6 and A5 in conjunction with PALoutput signals X1 



and XO. Effectively, the seven-bit address of every PROM 
location is generated by seven bits of the PROM data and 
constitutes the present state. The next PROM location is the one 
whose address is stored as data in the present location. Eight of 
the PROM data bits are used as outputs. 

PAL Array Operation 

The PAL14H2 has 14 complementary inputs and 2 active-HIGH 
outputs (XI and XO). Each of the 2 outputs is a sum of eight 
product terms. Eight of the inputs, 1(7-0), are from an external 
source, and are used for encoding conditions for branch selec- 
tion. Six inputs, 08(5-0), are from internal feedback from the 
PROM, and are used for selection of the conditions that deter- 
mine branching when decisions are involved in state transitions. 

Just as PROMs are efficient for direct control sequencing, PAL 
arrays are very efficient for encoding input conditions. The 
function of the PAL14H2 in the PROSE device is to encode all 
eight test input conditions along with the state information. Based 
upon these test conditions being true it inverts the polarity of the 
two most significant PROM address bits. This causes the PROM 
to address a different location (state), which is equivalent to 
conditional branching. The PALASM 2 design software selects 
these branch locations automatically and isolates the designer 
from low-level details. 

With 128 PROM locations, the PROSE device can be used for 
large state machines. It is very effective in single-thread control 
functions characterized by long sequences of events generated 
by one 'Irigger' event. There may be a single repetitive decision 
evaluated many times or a long chain of decisions evaluated for 
sequential inputs. The output control stream may be quite wide 
and complex depending on the system. Generally, only narrow 
branches (two or four-way) are needed. Typical applications 
occur in controllers for error detection, numerical processing, and 
data encryption and decryption. The PROM output structure can 
hold any pattern of eight-bit values to be read out as needed. No 
trade-off has to be made with branching complexity to providethis 
output flexibility. Byte or word-oriented expansion can be made 
by arranging multiple devices in parallel. 

PROSE state machines are fully supported by PALASM 2 soft- 
ware. A complete design example of a PROSE-based state 
machine is included on page 2-122. 
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Fuse Programmable Controller 
(Am29PL141) 

The Am29PL141 is a high-performance, single-chip, fuse-pro- 
grammable controller (FPC). This chip is designed to allow the 
implementation of complex state machines and controllers by 
programming the appropriate sequence of microinstructions in 
the on-chip microprogram memory. 



Large state machines require the capability of nested routines. 
Nested state machines are control sequences that may be 
invoked by one or more control functions, but when invoked, 
suspend the "calling" control sequence. Following completion of 
the low-level routine, the higher-level "calling"sequence resumes 
where it left off. The nested task may have modified part of the 
machine environment or the data processed by the system. The 
Am29PL141 provides stacks which allow nested subroutines. 
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Figure 42. Architecture of Am29PL141 
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With its on-chip intelligent microprogram address sequencer 
(Figure 42), high-speed 64 x 32-bit PROM-based microprogram 
memory, on-chip pipeline register, and an on-chip diagnostics 
serial shadow (SSR) register, this chip offers two major advan- 
tages: fast operation, and testability. This device is available with 
a 20-MHz clock rate (50-ns cycle time) in a 28-pin dual-in-line 
package. 

A microprogram address sequencer is the heart of the FPC, and 
performs all control sequencing functions. TheAm29PL141 has 
29 high-level microinstructions which include jumps, loops, sub- 
routine calls, and multiway branching. These microinstructions 
can be conditionally executed based on the test inputs. The 
output generation function is similar to that of the PROSE device, 
where output data is stored in PROM locations. A block diagram 
of the FPC is shown in Figure 42. 

The FPC consists of five main logic blocks: the microaddress 
control logic, condition code selection logic, branch control logic, 
microinstruction decode logic, and the microprogram memory. 

The microaddress control logic generates the proper sequences 
of addresses for the microinstructions in the microprogram 
memory. A PLA microinstruction decoder is used to decode the 
opcode from the microinstruction. This microinstruction decoder 
also generates all necessary control signals for executing each 
microinstruction. Depending on the microinstruction, the mi- 
croaddress can be generated from a number of different sources: 
branch control, microprogram counter (PC+1), subroutine regis- 
ter, or the microprogram counter (PC). 

A six-bit stack is available for looping and subroutine calls. When 
the loop counter is not being used for counting purposes, it can be 
configured with the stack register to implement a two-deep stack 
for nested loops and nested subroutine calls. The condition code 
selection logicconsistsof an 8:1 test multiplexer. Oneofeighttest 
conditions can be selected on which to base the conditional 
instructions. The polarity of the selected condition code input is 
controlled by the POL bit in the microword. 

The branch control logic is used for generating multiway branch 
addresses from the external inputs T(5-0). This logic also per- 
forms the comparison between the inputs and the pipeline data 
field for executing the COMPARE instruction. An EQ signal is 
generated as an output of the COMPARE instruction and can be 
used for a condition code test in the next clock cycle. 

With its 64-location PROM, the Am29PL141 provides up to 64 
states for state machine designs. It offers the most sophisticated 
control sequencing circuitry, which allows multiple branching and 
nested suiaroutines. This can enhance the functionality of the 
device even beyond 64 states. 

The Am29PL1 41 is supported by an assembler (ASM1 4X) which 
allows the user to write instructions in a high-level language 
syntax. The assembler then converts the high-level statements 
into the specific device microinstructions and PROM data. The 
JEDEC device output file is also generated for programming the 
device PROM. 



State Machine Design Tutorial 

The following discussion is a tutorial on state machine design 
methodology. In this tutorial we will use the PROSE PMS1 4R21 
device and explain the state machine design process. The 
PROSE PMS14R21 is supported by the state machine syntax of 
PALASM 2 design software. This software syntax is identical to 
and can be used for programming other PAL and PLS devices. 

Therearefourstagesinastatemachinedesign. The first requires 
conceptualizing the design and selecting the correct device; 
second converting it to its state diagram representation; third 
converting the state diagram to its state machine syntax text file; 
and fourth assembling the file and programming the device. To 
explain all the stages in the design process we will take a general 
design example and then go through the stages of the design 
process. As an alternative design methodology, we will also 
examine the flowchart state representation in the design 
example. 

Conceptualizing the Design 

This is the first step in a state machine design process. In this step 
a complete functional description of the design is formalized 
along with the requisite truth tables and/or timing diagrams. We 
will consider a simple traffic signal controller example for illustra- 
tive purposes. 

To begin with, let us visualize the scene of a traffic intersection 
(Figure 43), simplified to two one-way streets. 
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Figure 43. A Traffic Intersection 
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The traffic intersection shows two one-way streets: one in direc- 
tion 1 and the other in direction 2. Each direction has a signal 
consisting of red, yellow, and green lamps. These lamps are 
activated with appropriately-named active-HIGH signals (RED,, 
YEL,, GRN,, REDj, YEL^, GRN^). The signals are generated by 
the state machine controller. Also, each direction has a sensor 
which provides an active-HIGH signal (SEN,, SEN^) that indi- 
catesthe presence of a vehicle. Thecontroller has to manage this 
intersection, with the sensors as inputs and the lamps as outputs. 

The assertion of SEN, or SENj signals a request for a green light 
for traffic in the corresponding direction. Once SEN, is HIGH and 
SENj is LOW, indicating traffic in direction 1, the GRN, light 
should be on at all times. Similarly, when SEN^ is HIGH and SEN, 
is LOW, the GRNj light should be on at all times. When SEN, and 
SENj are both HIGH, indicating traffic in both directions, the traffic 
controller should cycle, allowing equal periods of green signals 
(GRN, and GRNj) for both directions. This cycling is also done 
when SEN, and SEN^ are both LOW, indicating no traffic. 
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Whenever the signals change from direction 1 to direction 2, the 
appropriate yellow light (YEL, or YEL,) is turned on for the 
transition duration. This allows time for the traffic in the intersec- 
tion to pass, before allowing traffic in the other direction. The 
timing diagram is shown in Figure 45. When both sensors 
indicate the presence of traffic (SEN,, SEN, = H), extra time is 
allowed for traffic to pass, before the signalschange the direction. 
The extra time is also allowed when there is no traffic in both 
directions (SEN,, SEN, = L). The timing diagram for this is shown 
in Figure 46. Finally, the traffic signal controller shown includes 
the system clock (CLK), and an initialize or reset signal (INIT). 
INIT drives the controller to an initial state. 

Before we proceed to the next step, we mustselectadevice which 
will implement this design. 

In the previous section we found that the basic selection of a state 
machine controller device depends upon the speed and I/O 
considerations. The PROSE PMS14R21 has eight inputs and 
eight outputs in addition to the initialization input signal PRESET, 
which are sufficient for this design. Another selection criterion is 
the intelligence of the PROSE device. The device should be able 
to implement all of the states required by thedesign. The PROSE 
device is the only 24-pin SKINNYDIP device which has the 
capacity for up to 128 states. Later we will see that these are 
sufficient for implementing our design. 



Figure 44. Table for Traffic Flow Direction 
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Figure 45. Fast Traffic Flow Transition 
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DIRECTION 1 



S3 S4 S4 
• DIRECTION 2 



S4 S4 S5 Sg S7 Sq Sq 
DIRECTION 2 ► DIRECTION 1 



Figure 46. Slow Traffic Flow Transition 



State Machine Representation 

Once the design problem is defined, the next stage is to convert 
it to its state machine representation. State machines can be 
represented by bubbie-and-arrow state diagrams (Figure 4). 
Each bubble represents a state, with each arrow representing a 
transition. A second method of state machine representation is 
with flow charts, details of which are explained on page 2-104. 
State diagrams are by far the most popular representation. 
Flowcharts are included for completeness; you need not worry if 
you are not comfortable with them. 

The transition from the present state to the next state is depend- 
ent upon present state and input conditions. Similarly, outputs 
can be generated based upon the present state (called a Moore 
type state machine), or present state and input conditions (called 
a Mealy type state machine). Ourdesign example is a Moore type 
state machine. The PALASM 2 syntax supports both Moore and 
Mealy type state machines. 

The first step for constructing a state diagram is to define states. 
All of the specific events in a design are assigned a state. These 
include: change of output signals, response to a change in input 
signals, or time delays. For example, in Figure 45, different states 
(S„, Sj and SJ have been assigned where the outputs change. 
Often states with the same outputs can be merged into one 
unique state, as shown in the case of state S^. Once the states 
are defined as bubbles, the transition from one state to another 
(called control sequencing) and outputs generated for each state 
(output generation) are defined. The task is to use the timing 
diagram along with the functional description of the design to 



develop control sequencing and output generation functions for 
state representations. The state diagram is an improvement over 
the timing diagram since it also provide additional branch decision 
information. 

Each of the functions performed by a controller can be viewed as 
one of the following operations. We will show how each of these 
operations determine the control sequencing and the output 
generation for the state machine, and their state diagram and 
flowchart representations. 

• Arbitration 

• Multiple condition testing 

• Event monitoring 

• Control signal generation 

• Timing delays 

Arbitration is the decision-making process, it is inadequately 
represented in timing diagrams with arrows (Figures 45 and 46). 
The table in Figure 44 represents the functional requirement for 
arbitration. Based upon the sensor inputs, the state machine 
stays in the same state S^, when SEN,=H and SEN2=L (for 
allowing traffic in direction 1) or transitions either to state S3 
(SEN,=L, SEN3=H), or to state S, (SEN,, SEN^ are both HIGH or 
LOW). This is represented in the state diagram and flowchart as 
the transition from S^ (Figure 47). 

Sg can also be thought of as an event monitoring state where the 
controller waits for the command from the sensors. Based upon 
the two bits SENj, and SEN,, the state machine moves to different 
next states. This is also an example of multiple condition testing. 
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) SEN 1 • SEN 2 

SEN 1 • SEN 2 
SEN 1 • SEN^ 




a. State Diagram 




a. State Diagram 
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407 42a b. Flowchart 

Figure 47. Conditional State Transitions 

An example of control signal generation (Figure 48) is signal 
REDj which is used to turn on the red light for direction 1 . This 
control signal generation example requires assigning output 
signal name(s) (REDj) to the state(s) (S^, S,, S^ and S3) where it 
is asserted (see Figure 46). For simplicity, we are only consider- 
ing the outputs which are asserted. Other outputs in their default 
states must be accounted for in the final state machine represen- 
tation. 

The signal GRN, is kept asserted for two extra clock cycles 
(Figure 46) to allow the traffic to pass when changing the traffic 
pattern from direction 2 to direction 1 . The state diagram repre- 
sentation (Figure 48) requires assigning unconditional transition 
(control sequencing) from one state to another, for a number of 
states (S,, Sg and S3), depending upon the required time delay, it 
also requires assigning the signal name (GRN,) to all three states. 



REDj.GRN, 




REDj.GRN, 



S2 

RED2, GRN, 



REDj.YEL, 



407 41a b. Flowchart 

Figure 48. Output Assignment to States 

which results in signal assertion for the extra two clock cycles. 

Once the design has been converted into a state diagram (Figure 
49) orf lowchart (Figure 50) representing all of its timing and func- 
tional requirements, the next stage involves conversion to state 
machine syntax for its textual representation. 

State Machine Syntax 

The PALASM 2 programming software design file (also called the 
PAL device Design Specification or PDS) allows both conven- 
tional Boolean equations and state machine design constructs. It 
also allows both Mealy and Moore types of designs with extensive 
logic minimization capabilities and easy menu-driven operation. 
Other programming software packages which provide similar 
design capabilities are also available from various vendors. 
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Figure 49. State Diagram — ^Traffic Signal Controlier 
A PALASM 2 state machine design file consists of four sections: 

• Deciaration section 

• State section 

• Condition section 

• Simulation section (optional) 

Declaration Section 

As illustrated in Figure 51, tliis section stores the basic informa- 
tion about the device type and pin names, it is also used to store 
some bookkeeping information such as the company name and 
design revision numbers. 




GRN,.RED2 



GRN,,RED2 



YEL,.RED2 




RED,,GRN2 



RED,, CRN 2 



YELg.RED, 



TITLE TRAFFIC CONTROLLER 

PATTERN STATE MACHINE 

REVISION 1 

AUTHOR J. ENGINEER 

COMPANY. MONOLITHIC MEMORIES 

DATE JANUARY 30, 1987 

CHIP S_MACHINE PMS14R21 

CLOCK DCLOCK SENl SEN2 12 13 14 
15 16 17 SDI GND 
RESET SDO REDl YELl GRNl RED2 
YEL2 GRN2 01 00 MODE VCC 

Figure 51. Declaration Section 



Figure 50. Flowchart— Traffic Signal Controller 



State Section 

The state section contains all the control sequencing and output 
generation information. The state section also stores all of the 
state initialization values and default parameters. As illustrated in 
Figure 52, it initially defines the type of state machine being 
designed with keywords MOORE_MACHINE or 
MEALY_MACHINE. For the PROSE device it also selects the 
use of pin 1 3 as either MASTER_RESET or OUTPUT_ENABLE. 
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It also defines the default output logic values as HIGH or LOW 
with the keyword DEFAULT_OUTPUT. Once an output is as- 
signed a default logic value, it is explicitly mentioned only when 
asserted; otherwise it is assumed to be in its default logic value. 
This usually simplifies the task of drawing a state diagram (see 
Figure 49). Similarly, the keyword OUTPUT_HOLD defines the 
outputs which retain their previous values unless explicitly men- 
tioned in a new state. The last two keywords also help isolate the 
user from the implementation details of J-K or D-type flip-flops 
when using PAL-device-based state machines. The keyword 
OUTPUT_HOLD is not used in this design. 



MOORE_MACHINE 

MASTER_RESET 

DEFAULT_OaTPOT /REDl /YELl /GRNl /RED2 /YEL2 /GRN2 

POWER_UP := VCC -> SO 

Figure 52. State Section 

Once the default parameters are defined, all of the state transi- 
tions are represented. Only state names are used, since the state 
values are assigned automatically by the software. Every state 
transition (Figures 53 and 54) can depend upon the present state 
for direct control sequencing (state_x := VCC -> state_y), or the 
present state and input conditions for conditional control se- 
quencing (state_x := Condition_name -> state_y). The syntax 
has a direct relationship to the state diagrams. For conditional 
control sequencing, the syntax also allows a default state when 
none of the conditions of transitions to other states are satisfied. 

The output generation can also be represented easily. Outputs 
are specified from each present state. These outputs can be 
Moore type (state.OUTF := output_name) or Mealy type 
(state.OUTF := Condition_name -> output_name), depending 
upon whether or not conditions are used to generate outputs. 




STATE_A:. V^c -> STATE_B 
a. Direct Control Sequencing 




STATE_C:. Ii -> STATE_D 
+1, -> STATE_E 

b. Conditional Control Sequencing 

[state Ai 




STATE_A.OUTF:. 0| 
c. Moore Machine 



STATE_A.0UTF:-li ->Oi 
d. Mealy Machine 



Figure 53. Direct Relationship Between State Diagram 
and State Syntax 

Condition Section 

This is the third section of the PALASM 2 state machine file 
structure. This section assigns names toallof the input conditions 
used by the state section for both control sequencing and output 
generation. 

Design Exampie Syntax 

All of the state machine tasks represented by the state diagram 
can be converted easily to textual format. The transition from 
state S, to Sj and S3 (Figure 55) illustrates how outputs RED^ and 
CRN, are generated. These represent control signal generation 
and timing delay tasks of a state machine. Similarly, the arbitra- 
tion function can be converted to textual format, showing the 





STATE _A 






,, 






STATE_B 




STATE, 
a. Dire 


A:=Vcc-> 
:t Control S 


STATE_B 

squencing 




STATE A 
O1 






, , 






STATE B 






STATE_C:=. I, -> STATE_D 
+1, -> STATE_E 

b. Conditional Control Sequendng 




STATE_A.OUTF:= 1 1 -> O , 

d. Mealy Machine 



STATE_A.OUTF:=Oi 
c. Moore Machine 



Figure 54. Direct Rolatlonship Between Flowchart and Stato/Condition Syntax 
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transition from S^ to various other states (Figure 56). This is also a 
good example of multiple condition testing and event monitoring. 



so 


OUTF 


:= RED2 * 


GRNl 


SI 


OUTF 


:= RED2 * 


GRNl 


S2 


OUTF 


:■= RED2 * 


GRNl 


S3 


OUTF 


:= RED2 * 


YELl 



Figure 55. Output Generation Representation 



SO 



C3 -> 31 

+ CO -> SI 

+ CI -> S3 

+-> SO 



Simulation Section 

The simulation section is similar to that used for conventional 
Boolean equation simulation. Please see page 4-137 for details. 

Assembly and Programming 

Once the state diagram is converted to its textual constructs, the 
file (Figure 57) can be easily assembled by using PALASM 2 
software. The software then generates the device programming 
JEDEC format file. This JEDEC format file can be downloaded to 
a device hardware programmer provided by various vendors. For 
detail please see page 4-1 69 of the PALASM 2 software section. 



Figure 56. Control Sequencing Representation 



Traffic Signal Controller 

TITLE TRAFFIC CONTROLLER 
PATTERN STATE MACHINE 
REVISION 1 

AUTHOR J. ENGINEER 
COMPANY MONOLITHIC MEMORIES 
DATE JANUARY 30, 1987 

CHIP S_MACHIN PMS14R21 

CLOCK DCLOCK SENl SEN2 12 13 14 
15 16 17 SDI GND 
RESET SDO RED! YELl GRNl RED2 
YEL2 GRN2 01 00 MODE VCC 



MOORE_MACHINE ; Defined as a Moore Machine 
MASTER_RESET ; Initialization 
DEFAULT_OUTPUT /REDl /YELl/GRNl /RED2 /YEL2 /GRN2 
POWER UP: -VCC -> SO ; Power up state defined 



SO :- C3 -> SI 


Traffic in direction 1 


+ CO -> SI 


Slow signal change 


+ CI -> S3 


Fast signal change 


+-> SO 


Otherwise stay in state SO 


SI :- VCC -> S2 


Time delay 


S2 :- VCC -> S3 


Time delay 


S3 :- VCC -> S4 


Time for yellow 


S4 :- C3 -> S5 


Traffic in direction 2 


+ CO -> 85 


Slow signal change 


-I- C2 -> S7 


Fast signal change 


-l~> S4 


Otherwise stay in state SO 


S5 :- VCC -> S6 


Time delay 


S6 :- VCC -> S7 


Time delay 


S7 :- VCC -> SO 


Time for yellow 



Figure 57. Tiie Design File 
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SO.OUTF 


= 


GRNl * 


RED2 


Sl.OUTF 


= 


GRNl * 


RED2 


S2.0UTF 


= 


GRNl * 


RED2 


S3.0UTF 


= 


YELl * 


RED2 


S4.0UTF 


= 


REDl * 


GRN2 


S5.0UTF 


= 


REDl * 


GRN2 


S6.0UTF 


= 


REDl * 


GRN2 


S7.0UTF 


= 


REDl * 


YEL2 


CONDITIONS 






CO = /SENl 


* /SEN2 


CI = /SENl 


* SEN2 




C2 = SENl 


" /SEN2 




C3 = SEN] 


L 


" SEN2 





Allow direction 1 traffic 



Change from direction 1 to 2 
Allow direction 2 traffic 



Change from direction 2 to 1 



Condition for no traffic 

Condition for direction 2 

Condition for direction 1 

Condition for traffic in both directions 



SIMULATION 



TRACE ON CLOCK SENl SEN2 REDl YELl GRNl RED2 YEL2 GRN2 



SETF RESET /CLOCK 

CLOCKF CLOCK ; STATE TRANSITION ONLY ON 1ST CLOCK 

CLOCKF CLOCK ; LIGHTS CHANGE ON 2ND CLOCK 

CHECK /REDl /YELl GRNl /YEL2 /GRN2 RED2 

SETF /SENl /SEN2 

CLOCKF CLOCK 

CLOCKF CLOCK 

CHECK /REDl /YELl GRNl RED2 /YEL2 /GRN2 

CLOCKF CLOCK 

CHECK /REDl YELl /GRNl RED2 /YEL2 /GRN2 

CLOCKF CLOCK 

CHECK REDl /YELl /GRNl /RED2 /YEL2 GRN2 

CLOCKF CLOCK 

CHECK REDl GRN2 

CLOCKF CLOCK ' 

CHECK REDl YEL2 

CLOCKF CLOCK 

CHECK /REDl /YELl GRNl RED2 /YEL2 /GRN2 

SETF /SENl SEN2 

CLOCKF CLOCK 

CHECK /REDl /YELl GRNl RED2 /YEL2 /GRN2 

CLOCKF CLOCK 

CLOCKF CLOCK 

SETF SENl /SEN2 

CLOCKF CLOCK 

CLOCKF CLOCK 

CHECK YEL2 REDl 

CLOCKF CLOCK 

CHECK GRNl RED2 



TRACE OFF 



Figure 57. The Design File (Cont'd.) 
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Introduction 

The microprocessor has become the standard processing 
element in a wide variety of digital systems. Applications ranging 
from ignition control in automobiles to high performance engi- 
neering workstations utilize microprocessors. Usually a micro- 
processor with its associated VLSI components (the dynamic 
RAM, EPROM and standard microprocessor peripherals) can 
implement most application problems. The microprocessor's 
general-purpose nature, ease of use, and cost-effectiveness 
mal<es it the device of choice in all but the most performance- 
intensive or specialized applications. 

All the designer needs is the necessary "glue logic" to connect 
these VLSI components together. A typical microprocessor 
based design is illustrated in Figure 1 . The heart of the design is 
the microprocessor. Typically, DRAMs provide the datastorage 
and EPROMs provide the non-volatile program storage. De- 
pending on the application such standard microprocessor periph- 
erals as DMA controllers, serial I/O or disk controllers are usually 
present. Glue logic is needed to connect these devices together 
to create a working system. For example, memories and periph- 
erals need to be selected based on the desired address range, 
afunction called "address decoding"; high-speed address strobe 
signals RAS and CAS must be generated for the DRAMs; wait 
states must be generated for the slower memory and peripheral 
chips; arbitration and prioritization between different bus masters 
(system components which control the bus transactions: DMA 
and CPU) is necessary in multi-master systems. 



Programmable Logic Devices as 
Standard Product Glue 

The typical microprocessor system shown in Figure 1 highlights 
several of the places where programmable logic is used to 
implement the "glue" in a microprocessor system. These simple 
random logic and control-oriented functions are necessary to 
hold the standard products together to construct a complete 
system; thus the name "glue". The common glue functions in a 
microprocessor system share several application needs, and are 
best satisfied by programmable logic devices. 

Microprocessor glue functions compete for board space with LSI 
devices in almost all cases. If the glue logic uses too much board 
space, the design will need to sacrifice features (reducing the 
design's value to the customer) or add extra boards (increasing 
the cost). In either case the cost/value ratio increases, which is 
undesirable. Programmable logic devices can reduce the board 
space required for glue functions because of the density advan- 
tage the devices have over standard SSI/MSI devices. This 
makes more board space available for standard VLSI devices or 
customized peripherals. 

Since the microprocessor glue functions tend to be control- and 
timing-oriented, they are subject to many changes as the board 
design evolves, processor speeds increase, or new standard 
products are incorporated in the design. In complex systems, 
logic functions are commonly "tuned"to optimize bus band-width, 
peripheral access time, or memory timing only after the system is 
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Figure 1. Typical Microprocessor System 
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running. The optimum balance is usually too difficult to predict 
and can only be found via experimentation. Because of their 
programmable nature, PLDs offer the designer the capability of 
changing the logic at any time. Thus designs can quickly adjust 
to changing requirements, or can be improved without adding 
components or changing the circuit board. 

The address decoding, timing, and simple state machines are 
"shallow" and "wide" logicf unctions. Shallow logicfunctions need 
only a few levels of gating at a high speed. The standard AND- 
OR array architecture offers 3 to 5 levels of gating (inverter, AND- 
gate, OR-gate, flip/flop-master and flip/flop-slave) at very high 
speed (1 to 1 8ns). Wide logic functions require a large number 
of inputs on each gating structure. For example, address decode 
functions may need over 10 inputs to decode a microprocessor 
address and an 8-bit refresh counter would require over 8 inputs 
for the most significant bit. The wide-input structure of PLDs, 
where practically all signals are available (in both polarities) at 
each gating structure, is a good match for this application 
requirement. 

PLDs offer the designer several important charcteristics which 
match well with the needs of glue logic functions. Some common 
application areas will help illustrate the advantages of PLDs in 
microprocessor glue applications. 





PROGRAMMABLE 




LOGIC 


GLUE LOGIC NEEDS: 


ATTRIBUTES: 


• Minimum Board Space 


• High Logic Density 


• Easy and Frequent 


• Quick and Easy Device 


Design Changes 


Modifications 


• High Speed Simple 


• Optimized for Fast 


Logic Functions 


Shallow Logic 



Figure 2. Table for Glue Logic Needs and Programmable 
Logic Attributes 



Address Decoding 

The single most common application of combinatorial PAL de- 
vices is address decoding (Figure 3). In order for a microproces- 
sor to communicate with a peripheral or memory chip a chip select 
signal must be generated by the glue logic. Usually a combination 
of address signals and control signals are required to generate 
the chip select for microprocessor memory and peripheral de- 
vices. For example, in the 68000 microprocessor some of the 
address signals, the address strobe signal, and the read/write 
signal must participate in the address decode function. The logic 
needed to generate the chip select is many times simply a NAND 
of the true or complement of the necessary address and control 
inputs. 

In high-performance microprocessors, the chip select must be 
generated fast enough to allow single cycle access (no wait 
states) to memory. PLDs provide the high-speed wide-gating 
logic functions necessary to generate these chip select signals. 
Additionally, the addresses for peripherals and memory devices 
are often not known at the beginning of the design or may change 
many times during the design. PLDs can be easily changed to 
adjust to such modifications. More on address decoding is on 
page 2-35. 

Wait-State Generation 

Wait-state generation is in many ways related to the address 
decode function (Figure 4). After the addressed chip is selected, 
the CPU must be notified that the data transfer can be completed. 
The signal to the CPU which identifies the end of a data transfer 
may need to be extended if a slow device is being accessed. 
DRAMs, EPROMs, EEPROMs, and microprocessor peripherals 
may all need different delays depending on their access time bus 
bandwidth requirementsandfrequency of access. Usually there 
is at least afast access (no wait-states) a slower access (one wait- 
state) and a longer access (several wait-states). The wait state 
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Figure 3. Address Decoding Application Example 
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generation logic must create a variable delay depending on the 
device being presently accessed. Usuallythedelayonlydepends 
on the present access and is independent of previous accesses. 

A registered PAL device has the capability of implementing the 
simple state machine needed for wait-state generation. Addition- 
ally, the PLD contains enough logic to provide for a variety of 
memory, peripheral and processor speeds, without requiring 
additional board space. Thus a single design can have an 
extended lifetime, or provide a quick time to market when faster 
or lower-cost processors, peripherals and memories are 
available. 

DRAM Control 

DRAM control logic consists of several distinct functions: address 
multiplexing, RAS/MS/CAS generation, refresh request genera- 
tion, and memory request arbitration. Address multiplexing, as 
the name implies, involves multiplexing the upper and lower 
processor addresses to the DRAM chips. RAS/f«1S/CAS genera- 
tion involves the generation of the three separate DRAM timing 
signals: Row Address Strobe, Column Address Strobe, and Mux 
Select (on the address multiplexer). Tight timing relationships 
between the address multiplexer and the RAS/MS/CAS logic is 
necessary in high-performance systems. Refresh request gen- 
eration logic is used to time the interval between DRAM refresh 
cycles. The logic is usually a wide counter using a slow clock. 
Memory request arbitration logic decides whether the refresh 
request will be granted or a memory access will be allowed. 



Microprocessor Peripheral Interface 

Microprocessors are usually connected to a number of VLSI 
peripherals performing various ancilliary functions. In many 
cases a design calls for the use of "incompatible" microprocessor 
and VLSI peripherals. Cost, performance, availability, backwards 
compatibility, orother constraints sometimes dictate that different 
families of devices need to coexist in the same design. Glue logic 
is necessary to allow these incompatible devices to communi- 
cate. Sometimes logic signals need to be translated. The most 
obvious case is probably the different way a processor read or 
write cycle is encoded. One popular processor uses two separate 
read and write signals (/R or /W) with the active state indicating 
the presence of a valid address. Another popular processor uses 
an "address valid" signal to indicate the presence of an address 
and a single read/write signal to indicate the type of access. 
Peripherals hooking onto the other processor's bus will need 
some simple logicto translate the meaning of these signals. Often 
signal timing is also different between families. This requires 
signals to be delayed or qualified by glue logic. 

The two most popular processor families for high performance 
designs are the 8086/801 86/80286 family and the 60000/6801 0/ 
68020 family. Each processor has its own peculiarities which the 
designer must take into account when hooking up a peripheral 
device. Subsequent discussion covers several applications 
examples for each family. Finally, PLDs can also be used to make 
custom microprocessor peripherals. An application example of a 
PLD-based interrupt controller is shown on page 2-172. 



PLDs have a variety of characteristics which make them well 
suited for DRAM control applications. High speed and minimum 
skew specifications are important considerations for address 
multiplexing, RAS/MS/CAS generation and refresh arbitration in 
high performance designs. High-speed PLDs have significantly 
tighter skew specifications than the equivalent multi-device SSI/ 
MS! solutions. This makes PLD-based DRAM control applica- 
tions easier to design and higher performance than SSI/MSI 
based designs. Detail on DRAM designs is provided on 
page 2-1 79. 



Conclusion 

PLDs offer the designer several important characteristics which 
match well with the needs of glue logic applications. The following 
application examples will illustrate the PLD's versatility at imple- 
menting microprocessor glue functions. 
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Figure 4. Wait State Generator Application Example 
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Overview 

The 8086 is a general-purpose 16-bit microprocessor CPU. 
The CPU has a 16-bil data bus multiplexed with 16 address 
outputs. There are 4 additional address lines (segment ad- 
dresses which are multiplexed with STATUS) that increase the 
memory range to 1 Mbyte. 8086 addresses are specified as 
bytes. In a 16 bit word, the least significant byte has the lower 
address and the most significant byte has the higher address. 
This is compatible with 8080, 8085, Z80 and PDP1 1 address- 
ing schemes but differs from the Z8000 and 68000 
addressing. 

The data bus is "asynchronous", i.e., the CPU machine cycle 
can be stretched without clock manipulation by inserting Wait 
states between t2 and 13 of a read or write cycle to accom- 
modate slower memory or peripherals. Unlike the 68000, the 
8086 has separate address space for I/O (64 kBytes). 

The 8086 can operate in MIN. or MAX. mode, t^aximum mode 
offloads certain bus control functions to a peripheral device 
and allows the CPU to operate efficiently in a co-processor 
environment. A brief discussion on both the MIN. and MAX. 
modes are as follows: 

MIN. mode: I/O addressing is defined by a HIGH or 

the lO/M output, and activated by the RD 
output for reading from memory or I/O, or 
activated by the WR output for writing to 
memory or I/O. 

DMA: The Bus is requested by activating the HOLD input to 
the 8086. Bus Grant is confirmed by the HLDA output 
from the 8086. 



MAX. mode: I/O operation is controlled by two outputs 

(3086 plus from th e 8288. 

8288) lORC: active during Read from I/O 

lOWC: active during Write to I/O 
MRDC : active during Read from memory 
MWTC: active during Write to memory 

DMA: The Bus is requested a nd B us Grant is acknowledged 
on the same pin (RQ/GTO OR RQ/GTT) through a 
pulsed handshake. 

Interrupts in Min. and Max. Modes: 

Interrupt is requested by activating the INTR or NMI inputs to 
the 8086. 



Interrupt is acknowledged by the INTA pin on a MIN. mode 
6086 or by the INTA pin on the 8288 in MAX. mode. 



Note: There is no RD or lORC during the interrupt- 
acknowledge sequence. 
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8086 and Am7990 LANCE 

The LANCE, Am7990, has been designed to be interfaced 
easily with the popular 16-bit microprocessors (8086/80186, 
68000, Z8000, LSI-1 1 ). Most of the interface logic is embedded 
inside the chip and is program selectable. 

Although the LANCE itself has a multiplexed bus, it can easily 
be interfaced to demultiplexed buses with a minimal amount of 
effort. The following designs assume that the processor and 
the LANCE reside on the same board. Address buffers and 
data transceivers are set up to be shared between the proces- 
sor and the LANCE. All of these designs use PAL devices to 
reduce the parts count. 

The 8086 to LANCE interface requires a different Bus Request 
handshake, depending on whether the 8086 is configured in 



MAX. or MIN. mode. The 8086 has a bidirectional signal for 
both Bus Request and Bus Grant (RQ/GT). Both Bus Re- 
quest (RQ), and Bus Grant (GT) to/from 8086 are one CPU 
clock wide, and are synchronous to the CPU clock. Figure 1 
shows a PAL device design for the conversions in MAX. 
mode. This PAL device is utilized to include other external 
logic requirements for interfacing the LANCE to 8086. The 
interface diagram is similar to the one for the 80186 to 
LANCE interface except for the changes made in program- 
ming the PAL device. The interface timing diagram is shown 
in Figure 3. The PAL device design file is shown in Figure 2. 

Figure 4 shows a block diagram of the 8086 to Am7990 
interface, in MIN. mode. The interface also employs a PAL 
device to minimize parts count. The PAL device equations 
are given in Figure 5. 



Figure 1. 8086 RG/GT, Am7990 HOLD/HOLDA Conversion PAL Device 
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TITLE 8086 RQ/GT TO LANCE HOLD/HLDA CONVERTOR 

PATTERN PATOOl 

REVISION 01 

AUTHOR RASOUL OSKOUY 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/06/87 

CHIP CONVERT PAL16R4 

CLK NC HOLD /GT NC NC NC NC NC GND 
NC NC NC /HLDA /D2 /R2 /R1 NC NC VCC 



EQUATIONS 

Rl := HOLD 
D2 := /Rl 



8086 RQ/GT conversion to LANCE HOLD/HLDA 
both REQ and GRANT are one clock wide and 
are synchronous to CPU clock 



R2 := Rl*/D2 
+ /R1*D2 
HLDA := GT*/R2 

+ HLDA*/D2 



Figure 2. 8086 RG/GT, Am7990 HOLD/HLDA Conversion PAL Device Design Fiie 



TITLE 8086 MIN MODE TO LANCE INTERFACE 

PATTERN PAT 

REVISION 01 

AUTHOR KHUYNH NGUYEN 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/06/87 

CHIP LANC_INT PAL16L8 

ALE /AS DTR NC NC DEN NC /READY HLDA GND 
NC CPURDY READ /R /T /DAS /WR /RD LE VCC 

EQUATIONS 

/LE = /ALE 

+ /AS 
/CPURDY = /READY / CPU (8 6) is a bus master 

DAS = RD + WR 

DAS.TRST = /HLDA 

/READ = DTR 

/READ.TRST = /HLDA 

T = DTR 

T.TRST = /HLDA 

R = /DTR * DEN 

R.TRST = /HLDA ; LANCE is a bus master 

RD = READ * DAS 
RD.TRST = HLDA 
WR = /READ * DAS 
WR.TRST = HLDA 



Figure 5. Source Listing for Exampie of Figure 4 
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Figure 3. AmPAL16R4, 8086 (Max. Mode) LANCE Interface Timing Diagram 
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Figure 4. Am7990 to 8086 Interface 
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8086 and Am951 6 Universal DMA 
Controller interface 



Am9516in MIN. Mode 

Figure 1 illustrates the interface of tfie Am951 6 to tfie 8086 in 
thie MIN. mode configuration. Figure 3 illustrates the interface 
in MAX. mode. The interfaces could be accomplished by 
using rather complex implementation of standard SSI/MSI 
logics. Examples here replace the logic portion with a PAL 
device. The MIN. Mode uses the PAL device 16L8. This is a 
good example of "garbage collection". It reduces the amount 
of real estate, interconnections, parts, and part types. 



Both interface examples accomplish two major functio ns. F irst, 
when the Am9516 is bus master, it converts RD and WR into 
RAA/ and DS, and vice versa when th e Am95 16 is not the bus 
master Secondly the buffer controls, TBEN and REBN, are 
generated from DEN and DT/R . 

The two examples show different types of latches and 
transceivers and there are many more to choose from. The 
designer selects those that best meet system requirements 
while trying to minimize the number of different parts that must 
be stocked. Figure 2 shows the PAL device equations for this 
example. 
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Figure 1. The Am9516 UDC to 8086 CPU Interface (Minimum Mode) 
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TITLE Ain9516 TO 8086 MIN MODE INTERFACE 

PATTERN 01-3.66 

REVISION 01 

AUTHOR JOE ENGINEER 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/05/87 

CHIP DMA_INTFCE PAL16L8 

;Am9516 to 8086 min mode interface chip 

NC ALED ALEP HLDA BW ADO DT /DEN /SEL GND 
NC /RBEN /RD ALE AO /RW /DS /WR /THEN VCC 

EQUATIONS 

DS = RD + WR 
DS.TRST = /HLDA 

RW = DT 
RW.TRST = /HLDA 

THEN = /DT * /SEL * DEN 
TBEN.TRST ^ /HLDA 

RBEN = DT * /SEL * DEN 
RBEN.TRST = /HLDA 

RD = /RW * DS 
RD.TRST = /HLDA 

WR = RW * DS 
WR.TRST = /HLDA 

ALE = /ALEP * /ALED 

AO = /ADO * /BW * HLDA * ALED 
+ ADO * BW * HLDA * ALED 
+ /ADO * /HLDA * ALEP 
+ AO * /ALEP + AO * /ALED 

; DESCRIPTION 

;THIS PAL DEVICE CONVERTS THE CONTROL SIGNALS TO INTERFACE THE flDflt 
;IN MIN MODE TO THE Am^Blb DMA CONTROLLER. ANOTHER EXAMPLE SHOWS 
;HOW THIS IS DONE IN MAX MODE. 

; SIMULATION NOT INCLUDED 



Figure 2. Source Listing for Example of Figure 1 
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Am9516inMAX. Mode 

This MAX. mode interface between the 8086 and Am9516 
(Figure 3) also uses PAL devices to reduce component 
count. 

The example makes several assumptions which result in a 
slightly more complex design than absolutely necessary. The 
8086 is assumed to be in MAX. Mode, and the design has to be 
compatible with a MULTIBUS or similar interface; the 16R4 
and 8288 could be eliminated if this is not the case. 
The 1 6R6 (Figure 4) is a PAL device that performs a function 
similar to the 8288, that is, it converts the processor's status 
signals_and clock into control signals. In this case, the s ignals 
are R/W, Data Strobe (DS), Interrupt Acknowledge (INTA) 
and Peripheral Acknowledge (PACK). This PAL device 
basically generates ZBUS signals for Zilog peripherals. 
In the example, it connects to the Am9516 (UDC). The DMA 
Controller is very similar to the AmZ801 6 except its bus inter- 
face has been modified to interface to non-multiplexed 
buses. This was changed from the previous design using a 
16R8 to eliminate problems due to skew between OSC 
and CLK. 

The 1 6R4, Figure 5, has two functions. The first is connecting 
the MIN. Mode protocol of the Am9516 or similar device to 



the MAX. Mode protocol for bus exchange. The 74LS03s are 
used to aid in this function. The timing waveform illustrates 
what happens in detail. The basic philosophy is that a rising 
edge on the HOLD input generates a pulse that is one clock 
wide. The CPU samples this pulse and, in response, issues a 
one-clock-wide pulse. The 16R4 uses this response pulse to 
generate HLDA. When the Am9516 has completed the nec- 
essary transfer, HOLD transitions HIGH to LOW. This gener- 
ates another pulse to the CPU signalling that the Am9516 is 
done and that the CPU may continue. 

The second function of the 16R4 is the conversion of R/W, 
DS and M/IO into the MULTIBUS-compatible signals 
/MRDC, /MWTC, /lORC, and /lOWC, when HLDA is HIGH. It is 
possible to collapse the 74LS03s into the PAL device, thus 
reducing the external logic required to only one open collector 
inverter The disadvantage, however, is that it adds two addi- 
tional clocks for the bus exchange overhead, one during ac- 
quisition of the bus and one on bus release. For block 
transfers, this is not significant but it may be undesirable when 
performing single transfers, or short burst, or when in Demand 
Mode. To eliminate the 74LS03, change the equation for R2 to 
/R1 • /D2 + R1 . D2, 

and then drive the /[RQ/GT] line with a 74LS05 from the R2 
output. 
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TITLE 808 6 TO 85XX PERIPHERAL INTERFACE 

PATTERN 01-3.68 

REVISION 01 

AUTHOR JOE BRCICH 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/06/87 

CHIP DMA INTRR PAL16R6 



CLOCK RESET CLK 
/OE /AS /PI 



/SO 
/RW 



/SI /S2 /LOCK NC NO GND 
/DS /PC /lACK /RDY CLKD VCC 



EQUATIONS 



= 


/RESET 


* 


SO 


* 


/PO 


* /PI 


+ 


/RESET 


* 


SI 


* 


/PO 


* /PI 


+ 


/RESET 


* 


S2 


* 


/PO 


* /PI 


+ 


/RESET 


* 


SO 


* 


PI 




+ 


/RESET 


* 


SI 


* 


PI 




+ 


/RESET 


* 


S2 


* 


PI 





;= /RESET * /PO * /PI 
+ /RESET * PI * SO 
+ /RESET * PI * SI 
+ /RESET * PI * S2 



;= /lACK * /PO * PI * SO 
+ /lACK * /PO * PI * /SO 
+ lACK * SO * SI * S2 * 
+ DS * SO * 81 * S2 
+ DS * SO * /SI * 82 
+ DS * /SO * SI * S2 



" /SI * 82 
" 81 * 82 
PO * /PI ■> 



RW 



SO 



/SI 



S2 



lACK := /RESET * SO * SI * S2 * /PO * /PI * /LOCK 

+ /RESET * lACK * 80 * 81 * 82 * PO * /PI * /LOCK 

+ /RESET * lACK * LOCK * /DS 

+ /RESET * lACK * /LOCK * D8 * /PO * PI 



RDY 



:= /RESET * ■ SO 

+ /RESET * /SO 

+ /RESET * RDY 

+ /RESET * RDY 

+ /RESET * lACK 

+ /RESET * RDY 



/SI * 82 * PO * PI 
81 * 82 * PO * PI 
SO * /SI * S2 

/80 * 81 * 82 
80 * 81 * 82 * DS 
SO * 81 * 82 



/CLKD = CLK 

AS = /CLKD * PO * /PI * /lACK * CLK 

/DESCRIPTION 

;THI8 PAL DEVICE TRANSLATES 8086 BUS SIGNALS INTO COMPATIBLE SIGNALS FOR 
;THE 9516. IT IS ALSO APPLICABLE TO 85XX PERIPHERALS BY ALTERING /RW AND 
;/D8 TO /RD AND /WR. ONE FLIP-FLOP 18 AVAILABLE TO GIVE THE NECESSARY 
/DELAY TO THE FALLING EDGE OF /WR. THE DATA STROBE TIMING FOR A WRITE 
/CYCLE IS DELAYED UNTIL THE FALLING EDGE OF T2 TO MEET THE REQUIREMENTS 
/OF THE 85XX PARTS. THIS DESIGN ASSERTS RDY TO DEMAND ONE WAIT STATE 
/FROM THE 8086. THIS WAIT STATE IS NOT LONG ENOUGH FOR DESIGNS WHICH USE 
/AN 8 MHz 808 6. THEREFORE, WITH AN 8 MHz CPU, 85XX PERIPHERALS SHOULD BE 
/USED. AS AN ALTERNATIVE, THREE WAIT STATES CAN BE USED BY ALTERING THE 
/RDY EQUATION. THIS PAL DEVICE ALSO TRANSFORMS THE 8086 TWO-CYCLE 
/INTERRUPT ACKNOWLEDGE INTO A SINGLE CYCLE OF THE TYPE NECESSARY FOR 85XX 
/PARTS. THIS IS MADE POSSIBLE BY SAMPLING THE LOCK STATUS, PO, PI, AND 
/lACK SIGNALS. 

Figure 4. Source Listing for Example of Figure 3 
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TITLE Am9516 TO 8086 INTERFACE 

PATTERN 01-3.67 

REVISION 01 

AUTHOR JOE ENGINEER 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/06/87 

CHIP DMA_INTFCE2 PAL16R4 

;8086 to Ain9516 interface 

CLK /RQGT HOLD NC NC NC /RW /DS MIO GND 
/OE /MWTC /MRDC HLDA /D2 /R2 /Rl /lOWC /lORC VCC 

EQUATIONS 

lORC = /MIO * DS * /RW 
lORC.TRST = HLDA 

lOWC = /MIO * DS * RW 
lOWC.TRST = HLDA 

MRDC = MIO * DS * /RW 
MRDC.TRST = HLDA 

MWTC = MIO * DS * RW 
MWTC.TRST = HLDA 

Rl := HOLD 

R2 := /Rl 

D2 := Rl 

/HLDA := /Rl + /D2 * /HLDA + 
/RQGT * /HLDA 

; DESCRIPTION 

;THIS DEVICE CONVERTS THE MIN MODE SIGNALS HOLD AND HLDA TO THE 
;MAX MODE RQGT PROTOCOL. ADDITIONALLY IT GENERATES THE flEflfl 
;EQUIVALENT CONTROL OUTPUTS MRDC, MWTC, lORC, AND lOWC. THIS 
;PAL DEVICE WAS USED TO CONNECT THE AmRBlfc TO THE flDflfc IN MAX MODE 

; SIMULATION NOT INCLUDED 



Figure 5. Source Listing for Example of Figure 3 
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80286 to Am9568 Data Ciphering 
Processor interface 

This interface is designed for an 8-MHz CPU where the Data 
Ciphering Processor (DCP) is synchronously operating at the 
maximum clock rate of 4 MHz. A block diagram for the inter- 
face is shown in Figure 1. The Am9568 requires a narrower 
width of address strobe than the Am951 8. This works comfor- 
tably with the 60-ns address strobe width of an 8-MHz CPU. 

The MULTIBUS Mode Select Input of the Bus Controller 82288 
is tied LOW to optimize the command and control signals for 
short bus cycles. The Command Delay (CMDLY) becomes 
active-HIGH for one 1 6-MHz clock cycle whenever the DCP is 
selected to delay the Read and Write strobes by 125 ns. This 
satisfies the timing requirement of the minimum delay between 
ALE inactive and Read or Write strobe active of the DCP. An 
open-collector gate must be added to allow other peripherals to 
drive this Input. 



The ALE, lORC and lOWC outputs of the 82288 are wired 
directly to the DCP. ALE strobes a D-Fllp-Flop to store the state 
of Chip Select for the entire cycle. 

Q3 and the latched Chip Select CSL are ANDed externally to 
generate the Synchronous Ready for the 82284. The 82284 
samples the line at the falling edge of the clock. The registered 



output Q3 is clocked with the rising edge of the same clock, 
thus satisfying the setup and hold time requirements of the 
82284. Two Walt States are inserted. 

Half of the PAL device operates as a bidirectional Ad- 
dress/Data Multiplexer. During the Address Latch Enable ac- 
tive phase, the state of Ai and A2 is transferred to the ADi and 
AD2 pins of the PAL device. The DCP latches this two bit- 
address with the falling edge of ALE. 



When lORC and CSL are active, the states of ADi and AD2 
are passed to Di and D2 respectively. The DCP Register can 
be read. If lOWC and CSL are active, the data path is turned 
around: Di and D2 are inputs, ADi and AD2 are outputs. 

The Address Hold Time of the PAL device is sufficient because 
the address information is passed to ADi and AD2 whenever 
IGRC'CSL or IOWC*CSL are not true, i.e. whenever data is 
not transferred Between the CPU and the DCP. 

The Read Data Hold Time requirement of 5 ns of the Am9568 
is satisfied by the propagation delay of the PAL device. 

The Read Data Hold Time requirement of 5 ns of the iAPX286 
is also satisfied by the PAL device. 

The Master Port Chip Select (MCS) input of the DCP Is con- 
nected to the unlatched address decoder output. 

Figure 3 shows the source listing and the pin descriptions. 
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Figure 1. 80286 to Am9568 Interface 



The DCP Clock 

The PAL devic e syn chroni zes the DCP cloci< to the Data 
Strobes IORC and lOWC (Figure 2). It also divides the 
16 MHz system clock (8 MHz CPU clock) down to the maxi- 
mum DCP clock rate of 4 MHz. At this clock rate, the Data 
Strobe Delay to the DCP clock must be 0-30 ns. The Bus 
Controller is specified to generate a Data Strobe timing of 
3-1 5 ns to the falling edge of CLK (1 6 MHz). Because of the 
higher propagation delay of a standard PAL device, the regis- 
tered outputs are toggled at the rising edge of CLK before the 
Data Strobes become inactive. This gives an additional 
32.5 ns for the DCP clock signal path. 

Qi to Q3 are three outputs of the PAL device state machine. 
The registered outputs are clocked with the_rlsing edge of the 
16-MHz 82284 clock. Whenever ALE and CS are active, Qi to 
Q3 are set to the initial state. Qi to Q3 are outputs of a 3-bit 
down counter, with Q3 as the most significant bit. 



Q3 is used to generate the SRDY signal for the 82284 as 
mentioned above. 

Q2 is the DCP clock. This design must guarantee that the 
minimum DCP clock HIGH or LOW time is at least 1 15 ns or 
two 16-MHz clock cycles. This is done by toggling Q2 only 



during phase 2 cycles of the CPU. The CPU design guarantees 
that there is always a phase 1 cycle between two phase 2 
cycles. 

Assuming a typical PAL device propagation delay of 25 ns, 
timing parameter tCDS (Time Clock Data Strobe) is 10.5 to 
22.5 ns ( 3 + 32.5 - 25 ns to 1 5 + 32.5 - 25 ns). This satisfies 
the to 30 ns requirement. 

The AmPAL16R4 has active-LOW outputs. But one output, Q2, 
should be active-HlGH. The equation for Q2 was derived to be 

Q2 = ALE * CS + 0/ Q2 ^- Q/ Q2 

To compensate for the inversion in the PAL dGvice, the 
PALASM software minimizer (or 16RP4) can be used to 
convert it to the form shown in the PAL device Design Speci- 
fication. 

Improvements 

The DCP needs two Wait States only when the Control Reg- 
isters are read. Data Register read or writes and Control 
Register reads can be executed with only one Wait State, 
which improves the Data Ciphering speed of this interface. 
The more sophisticated Wait control logic and the two exter- 
nal TTL gates can be integrated into one PAL22V10 device. 
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TITLE 80286 TO Ain9568 (DCP) INTERFACE 

PATTERN DCP043 

REVISION 01 

AUTHOR JUERGEN STELBRINK 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/06/87 

CHIP DCP_INTFCE PAL16R4 

CLK /CS CSL ALE /lORC /lOWC Al A2 NC GND 
/OE Dl D2 /Ql Q2 /Q3 CMDLY ADl AD2 VCC 



EQUATIONS 


Ql 


:= ALE*CS 
+ /Ql 


/Q2 


:= Q1*/Q2*/ALE 
+ Q1*/Q2*/CS 
+ Q1*Q2*/ALE 
+ Q1*Q2*/CS 


Q3 


:= ALE*CS 
+ Q1*Q2*Q3 
+ /Q1*Q2*Q3 
+ Q1*/Q2*Q3 
+ /Q1VQ2*/Q3 


/CMDLY 


:= /ALE 
+/CS 


/Dl = /ADl 

Dl.TRST = CSL * lORC 



/D2 = /AD2 

D2.TRST = CSL * lORC 

/ADl = /A1*ALE + /D1*/ALE 
ADl.TRST = CSL * lORC 

/AD2 = /A2*ALE + /D2*/ALE 
AD2.TRST = CSL * lORC 

; SIMULATION NOT INCLUDED 



Figure 3. Design File for the Example of Figure 1 
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80286 to Am8530 Interface 

The Am8530 is a high-speed, dual-channel serial communi- 
cations controller that supports a variety of advanced 
communication protocols. It supports data rates up to 
1.5Mbps. 

This design note shows an 80286 to an Am8530 CPU inter- 
face when no interrupts are used (Figures 1 and 2). The 
design is for a CPU running at 5 MHz with a 100 ns system 
clock cycle time. This clock is used to generate the clock for 
the 6 MHz Am8530 SCC. 

The chip select generated by the external decoder in an 
80286 system that is enabled by the M/IO line will go inactive 
with the falling edge of TcLK-phase 1. However, the chip 
enable for the SCC should stay valid for the entire read or 



write cycle. The PAL device generates the chip enable signal 
for the SCC using a structure equivalent to two cross-coupled 
NOR gates. 

The Am8 530 requires a minimum pulse width of 250 ns for RD 
and WR control signals. For a 100-ns cycle time 80286 with no 
wait state, these controls will be active for 200 ns only SRDY 
control is generat ed fo r the 82284 bus controller to guarantee 
minimum RD and WR pulse widths for the Am8530. 

CMDLY is not used in this design because the address to read 
or write control setup lime required is only 80 ns for the 
Am8530; with a 100 ns system cycle time, the time between 
ALE valid and /lORC or /IOWC valid is 100 ns. Even after 
accounting for the address latch delay, 80 ns timing will still 
be met. 
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" THIS PLPL FILE IS FOR A 16R4 THAT IMPLEMENTS THE LOGIC 

NECESSARY TO INTERFACE AN Am8530 (SCC) TO AN 80286 SYSTEM. " 



" COUNT DOWN SPECIFICATION FOR THE COUNTER. " 

" Q2 OF THE COUNTER IS THE CLOCK INPUT OF THE Am8530. " 



DEVICE Am8530 to 80286 (PAL16R4) 



PIN 



CLK 


= 1 


VCC 


= 20 


/CS 


= 2 


/CE 


= 19 


ALE 


= 3 


/INTSO 


= 18 


/lORC 


= 4 


/Q[1] 


= 17 


/I owe 


= 5 


/Q[2] 


= 16 


NCI 


= 6 


/Q[3] 


= 15 


NC2 


= 7 


NC5 


= 14 


NC3 


= 8 


/SRDY 


= 13 


NC4 


= 9 


NC6 


= 12 


GND 


= 10 


NC7 


= 11 



CASE ( Q[3:1] ) 
BEGIN 

) Q[3 

1 ) Q[3 

2 ) Q[3 

3 ) Q[3 

4 ) Q[3 

5 ) Q[3 

6 ) Q[3 

7 ) Q[3 
END ; 



" SRDY IS GENERATED FOR WAIT STATE GENERATION, IT IS SENT 
TO THE 82284 CLOCK GENERATOR. " 





= 7 ; 




= ; 




= 1 ; 




= 2 ; 




= 3 ; 




= ^ ; 




= 5 ; 




= 6 ; 



BEGIN 

" CHIP ENABLE FOR THE AiTi8530 IS DERIVED FROM ALE AND 
THE EXTERNAL DECODER CHIP SELECT OUTPUT. " 



SRDY = /(Q[3] * CE) 



END. 



CO 

o 

10 

00 
0) 

r* 

o 
> 
3 

00 
Ul 
(0 

o 



CE = CS * ALE + INTSO ; 

INTSO = /( lORC * lOWC) + CE ; 

" SYNCHRONIZE THE COUNTER WITH ALE. " 

IF ( CS * ALE ) THEN Q[3:1] = 7 ; 



Figure 2. Source Listing for AmPAL16R4 for the Example of Figure 1 



Interfacing to the 68000/68020 



The 68000 has an asynchronous, 16-bit, bidirectional data 
bus. Data types supported by the 68000 are: bit data, integer 
data of 8, 16, or 32 bits, 32-bit addresses and binary-coded 
decimal data. It can tr ansfer and accept data in either words 
or bytes. The DTACK input indicates the co mpletion of a data 
transfer. When the processor recognizes DTACK during a 
read c ycle, the data is latched and the bus cycle terminates. 
When DTACK is recognized during a write c ycle, the bus 
cycle also terminates. An active transition of DTACK indi- 
cates the termination of a data transfer on the bus. All control 
and data lines are sampled during the 68000's clocl< HIGH 
time. The clock is internally buffered, which results in some 
slight differences in the sampling and recognition of various 
signals. The 68000 mask sets prior to CC1 and allows 
DTAC K to b e r ecognize d as early as S2, and all devices 
allow BERR or DTACK to be recognized in S4, S6, etc., 
which term inates th e cycle. If the required setup time is met 
during S4, DTACK will be recogni zed dur ing S5 and S6, and 
data will be captured during S6. DTACK signal is internally 
synchronized to allow for valid operation in an asynchronous 
system. If an asynchronous control signal does not meet the 
required setup time, it is possible that it may not be 
recognized during that c ycle. Bec ause of this, synchronous 
systems must not allow DTACK to precede data by more 
than 40 to 240 nanoseconds, depending on the speed of the 
particular processor. I/O is memory-mapped, i.e., there are 
no special I/O control signals, any peripheral is treated as a 
memory location. 



DMA: This Bus is requested by activating the BR input of the 
68000. Bus Arbitration is started by the BG output going active. 
The Bus is available when AS becomes inactive. The request- 
ing devic e must acknowledge bus mastership by activating the 
BGACK input to the CPU. 

The 23-bit address (Ai ...A23) is on a unidirectional, three-state 
bus, and can address 8 M words (16 M bytes) of memory or 
I/O. It provides the address for bus operation during all cycles, 
except the interrupt cycles. During interrupt cycles, address 
lines AI , A2 and A3 provide information about the level o f 
interrupt being serviced. Instead of Aq and BYTE/WORD, 
there are two separate data strobe lines for the two bytes in a 
word. A note of caution here, the 68000 treats the MSB of the 
lower byte as an even byte, or word address. The same goes 
with processors such as the Z8000. Processors such as the 
8086 treats the lower byte as the odd byte. 

Interrupt is requested by activating any combination of the in- 
terrupt inputs to the 68000 (IPL0...2), indicating the encoded 
priority level of the interrupt requester (inputs at or below the 
current processor priority are ignored). The 68000 automati- 
cally saves the status register, switches to supervisor mode, 
fetches a vector number from the interrupting device, and dis- 
plays the interrupt level on the address bus. For interfacing 
with old 68000 peripherals, the 68000 issues an Enable signal 
at one-tenth of the processor clock frequency. There are a 
number of AMD proprietary third generation peripherals that 
can be interfaced to the 68000 CPU, to improve system perfor- 
mance. This section deals mainly with the interfacing of the 
68000 and some of the AMD proprietary peripherals. 
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The 68000 to Am8530 Interface 
with Interrupts 



This example shows how a PAL device simplifies the task of 
interrupt generation compared to the MSI implementation. 
The block diagram for the interface via a PAL device is shown 
in Figure 1 . The timing diagram (Figure 2) illustrates the Inter- 
rupt Acknowledge cycle. As In the other designs, RD is 
generated during Interrupt Acknowledge to place the vector 
on the bus. 



The timing during register programming is not shown. The PAL 
device allows selection of one or two Wait States by making Wq 
HIGH or LOW, respectively The table below shows the appro- 
priate number of Wait States as a function of CPU speed. 



Part 


CPU Speed 


Wait States 


85XX 
85XX 
85XXA 
85XXA 


4 MHz 

6 MHz 

8 MHz 

10 MHz 


1 
2 
2 
2 



PAL device equations are shown in Figure 3. 




Figure 1. 68000 to Am8530 Connection Using a PAL Device 



2-150 



^ 



Monolithic 



m 



Memories 



I\ 



The 68000 to Am8530 Interface with Interrupts 
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PAL Device 
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Figure 2. Timing Diagram for 68000 to Am8530 Interface 
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The 68000 to Am8530 Interface with Interrupts 



TITLE 68000 TO 8500 OR 9500 PERIPHERALS 

PATTERN PAT002 

REVISION 01 

AUTHOR JOE BRCICH 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/06/87 

CHIP PERI_INTFC PAL16R4 

CLOCK /CS RW /LDS /WO /AS FCO FCl FC2 GND 
/OE /INTA /ACK /C /B /A /DLDS /RD /WR VCC 



EQUATIONS 


A 


; = 


A*/B 




+ 


B *C 




+ 


/AS 


B 


: = 


A*/C 




+ 


/A*C 




+ 


/AS 


C 


: = 


/A*/B*AS 




+ 


B*C*AS 


DLDS 


:= LDS 



RD = LDS*DLDS*RW*/INTA 
+ A*C*INTA*AS 
+ A*/B*INTA*AS 

WR = LDS * /RW 

INTA = FC0*FC1*FC2*AS 

ACK = /INTA*/A*/B*/C*/W0 
+ /INTA*/A*/B*C*WO 
+ INTA*/B*A 
+ ACK * LDS 



DESCRIPTION 

THIS PAL DEVICE INTERFACES 85XX TYPE PERIPHERALS TO THE 68000 
MICRO PROCESSOR. IT INSERTS 1 OR 2 WAIT STATES AS SELECTED BY 
/W0=0 IS ONE AND /W0=1 IS TWO WAIT STATES. FOUR WAIT STATES ARE 
INSERTED DURING INTERRUPT ACKNOWLEDGE CYCLES. ALSO THE RD OUTPUT 
GENERATED DURING INTA IS A FUNCTION OF THE INTERNAL STATE 
MACHINE AND NOT A FUNCTION OF LDS. OE CAN BE LEFT OPEN SINCE 
THE FLIP FLOP OUTPUTS ARE NOT USED DIRECTLY. THE FALLING EDGE 
OF RD IS DELAYED IN ORDER TO GUARANTEE THE CS TO RD SETUP TIME 
REQUIREMENTS. 

SIMULATION NOT INCLUDED 



Figure 3. Source Listing for the Example of Figure 1 
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68000 and Am7990 LANCE Interface 



Tha design on the LANCE has made it easier for the user to faster 68000 (Figure 1). The two flip-flops are needed to adapt 
interface the device with demultiplexed buses. The example the LANCE bus request handshake to the 68000. 
shown here is an interface to be compatible with an 8 MHz or 
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Figure 1. Am7990 to 68000 Interface 
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68000 and Am7990 LANCE Interface 



Autovectoring is used since the Am7990 does not return a 
vecto r during interrupt acknowledge cycles. T he BY TE and 
DAS signals of LANCE are used to gener ate th e U PS a nd LPS 
when LANCE is i n Bus Master mode; the UPS and LPS is used 
to generate the PAS when LANCE is in Bus Slave mode. It 



takes two latches to demultiplex the LANCE address/data 
lines to adapt to the 68000 address bus. The flip-flops can be 
replaced by a PAL22V10 to minimize parts count. Equations 
for the PALI 6L8 are shown in Figure 2. 



TITLE 


68000 TO LANCE INTERFACE 


PATTERN 


PAT002 


REVISION 


01 


AUTHOR 


JOE BRCICH 


COMPANY 


ADVANCED MICRO DEVICES 


DATE 


11/06/87 



CHIP LANCE_IN PAL16L8 

/AS RW BYTE /HOLD NC /EG AO NC /BGACK GND 
/CS /TB /UDS /DAS /CLRl BR /CLR2 /LDS /RB VCC 

EQUATIONS 

RB = CS*RW*UDS 
+ CS*RW*LDS 
RB.TRST = /BGACK 

TB - CS*/RW 
TB.TRST = /BGACK 

UDS = DAS*/AO*BYTE 

+ BYTE*DAS 
UDS.TRST = BGACK 

LDS = DAS*AO*BYTE 

+ /BYTE*DAS 
LDS.TRST = BGACK 

DAS = UDS*LDS 
DAS.TRST = /BGACK 

CLRl = /AS*BG 



CLR2 = BGACK 
/BR = /HOLD 



; DELAY 



DESCRIPTION 

THE GOAL OF THIS INTERFACE WAS TO BE COMPATIBLE WITH 8 MHZ 
AND FASTER 68000 'S WHILE MINIMIZING PARTS COUNT. THE 
Ani22V10 COULD BE USED TO ELIMINATE THE TWO FLIP-FLOPS 
SHOWN. AUTOVECTORFING IS USED SINCE THE 7990 DOES NOT 
RETURN A VECTOR DURING INTERRUPT ACKNOWLEDGE CYCLES. 
NOTE PROGRAM BSWP, BCON TO 1, AND ACON TO IN CSR3 REG. 



SIMULATION NOT INCLUDED 



Figure 2. Source Listing for the Example of Figure 1 
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68000 to AmZ8068 Data Ciphering 
Processor Interface 



Figures 1 and 2 show the 68000-DCP interface and the inter- 
face timing. This interface provides a two-chip solution to add 
high-speed data ciphering to a 68000-based system. About 
500 kbyte/sec are possible in a CPU-controlled transfer. The 
ciphering rate can be increased with a sophisticated DMA 
controller, or with several OCRs operating in parallel. The 
CPU operates at 8 MHz and the DCP operates synchro- 
nously at 4 MHz. The Interface Controller, a PAL device, gen- 
erates the Address and Data Strobes for the DCP and the 
Data Acknowledge for the CPU. It also divides the CPU clock 
by two and synchronizes it to the Data Strobes. 

The main features of this interface are: 

— Multiplexed Control Mode 

— Demultiplexed address and data bus 

— Two-Cycle Operation 



— Clock Synchronization with two Low Cycles after the Data 
Strobes 

— About 500 kbyte/sec ciphering speed 

Data transfers between the CPU and the DCP are accom- 
plished by a two-cycle operation. First the address of an inter- 
nal register is latched in, then the data is transferred. This 
causes a small overhead in the initialization phase, but im- 
proves the ciphering rate in a high-speed data ciphering ses- 
sion. The rate of 500 kbyte/sec can be reached only if a 
high-speed peripheral device is connected to the Slave Port 
and the DCP is programmed for dual port configuration. 

The PAL device is programmed to allow only DCP transfers to 
the DCP. The PAL device equations are shown in Figure 5. 
Ao must be odd to make the CPU transfer the data on the Low 
byte of the data bus. A "0" on A, indicates an Address Latch 
Cycle, whereas a "1 " on A, indicates a Data Transfer Cycle! Aq 
must be "1" in both cycles. 
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Figure 1. AmZ8068 to 68000 Interface 
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68000 to AmZ8068 Data Ciphering Processor interface 



A1-A23 



LDS, MAS 
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DTACK1, 2 
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Figure 2. 68000-AmZ8068 Address Latch Cycle (A-| = Low) 
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An address decoder generates the Chip Select for the DCP. 
The Address Strobe indicates a valid address. The PAL device 
is only activated if the Lower Data Strobe becomes active while 
the Upper Data Strobe stays inactive. This means that data is 
transferred in MOVE.B instructions with an odd peripheral 
address. 

The PAL device provides two Data Acknowledge outputs. 
DTACKi i s an activ e Low TTL output. DTACK2 hasthe same 
timing as DTACKi , but is an Open Collector output. (The 
Open Collector output is realized by a three-state output which 
has only two states, Low or Floating.) 

Address Latch Cycle 

In this cycle only a Master Port A ddres s Strobe (MA S) is 
generated. Master Port Chip Select (MCS) is tied to Low. LDS 
is sent to the MAS output. The minimum pulse width of LDS is 
1 15 ns; 80 ns are required for the AmZ8068. 



DTACK is activated with the falling edge of t he CPU c lock after 
cycle S2. The CPU inserts no Wait states. DTACK is deac- 
tivated with the first edge of CLK after AS becomes inactive. 

Data Read Cycle: (Figure 3) 

The generation of MDS in a Data Read Cycle is similar to the 
Data Write Cycle. Because the CPU activates LDS one cycle 
earlier, t here is no need for a Wait State. The minimum pulse 
width of LDS is 240 ns; the DCP requires 200 ns for a Status 



Register read. DTACK is activated using the same logical con- 
dition a s in th e Data Write cycle. Because of the earlier activa- 
tion of LDS, DTACK becomes active earlier and the CPU 
inserts no Wait states. 

Data Write Cycle: (Figure 4) 

A Data Write Cycle is performed with Aq is HIGH, AS, CS and 
LDS are LOW. The minimum pulse width of LDS is not suffi- 
cient for the DCP which requires at least 125 ns. One Wait 
state or a slower system clock will satisfy this par ameter. In this 
interface, one Wait state is inserted by activating DTACK at the 
end of S4. 

The DCP clock is synchronized in Data Read or Write Cycles 
by forcing it Low when DTACK becomes active. This gu aran- 
t ees t hat the DCP clock has a falling edge just before LDS 
(MDS) rises. The delay of the DCP clock to CLK is typically 
8 ns for a PAL device. The delay of LDS to MDS is typically 
1 2 ns. The delay of LDS to the system clock is 0-70 ns for t he 
8 MHz version. This results in a delay of 4-74 ns of MDS to 
the DCP clock. The DCP requires 0-50 ns when operating at 
the maximum clock rate. 

This problem is solved by stretching the clock for one cycle. 
The DCP clock stays LOW for two cycles in the end of a 
transfer cycle. This is done automatically by the PAL device 
(see Figure 4). 
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68000 to AmZ8068 Data Ciphering Processor Interface 
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Figure 3. 68000-AmZ8068 Data Read Cycle (A-, = HIGH) 
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Figure 4. 68000-AmZ8068 Data Write Cycle (Ai = HIGH) 
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68000 to AmZ8068 Data Ciphering Processor Interface 



TITLE 68000 - AmZ8068 (DCP) INTERFACE DEVICE 

PATTERN DCP044 

REVISION 01 

AUTHOR JUERGEN STELBRINK 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/06/87 

CHIP AmZ_INTFCE PAL16R4 

CLK2 /CS /AS /UDS /LDS RW Al CLK NC GND 
/OE /MAS /MDS DCPCLK NC NC /DTACKl CLKl /DTACK2 VCC 



EQUATIONS 
/CLKl = CLK 



; INVERT CLOCK TO TRIGGER THE REGISTERED 
; OUTPUTS WITH THE FALLING EDGE OF CLK 



MAS = AS*LDS*/UDS*/RW*/A1*CS 

MDS = AS*LDS*/UDS*A1*CS 

/DCPCLK := DCPCLK 

+ /DTACK1*CS*AS*LDS*/UDS 
+ DTACK1*/AS*/LDS*/UDS 

DTACKl := AS*LDS*/UDS*A1*CS 
+ AS*/RW*/A1*CS 

DTACK2 = DTACKl 

DTACK2.TRST = DTACK1*AS*CS 

; SIMULATION NOT INCLUDED 



; DIVIDE BY TWO 

; TWO CLOCKS LOW IN 

; THE END OF A DATA CYCLE 

; DATA TRANSFER CYCLE 
; ADDRESS LATCH CYCLE 



Figure 5. Source Listing for Exampie of Figure 1 
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68000 and Dual Am951 6 DMA 
Controllers Interface 



There has been interest shown in connecting two Am9516 
DMA Controllers to obtain four channels. The example here 
shows that such a system can be built by incorporating one 
PAL device, AMD's 22V10 (Figure 1). Address and data 
buses are not shown as they are straightfonward and require 
no explanation. The PAL device, designated 68K16D2, con- 
verts the two DREQs into the 68000 three-wire handshal<e, 
prioritizes the request, and converts the control signals 



appropriately. Equations for the PAL device are shown in 
Figure 2. 

The key parameters are: 1) data h old with respect to the rising 
edge of DS during a write, and 2) DTACK setup time. Control 
for a data bus transceiver is shown because it will be required 
in most systems. The PAL device provides these signals when 
the CPU is bus master; the Am9516 generates these control 
signals directly when it is bus master. 
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Figure 1. Dual Am9516 UDCs to 68000 CPU Interface 
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68000 and Dual Am951 6 DMA Controllers interface 



The Am9516s are shown with independent clocks. The cloclo 
may be divided from the CPU clock or may be generated inde- 
pendently of th e CPU. B ecause WAIT must meet the set-up 
and hold times, DTACK may need synchronization by the use 
of a flip-flop, as shown. This flip-flop may be eliminated in 
synchronous systems. 



The clock Is inverted to the PAL device to meet DTACK set-up 
time in systems of 10 MHz or faster This inverter may be 
deleted In slower systems, if appropriate changes to the PAL 
device equations are made. This PAL device implements fixed 
priority between the Am9516s. BREQi is the highest priority 
Rotating priority can be implemented by adding another PAL 
device. 

EOPs are pulled up separately but could be tied together 
since they affect a channel only if it is active. The bus-error 
function can be supported by connecting BERR and EOF. 



If a bus error occurs, EOP will stop the current transfer and 
interrupt the CPU. Th e Inte rrupt Service routine can read the 
status to determine I f EOP caused the Interrupt or if termina- 
tion was normal. If EOP caused the interrupt, the Address 
Register can be read to determine where the bus error oc- 
curred. After the problem is corrected, the CPU can program 
the Am9516 to complete the transfer or do an alternate trans- 
fer, as appropriate. 

When operating the D MA in int erleave mode, an external EOP 
should be gated with DACK to prevent affecting the wrong 
channel. This is unnecessary if interleave is not used, since the 
UDC releases the bus. 

This arbiter'design supports both serial and parallel-expansion 
techniques and is therefore compatible with VME bus protocol. 
Bus grant out was implemented with an external gate due to a 
shortage of pins. The VME/BCLR function was not imple- 
mented because the Am9516 does not support pre-emption. 
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68000 and Dual Am9516 DMA Controllers Interface 



TITLE 68000 TO DUAL 9516 INTERFACE 

PATTERN PATOOl 

REVISION 01 

AUTHOR JOE BRCICH 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/06/87 

CHIP DUAL 9516 PAL22V10 



CLK RW AU BREQl BREQ2 
/CS /LDS /UDS /DS /C 



/EG NC ALEl /BW ALE2 /BGACK GND 
/AS /BR BACK2 BACKl /TB /RB VCC 



EQUATIONS 

BR = BREQl */BGACK 

+ BREQl *BG 

+ BREQ2*/BGACK 

+ BREQ2*BG 

BACKl = BREQ1*BG*/AS 
+ /BG* BGACK 

BACK2 = BREQ2*/BREQ1*BG*/AS 
+ /BG*BGACK 

RB = CS*RW*UDS 
+ CS*RW*LDS 
RB.TRST = /BGACK 

TB = CS*/RW 
TB.TRST = /BGACK 

DS = AS*/C*/RW 

+ AS*RW 
DS.TRST = /BGACK*AS 

AS = ALEl 
+ ALE2 
AS.TRST = BGACK 

UDS = DS*/AO*/BW 

+ BW*DS 
UDS.TRST = BGACK 

LDS = DS*AO*/BW 

+ BW*DS 
LDS.TRST = BGACK 

C := UDS*BGACK 
+ LDS*BGACK 
C.TRST = AS 



Figure 2. Source Listing for thie Example of Figure 1 
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Am8530 to 68020 Interface 



This design shows the logic to interface a 6 IVIHz Am8530 
Serial Communications Controller to the 68020 CPU (Fig- 
ure 1) running at 10 MHz with a system clock cycle time of 
100 ns. The Am8530 is a high-performance, dual-channel 
sec that supports data rates up to 1 .5M bps and a variety of 
communication protocols. 

The PAL device generates the /RD and /WR control signals for 
the Am8530 from the 68020 /DS and R/W- control signals. It 
also generates the clock for the SCC by dividing the 68020 
system clock. This meets the 165-ns minimum cycle time for 
the 6-MHz SCC clock.. 



Also, a state machine is implemented to perform the hand- 
shake necessary for byte transfers on the 68020 bus. The 
state machine transition diagram is shown in Figure 2. Nor- 
mally, DSACK[1 :0] is inactive. When this device is selected, a 
wait state is inserted and then data transfer is acknowledged 
by asserting DSACK[1 :0] for byte transfer. When the CPU 
deasserts the address strobe, DSACK[1:0] is negated. 
DSACK[1 :0] will be driven by other devices on the 68020 
system bus. The PAL device enables DSACK[1 :0] only when 
the CPU performs a read or write cycle for the SCC; at other 
times DSACK[1 :0] lines are three-stated. 

Since the Am8530 is a byte-wide peripheral, it is connected to 
the upper byte of the 32-bit wide 68020 data bus. 
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8530CLK 


c 




> 


VJ 








\/ 


K 


I/O 
DECODER 


■^ r 


y 


1/ 


U ' 






















L->. 


/cs 

/AS 
/DS 
RW 
PAL16R4 
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. 
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^ . ... 




► 








1 


i 



Figure 1. Am6530 to 68020 Interface 
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/cs 




DSACK[1:0]=11 



DSACK[1:0]=10 



Figure 2. 68020 Byte Port DSACK Handshake State Diagram 
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" THIS PLPL FILE IS FOR A 16R4 THAT IMPLEMENTS THE LOGIC 
NECESSARY TO INTERFACE AN Am8530 (SCO TO A 68020 SYSTEM. " 

DEVICE A[ii8530_TO_68020 (PAL16R4) 



THE FOLLOWING CODE IMPLEMENTS THE STATE MACHINE TO 
PERFORM DSACK OPERATION. IT INSERTS WAIT STATES, 
ASSERTS DSACK FOR BYTE TRANSFER AND REMOVES DSACK 
WHEN AS IS NEGATED. " 



CLK 


= 


1 


VCC 


20 


/AS 


= 


2 


DSACKC1] ■■ 


= 19 


/DS 


= 


3 


DSACK [0] ■■ 


= 18 


RW 


= 


4 


ST[1] = 


17 


/CS 


= 


5 


ST[0] = 


16 


NCI 


= 


6 


Q 


15 


NC2 


= 


7 


NC5 


14 


NC3 


= 


8 


WR 


13 


NC4 


= 


9 


RD 


12 


GND 


= 


10 


NC6 


11 ; 



READ AND WRITE CONTROL SIGNALS ARE DERIVED FROM 
R/W- AND DS-. " 



WR = /(DS • /RW) ; 
RD = /(DS * RW) ; 



THE INCOMING CLOCK IS DIVIDED BY 2 TO GENERATE THE 
CLOCK FOR THE Am8530. " 



IF ( /CS ) ENABLE ( DSACK t1:0] ) 

IF ( ST[1:0] = #B10 ) 

THEN DSACK [1:0] = #B10; 
ELSE DSACK [1:0] = #811; 

CASE ( ST [1:0] ) 
BEGIN 
#B00) BEGIN 

IF ( /CS ) 

THEN ST[1:0] := #B00 
ELSE ST [1:0] := #B01; 
END ; 
#B01) ST [1:0] := #810; 
#810) BEGIN 

IF ( /AS ) 

THEN ST [1:0] := #800 
ELSE ST [1:0] := #810; 
END ; 
END ; 
END. 



CASE ( Q ) 

BEGIN 

0) Q := 1; 

1) Q := 0; 

END; 



Figure 3. PLPL Specification for the 8530 to a 68020 Interface 
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Interfacing to the 8088 



Overview 

The 8088 CPU is an 8-bit processor designed around the 8086 
internal structure. Most functions of the 8088 are identical to 
the equivalent 8086. The 8088 fetches and writes 16-bit words 
in two consecutive bus cycles. Both the 8086 and the 8088 
handle the external bus the same way but the 8088 handles 
only 8-bits at a time; and both appear identical to the software 
engineer, with the exception of execution time. 

The hardware interface of the 8088 contains the major differ- 
ences between the two CPUs. The pin assignments are nearly 
identical, however, with the following functional changes: 

As-Ais These pins are only address outputs on the 

8088. They are latched internally and remain 
valid throughout a bus cycle in a manner 
similar to the 8085 upper address lines. 

SSO Provides the SO status information in the mini- 

mum mode. This output occurs on pin 34 in 
minimum mode only. 

DT/R , lO/M , and SSO provide the complete bus status in 
minimum mode. 

lO/M has been inverted to be compatible with the 

MCS-85 bus structure. 

ALE is delayed by one clock cycle in the minimum mode when 
entering HALT, to allow the status to be latched with ALE. 



The 8088 and AMD Proprietary 
Peripherals 

The evolution of chip design has taken the 8-bit environment 
into the 16-bit environment. While the new generation of pe- 
ripheral devices are often 16 bits wide, the older, established 
8-bit orientation of CPUs and peripherals are still significant. 
Interfacing a 16-bit peripheral with an 8-bit CPU often encoun- 
ters data path incompatibility and involves bus control 
manipulation. This type of integration mainly involves separat- 
ing the control and data paths from the new peripheral and the 
system. 

The ability to mix different data path widths can improve sys- 
tem functionality, performance, and cost. It is less expensive 
to use an 8-bit bus in a new design because the memory 
requirements are generally cheaper. A designer can use this 
data path mixing to upgrade the existing system until a new 
system design is warranted, or the designer can simply im- 
prove on the existing design as new peripherals become avail- 
able. 
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Figure 1 shows the Data Bus and Control Interface between 
an 8088 microprocessor and an Am9516. This interface is 
accomplished by using an AmPAL22V10, a 74LS161 
counter, an Am2952 bidirectional I/O port, and an Am2949 
bidirectional transceiver. Figures 2 and 3 show the PAL 
device timing for both the Bus Master and the Bus Slave 
cases. 

In this design, certain simplifying constraints have been made. 
Word operations are only allowed during command chaining 
operations when the Am9516 is Bus Master. During Slave 
Write operations, the first byte output to the Am9516 must have 
an odd address, and the following second byte an even ad- 
dress. Conversely, during a Slave Read cycle, the first byte 
read from the Am9515 must be at an even address and the 
second at the next higher odd address. Furthermore, for both 



slave and master operations, the system must use the 
latched Aq (LAq) from the PAL22V10 as its sole Aq. That is, 
this lJ\o must circumvent any address latching to memory 
and so is to be used directly 

As can be seen from the figures, the AmPAL22V10 manip- 
ulates the control signals to the transceivers and latches so 
that bytes can be tunneled into words during Am9516 Com- 
mand Chaining operations and Slave Writes, and words can be 
tunneled into bytes during Am9516 Slave Reads. Also, LAO is 
alternately toggled in order to provide the dual address (16- 
bits) during chaining operations. 

Figure 4 shows the PLPL equations and resulting sum-of- 
products equations for the design (PLPL is a higher level 
"C-like" language for PAL devices). For those not familiar 
with "C" or Pascal, the intermediate sum-of-products form 
shows the equations in Boolean form. 
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Figure 1. The Am9516 UDC to 8088 CPU Data Bus and Control Interface 
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Figure 2. The Ahi9516 UDC to 8088 CPU Bus Slave Timing 
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Figure 3. The Am9516 UDC to 8088 CPU Bus Master Timing 
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DEVICE _8088_9516 (pal22V10) 

"This is a PLPL file to implement an interface between the 8-bit 
8088 and the 16-bit Am9516. This design assumes that the Am9516 
is programmed for byte operations and so only accesses 16-bit 
words during chaining. -James Williamson" 

PIN 



CK 


= 


1 


S[0:2] 


= 


2: 


AO 


= 


5 


/SEL 


= 


6 


ALE 


= 


7 


HLDA 


= 


8 


/bw 


= 


9 


READY 


= 


10 


RESET 


= 


11 



/RD 


23 


/WR 


22 


LAO 


21 


/DS 


20 


/RW 


19 


/WAIT = 


18 


/A 


17 


/B 


16 


/c 


15 


/D 


14 



BEGIN 

IF (RESET) THEN ARESETO; 

II ^rs = ^ — := = = := — — — = — — = — — — — — — — ^ — — — — — — — — — — — — — — — — '* 

"This section defines the wiggles when the Am9516 is Bus Master" 

IF (HLDA) THEN ENABLE(); 
IF (/S[2] * HLDA) THEN BEGIN 
IF (/S[l] * /SCO]) THEN 

LAO = /CK * BW + /BW * AO * ALE + 
/BW * LAO * /ALE 



ELSE 



LAO = BW + /BW * AO * ALE + 
/BW * LAO * /ALE 



END; 












IF (HLDA) 


THEN 










CASE 


(SC2:0]) 

BEGIN 

1) BEGIN 












RD 


= 


/RW 


* 


DS 




A 


= 


/BW 


* 


/RW * /CK 




WR 


= 


/BW 


* 


RW * DS 




C 


= 


/BW 


* 


RW 




WAIT 


= 


1 








END; 












2) BEGIN 












RD 


= 


/RW 


* 


DS 




B 


= 


BW 








A 


= 


/BW 


* 


/RW 




WR 


= 


/BW 


* 


RW * DS 




C 


= 


/BW 


* 


RW 




WAIT 


= 


BW 







END; 



Figure 4. PLPL Specification for 8088 to Am9516 interface 
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3) BEGIN 
RD 
B 
A 

WR 
C 



END; 



WAIT = BW 



= /RW * DS * B 

= BW * CK 

= /BW * RD 

= /BW * RW * DS 

= /BW * RW 



5) BEGIN 

RD = /RW * DS 
A = /BW * /CK 
WAIT = BW 

END; 



6) BEGIN 

RD 
A 
END; 

7) BEGIN 

RD 
A 
END; 



/RW 
BW 



/RW 
/RD 



DS 



DS 



END; 



"This section defines the wiggles when the 8088 is Bus Master' 



BEGIN 



END; 



LAO = AO * ALE * SEL + 
LAO * /ALE * SEL 
B = LAO * WR * SEL 
A = /LAO * WR * SEL 
DS = A + 

/LAO * RD * SEL 
C = /LAO * RD * SEL 
D = LAO * RD * SEL 



END. 



Figure 4. PLPL Specification for 8088 to Am951 6 Interface (Continued) 
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The addition of tiie Am9516 to an 80186 design is a natural 
choice in systems requiring four channels of DMA. Figure 1 
shows the interface between the 80186 and the Am9516 with 
a PAL device. PCS5 is programmed to provide a latched Ai 
signal. 



This interface accomplis hes t wo major control transformations. 
First, it converts RD and WR into RA/V and DS when the 80186 
is Bus Master, and vice versa when the A m9516is Bus Master. 
Secondly, the transceiver control signals, TBEN and RBEN, 
are generated from DEN and DT/R. This example shows only 
one possible configuration. Other configurations can be made 
as dictated by system requirements. A PAL device is used here 
to reduce board space. 



*, 



PCSs 
PCSo 
HOLD 
HLDA 



80186 



DT/R 
DER 
ALE 



RtJ 
WR 

S5 

CLKOUT 
ADis-AOo 



fl 




DT 

/DEN 

ALEP 



PAL16L8 



ADo 
/RD 



/TBEN 

/RBEN 

ALED 

/SEL 

Ao 

BW 

IDS 

/RW 



n 




BREQ 
BACK 



Am9516 

TIEN 
RBEN 
ALE 
CS 

B/W 

CS 

R/W 

M/RJ 

CLOCK 
AD15-AD0 



h 



If 



° Am25LS.373 ^° 



ARDY RB WR 



Ti 

M/RJ Ao 



ft 



T Am2949 

R TRANSCEIVER 



Figure 1. Am9516 to 80186 Interface 
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TITLE Am9516 TO 80186 INTERFACE 

PATTERN PATOOl 

REVISION 01 

AUTHOR JOE ENGINEER 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/06/87 

CHIP DMA_INT PAL16L8 

NC ALED ALEP HLDA BW ADO DT /DEN /SEL GND 
NC /RBEN /RD ALE AO /RW /DS /WR /THEN VCC 

EQUATIONS 

DS = RD + WR 
DS.TRST = /HLDA 

RW = DT 
RW.TRST = /HLDA 

TBEN = /DT * /SEL * DEN 
TBEN.TRST = /HLDA 

RBEN = DT * /SEL * DEN 
RBEN.TRST = /HLDA 

RD = /RW * DS 
RD.TRST = HLDA 

WR ^ RW * DS 
WR.TRST = HLDA 

ALE = /ALEP * /ALED 

AO = /ADO * /BW * HLDA * ALED 
+ ADO * BW * HLDA * ALED 
+ /ADO * /HLDA * ALEP 
+ AO * /ALEP 
+ AO * /ALED 

; DESCRIPTION 

; THIS PAL DEVICE CONVERTS THE CONTROL SIGNALS TO INTERFACE THE 80186 
; TO THE Ain9516 DMA CONTROLLER. 

; SIMULATION NOT INCLUDED 



Figure 2. Source Listing for Example of Figure 1 
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Introduction 

Commercial and industrial microprocessor-based systems typi- 
cally consist of a processor interfaced with many peripherals 
which randomly require service. To fully utilize the processor's 
computing potential, sources of hardware interrupts must be 
used to free the processor from software routines. The 68000 
16-bit microprocessor has 256 different exception-processing 
vectors which point to predetermined locations in the program 
memory space. There are 1 92 external user interrupt vectors and 
seven autovector interrupts. Table 1 shows the exception vector 
assignments. The interrupt structure of the 68000 will be dis- 
cussed in more detail along with two methods of designing an 



Vector 


Address 


Assignment 


Number(s) 


Dec 


Hex 


Space 








000 


SP 


Reset: Initial SSP 


- 


4 


004 


SP 


Reset: Initial PC 


2 


8 


008 


SD 


Bus Error 


3 


12 


OOC 


SD 


Address Error 


4 


16 


010 


SD 


Illegal Instruction 


5 


20 


014 


SD 


Zero Divide 


6 


24 


018 


SD 


CHK Instruction 


7 


28 


01C 


SD 


TRAPV Instruction 


8 


32 


020 


SD 


Privilege Violation 


9 


36 


024 


SD 


Trace 


10 


40 


028 


SD 


Line 1010 Emulator 


11 


44 


02C 


SD 


Line 1111 Emulator 


12* 


48 


030 


SD 


(Unassigned, reserved) 


13* 


52 


034 


SD 


(Unassigned, reserved) 


14* 


56 


038 


SD 


(Unassigned, reserved) 


15 


60 


03C 


SD 


Uninitialized Interrupt Vector 


16-23* 


64 


04C 


SD 


(Unassigned, reserved) 




95 


05 F 




- 


24 


96 


060 


SD 


Spurious Interrupt 


25 


100 


064 


SD 


Level 1 Interrupt Autovector 


26 


104 


068 


SD 


Level 2 Interrupt Autovector 


27 


108 


06C 


SD 


Level 3 Interrupt Autovector 


28 


112 


070 


SD 


Level 4 Interrupt Autovector 


29 


116 


074 


SD 


Level 5 Interrupt Autovector 


30 


120 


078 


SD 


Level 6 Interrupt Autovector 


31 


124 


07C 


SD 


Level 7 Interrupt Autovector 


32-47 


128 


080 


SD 


TRAP Instruction Vectors 




191 


OBF 




- 


48-63* 


192 


OCO 


SD 


(Unassigned, reserved) 




255 


OFF 




- 


64-255 


256 


100 


SD 


User Interrupt Vectors 




1023 


3FF 




- 



Vectornumbers12, 13 
future enhancements, 
numbers. 



,14, 16 through 23 and 48 through 63 are reserved for 
No user peripheral devicesshould be assigned these 



interrupt controller using PAL (Programmable Array Logic) 
devices. 

68000 Exception Processing and 
Pin Description 

The interrupt structure of the 68000 can grant up to 256 types of 
interrupt requests. These interrupts, orexceptions, are generated 
either by external or internal causes and are serviced by directing 
the flow of program to an exception processing routine. Table 1 
lists these exceptions and their respective address in memory. 
Exception vectors are locations in memory where the processor 
fetches the address of a routine or subprogram which will service 
that exception. The exception vector is either internally or exter- 
nally generated depending on the cause of the interrupt. The 
externally generated exceptio ns ar e inter rupt s whic h are placed 
on the interrupt control pins (IPL2, IPL1 and IPLO), bus e rrors , 
and reset requests. The interrupts placed on control pins IPL2- 
IPLOare requests from peripheral devices. The internally-gener- 
ated exceptions come from instructions, address errors, or 
tracing. These different types of exceptions and their priorities 
are shown in Table 2. We will focus on group 1 interrupt 
exceptions. 



Group 


Exception 


Processing 





Reset 

Bus Error 

Address Error 


Exception processing begins 
within two clock cycles. 


1 


Trace 

Interrupt 

Illegal Privilege 


Exception processing begins 
before the next instruction. 


2 


TRAP, TRAPV, 

CHK, 

Zero Divide 


Exception processing is started 
by normal instruction execution. 



Table 2. Exception Grouping and Priority 



The pinout of the 680 00 is shown in F igure 1. The three 
interrupt control pins (IPL2, IPL1, IPLO) are asynchronous 
active low inputs which indicate the encoded priority level of 





vi^M^)^ 


68000 
MICROPROCESSOR 


ADDRESS 


Na1-A23 
[Nd0-D15 




GND(2) 


BUS 




CLK ^ 






^ FCO 




< DATA BUS 
AS ^ 




R/W ^ 






Dds ^ 


ASYNCHRONOUS 


PROCESSOR 


^ FC1 


LDS ^ 


BUS CONTROL 


STATUS 


^ FC2 






^ DTACK 






,E 


, BR 


BUS 

ARBITRATION 

CONTROL 


M6800 
PERIPHERAL 


^VMA 


BG ^ 


VPA ^ 




CONTROL 


^BGACK 




BERR ^ 


* iPLO 


SYSTEM 


RESET 


ipu [interrupt 


CONTROL 


^HALT^ 


- FPE2 1 CONTROL 






^ 



Table 1. Exception Vector Assignment 



Figure 1. 
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the d evice requesting an interrupt. The l east significan t bit is 
IPLO and t he most significant bit is IPL2. Level seven, (IPL2, 
IPL1, IPLO) = (000), has the highest priority, while level zero, 
(111), indicates that no interrupts are requested. All lower or 
equal priority interrupts are masked during the interrupt 
service routine of the present interrupting device. There are 
two ways by which a peripheral' device can interrupt the 
normal flow of program: autovectoring or externa! vector 
number generation. The function code pins (FC2, FC1, and 
FCO) all become high during an interrupt acknowledge cycle. 
The interrupt acknowledge cycle always follows an interrupt 
request only after the present instruction cycle is completed. 
Thus interrupt acknowlege cycles come in between the 
instruction cycles and no information is lost. 

The signals which are used during an external interrupt 
request are listed below with a description of their behavior 
and function. 

ADDRESS BUS: the 24-bit address bus holds the address of the 
data to be accessed. During an interrupt acknowledge cycle the 
lower three bits (A3, A2, A1 ) hold the encoded level of the inter- 
rupt being serviced. If a level 5 interrupt is being serviced then 
(A3, A2, A1) will be (101) respectively. 

DATA BUS: The lower eight bits of this 16-bit data bus must 
contain the vector number during a user interrupt acknowledge 
cycle. If an autovector routine is in process then the data bus is 
ignored. 

AS: The processor asserts address strobe anytime there is valid 
data on the address bus. It remains asserted for as long as the 
address is valid. 

R/W: This signal defines the data bus transfer as a read or a write 
cycle. During an interrupt acknowledge cycle the processor is in 
a read mode. 

UDS, LDS: The upper and lower data strobes are asserted when 
the processor is in a read or write instruction cycle. Upper strobe 
enables the most significant byte of data while the lower strobe 

INTERRUPT ACKNOWLEDGE SEQUENCE 

FLOW CHART 

PROCESSOR INTERRUPTING DEVICE 

REQUEST INTERRUPT 



GRANT INTERRUPT 

1) COMPARE INTERRUPT LEVEL IN STATUS REGISTER 
AND WAIT FOR CURRENT INSTRUCTION TO COMPLETE 

2) PLACEJNTERRUPT LEVEL ON A1, A2, A3 

3) SET R/W TO READ 

4) SET FUNCTION CODE TO INTERRUPT ACKNOWLEDGE 

5) ASSERT ADDRESS STROBE (AS) 

6) ASSERT LOWER DATA STROBE (LDS) 

' 1 

PROVIDE VECTOR NUMBER 

1) PLACE VECTOR NUMBER ON D0-D7 

2) ASSERT DATA TRANSFER ACKNOWLEDGE (DTACK) 



A4-A233~( 



enables the least significant 



DTACK: Data transfer acknowledge is an externally-generated 
signal whic h tells th e processor that valid data is present on the 
data bus. If DTACK is not asserted before the falling edge of S4 
(S4 is the fourth CPU clock state in a seven-state instruction cycle) 
then wait states are introduced. 

IPL2-1PL0: The three interrupt control pins are inputs which 
contain the encoded priority level of the external interrupting 
device. Note that these are active low pins where (000) is level 
seven encoded. 

FC2-FC0: Function code output pins indicate the processor's 
status. When they are all high the processor is in an interrupt 
acknowledge cycle. 

E, VMA, VPA: Enable, valid memory address, an d val id peripheral 
address are the standard 6800 family signals. VPA is also used 
when autovectoring. 

The following two design examples use PAL units as a simple 
and cost-effective interrupt controller interface to the 68000. The 
first introduces external vector generation while the second 
shows how autovectoring can be done on the 68000. 

Prioritized Individually Vectored 
Interrupt (Method 1 ) 

As shown in the interrupt acknowledge sequence flow chart and 
timing diagram (Figure 2), a peripheral device must interrupt the 
processor by placing the encod ed level of priority on the 
interrupt control pins (IPL2-IPL0). The processor then grants 
the interrupt after the completion of the current instruction cycle. 
The encoded priority level of the interrupt is placed on address 
bus bits A3-A1 during the interrupt acknowledge cycle. The 
status code lines (FC2-FC0) become high, indic ating a n interrupt 
acknowledge cycle. Once the lower data strobe (LDS) is asserted 
the external device must place an 8-bit vector on the least signif- 
icant byte of the data bus. This data is latched in state S7. 

INTERRUPT ACKNOWLEDGE SEQUENCE 
TIMING DIAGRAM 



ACQUIRE VECTOR NUMBER 

1) LATCH VECTOR NUMBER 

2) NEGATE LDS 

3) NEGATE AS 

I 




D \r 



RELEASE 



FC0-23C 



1) NEGATE DTACK 
I 



START INTERRUPT PROCESSING 



LAST BUS CYCLE 
OF INSTRUCTION 
(READ OR WRITE) 



STACK 
PCL 
(SSP) 



lACK CYCLE 

(VECTOR NUMBER 

ACQUISITION) 



STACK AND 

VECTOR 

FETCH 



Figure 2. 



Monolithio 



m 



Memories 



^ 



2-173 



68000 Interrupt Controller 



As shown on Table 1 vectors 64-255 are assigned to the user 
interrupt vector numbers. These vectors must be generated 
externally and placed on the data bus during an interrupt 
acknowledge cycle. In our interrupt controller we arbitrarily 
choose vectors 249-255 as the vectors assigned to the inter- 
rupting peripheral devices. Table 3a shows this assignment 
and how the vector can be decoded from the address 
bits A1-A3. 



Priority 


Vector 






level 


assigned 


Data 


Address 


Device # 


Decimal # 


D7 D6 05 D4 D3 D2 D1 DO 


A3A2A1 


1 (low priority) 


249 


111110 1 


1 


2 


250 


111110 10 


1 


3 


251 


111110 11 


1 1 


4 


252 


1111110 


1 


5 


253 


1111110 1 


1 1 


6 


254 


11111110 


1 1 


7 (high priority) 


255 


11111111 


1 1 1 



Table 3a. 

The two PAL devices used on this design example generate the 
interrupt vectors and all the necessary control signals. The var- 
ious signals and their implementation on the PAL devices are 
explained below. 

INT7-INT0: Any of the seven peripheral devices can 
request an interrupt by asserting one of these inputs. The 
interrupt must remain asserted until acknowledged by the 
CPU. 

FC2-FC0 and AS: Function code or processor status code 
become logical high during an interrupt acknowledge cycle. 
Add ress stro be is asserted anytime valid address is on the 
bus. DTACK and Data output control are decoded from these 
four outputs of the 68000. 

A1-A3: The three least significant bits of the address bus 
contain the encoded level of the interrupting device. These 
signals are used in generating the vector number which is 
to be put on the data bus. 

RESET: Reset is an input which clears all outputs, and is 
used for system initialization. 

IPL2-IPL0: These PAL device outputs are active low synchro- 
nous signals which interface directly to the CPU. They represent 
the encoded level of the highest priority interrupt that is 
requested. Table 3b shows the truth table of our priority encoder 
implemented on a PAL device. 



Interrupt request 


input from peripheral 


Encoded int 


. level 






















INT7 


INT6 


INT5 


INT4 


INT3 


INT2 


INT1 


IPL2 


IPL1 


IPLO 





X 


X 


X 


X 


X 


X 
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Table 3b. 



DTACK: Data transfer acknowledge must be asserted by outside 
circuitry during a data transf er opera tion. The logic diagram 
shown below illustrates how DTACK is derived from address 
strobe and processor status signals. 



lO 



^ 



D7-D0: The three least significant bits of the data output can be 
decoded straight from the address bits A3, A2 and A1. That is 
D2= A3, D1 = A2 and D0= A1 . The other five bits of data can be held 
high with pull-up resistors. Outputs of the three data bits become 
enabled by using the same scheme which enables the DTACK 
output. 



INTACK7-INTACK1: Only one of these signals will be 
asserted during the interrupt acknowledge cycle. This signal 
feeds back into the interrupting device to tell that device 
that its interrupt has been acknowledged. We can use the 
3-bit addresses to decode these signals as shown in Table 3c. 
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Table 3c. 



The logic diagram of the controller is shown in Figure 3. The 
controller can operate without any wait states at the fastest CPU 
clock rate of 1 2.5 MHz as shown in the timing diagram of Figure 4. 



Auto-vectored Interrupts (Method 2) 

The seven autovector interrupts are assigned vector numbers 25 
through 31. Interrupts are requested by placi ng the enc oded 
level of the request on the interrupt control pins (IPL2-IPL0). The 
processor responds to this request by placing the requested level 
in the processor's status register and by inhibiting all requests of 
lower priority. When the current instruction cycle is completed 
an interr upt a cknowledge cycle takes place. If Valid Peripheral 
Address (VPA), is asserted before the falling edge of S4 then an 
autovector routine takes place. The data or the vector number is 
generated internally depending on the priority level of the inter- 
rupt request; the vector assigned is shown on the table at the 
beginning of this report. The autovector timing diagram and a 
very simple and practical interrupt controller implemented on a 
PAL device is shown in Figure 5. The PAL desi gn specifications 
are included in the appendix. Note that VPA is generated by 
enabling the PAL output when FC2-FC1 are high and AS is 
asserted. 
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Interrupt Controller Logic Diagram 
External Vector Generation 



AS> 



A3> 
A2> 

A1> 
FC2> 
FC1> 
FCO> 




Figure 3. 



so SI S2 S3 S4 S5 S6 S7 




WRITE MACHINE CYCLE 



INTERRUPT ACKNOWLEDGE CYCLE 



1 CLOCK PERIOD (80 ns) 

2 CLOCK LOW TO ADDRESS (55 ns) 

3 CLOCK HIGH TO AD^DRESS Hl-Z (60 ns) 

4 CLOCK HIGH TO AS LOW (55 ns) 

5 CLOCK LOW TO A£HIGH (50 ns) 

6 CLOCK HIGH TO_DS LOW (55 ns) 

7 CLOCK LOW TO DS HIGH (50 ns) 



8 CLOCK HIGH TO R/W HIGH (60 ns) 

9 CLOCK HIGH TO R/W LOW (60 ns) 

10 CLOCK HIGH TO FC VALID (55 ns) 

11 CLOCK HIGH TO DAT A Hl-Z (60 ns) 

12 AS HIGH TO DTACK HIGH (70 ns) 

13 PAL CLOCK TO OUT (25 ns) 

14 MINIMUM SETUP TIME (20 ns) 



15 DATA VALID TO DS L^W (15 ns) 

16 CLOCK HIGH TO AS, DS LOW (55 ns) 

17 PAL INPUT TO Hl-Z (25 ns) 

18 PAL INPUT TO Hl-Z (25 ns) 

19 PAL INPUT TO Hl-Z (25 ns) 



Figure 4. 
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AUTOVECTOR OPERATION TIMING DIAGRAM 
SO S2 S4 S6 SO S2 S4 w w w w w w w w w w S6 SO S2 

cLKjiJinjifiRjinjinjiTLTUTriJ^^ 



PAL16R4 




iNT7^ 
iNT6>- 

iNT5> 
rNT4> 
iNT3>- 
iNT2> 
iNTi>- 
AS> 




(TIE TO PULLUP 
RESISTOR) 



''*"NORMAL'*"r* AUTOVECTOR OPERATION" 



Figure 5. 



Conclusions 

We have just seen hov^^ to implement an interrupt controller 
circuit using one or two PAL devices. This circuit can operate at 
the maximum operating frequency of the 68000 which is 12.5 
MHz. Since the most critical timing of the circuit is the PM 
device unit's input to Hi-Z, we see that we can operate the circuit 
with a wide spectrum of frequencies and with slower PAL devi- 
ces. To guarantee operation, timing spec. No. 16 and No. 19 on 
Figure 4 must add up to assert DTACK 20 ns prior to falling edge 
of S4. Thus 55 ns + (PAL device input delay to Hi-Z) should be 
less than, or equal to, 100 ns for a 12.5-MHz clock. We see that 
this qualifies fast, regular, half-power and quarter-power PAL 
devices for this circuit. 
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TITLE INTERRUPT CONTROLLER! 

PATTERN INTA.DAT 

REVISION 01 

AUTHOR DANESH TAVANA 

COMPANY MONOLITHIC MEMORIES, INC. 

DATE 11/06/87 

CHIP INTA PAL16R4 

CLK /INT7 /INT6 /INT5 /INT4 /INT3 /INT2 /INTl /AS GND 
/OC FCO FCl NC /IPLO /IPLl /IPL2 /DTACK FC2 WCC 

EQUATIONS 

DTACK = VCC ; ASSERT IF OUTPUT ENABLE 

DTACK. TRST = FC2*FC1*FC0*AS 

IPL2 := INT? ; PRIORITY ENCODED BIT 

+ /INT7* INT6 
+ /INT7*/INT6* INT5 
+ /INT7*/INT6*/INT5* INT4 

IPLl := INT7 ; PRIORITY ENCODED BIT 

+ /INT7* INT6 

+ /INT7*/INT6*/INT5*/INT4* INT3 
+ /INT7*/INT6*/INT5*/INT4*/INT3* INT2 

IPLO := INT7 ; PRIORITY ENCODED BIT 

+ /INT7* INT6 

+ /INT7*/INT6* INT5*/INT4* INT3 
+ /INT7*/INT6*/INT5*/INT4*/INT3*/INT2* INTl 

; SIMULATION NOT INCLUDED 
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TITLE INTERRUPT C0NTR0LLER2 

PATTERN INTC.DAT 

REVISION 01 

AUTHOR DANESH TAVANA 

COMPANY MONOLITHIC MEMORIES, INC. 

DATE 11/06/87 

CHIP INTC PAL20L10 

/AS A3 A2 Al FC2 FCl FCO NC NC NC NC GND 
NC /INTACKl /INTACK2 /INTACK3 /INTACK4 /INTACK5 /INTACK6 /INTACK7 DO 
Dl D2 VCC 

EQUATIONS 

/D2 = /A3 

/D2.TRST = FC2*FC1*FC0*AS 

/Dl = /A2 

/Dl.TRST = FC2*FC1*FC0*AS 

/DO = /Al 

/DO.TRST = FC2*FC1*FC0*AS 

INTACK7 = A3* A2* Al 
INTACK7.TRST = FC2*FC1*FC0*AS 

INTACK6 = A3* A2*/A1 
INTACK6.TRST = FC2*FC1*FC0*AS 

INTACK5 = A3*/A2* Al 
INTACK5.TRST = FC2*FC1*FC0*AS 

INTACK4 = A3*/A2*/A1 
INTACK4.'rRST = FC2*FC1*FC0*AS 

INTACK3 =/A3* A2* Al 
INTACK3.TRST = FC2*FC1*FC0*AS 

INTACK2 =/A3* A2*/A1 
INTACK2.TRST = FC2*FC1*FC0*AS 

INTACKl =/A3*/A2* Al 

INTACKl. TRST = FC2*FC1*FC0*AS 

; SIMULATION NOT INCLUDED 
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Memories are an integral component of any computer system. 
PROMs (Programmable Read Only Memory), SRAMs (Static 
Random Access Memory), and DRAMs (Dynamic Random 
Access Memory) are used most often to store programs and data. 
PROMs are used to store programs which can only be read by the 
computer. The SRAMs and DRAMs are used for reading and 
writing data and programs. Figure 1 shows a system processor 
interfaced to PROMs, SRAMs and DRAMs. 

DRAMs are cheaper bulk memory, and are used instead of 
SRAMs in many systems. DRAMs are available in speeds 
ranging from 70 ns to 250 ns access times, enough to keep up 
with the new generation processor speeds. This performance is 
also sufficient to match the usually high speed SRAMs in many 
existing and newdesigns. DRAM densities have also progressed 
from 64K-bit in 1983 to iM-bit in 1986, and 4M-bit and 16M-bit 
DRAMs are being developed. DRAMs are also available in space 
efficient SIP, SIMM and ZIP (Zig-zag inline Package) packages 
with modules that can carry up to eight or nine DRAM devices. In 
this discussion we will focus on the design of PROM, SRAM and 
DRAM control circuitry using PLDs. 



Large memory requirements result in the use of many DRAMs 
which cannot be conveniently located on the same board with the 
CPU. DRAM devices are placed on separate boards and con- 
nected to the processor board via a bus. Off-board, DRAM 
devices mandate the use of specialized outputs to drive the 
address and control traces of the DRAM array. In systems with 
many off-board DRAMs the soft error rate is high and sometimes 
calls for the use of error detection and correction circuits. A 
detailed discussion of EDO (Error Detection and Correction) 
circuit design with PLDs is also included here. 

The proximity of the DRAMs to the processing units depends on 
the hierarchy of the DRAM memory in the system. This hierarchy 
depends heavily on the speed of the DRAM system versus the 
processor speed, and on the required addressing space. When 
the processor speed can be accommodated by the DRAM 
devices, it is efficient to place a DRAM subsystem as close to the 
processor as possible and to use it as the imnhediate storage 
space. When DRAMs are not fast enough for the processor, they 
must be used as remote, main-storage, while static-RAM-based 
caches are used in the processor's immediate proximity. A 
review of such cache systems design using PLDs is also included 
here. 
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Figure 1 . Processor interface to PROMs, SRAMs and DRAMs 



L^ MonolHMcMu} Memories £1 



2-179 



Memory Control 



PROM and SRAM Control 

The cx3ntrol of static RAM or PROM devices is very simple. Both 
of these devices provide a chip select input signal which activates 
the devices to read or write data. The chip select control of these 
devices is enabled by the decoded processor address signals. 
The address bits selected depend upon the range of addresses 
covered by each SRAM or PROM. More detail on address 
decoders can be found on page 2-35. 

To optimize the number of productterms used, the address range 
is generally selected so that the boundary addresses are power- 
of-two numbers. From page 2-53, for an address decoder we 
have the following equations: 

For an n-bit address X in the range from B+1 to A-1 (A>X>B) 

A= 0010000-00 (a power-of-two number) 

n p 1 

B= 0000011—11 (a power-of-two number minus one) 

n q 1 



Some memories so not provide a separate OE pin (Figure 2). 
Generating CS for these memories can cause serious bus 
contention problems. Often this problem is solved by gating the 
CS with read (R) or write (W) signals as follows: 



/CS 


= 


(X) 


* 


/R 




+ 


(X) 


* 


/w 


/WE 


= 


/w 







CS = /Xn * /Xn-1 * 

(Xp-1 + Xp-2 + - 



/Xp * 
- + Xq) 



This approach allows usually sufficient time for other buffers on 
the data bus to be disabled. This works well for the read cycles 
in which the memory is expected to be the only bus driver. 
However, for the write cycles where the processor is usually the 
data bus driver, this does not quite solve the problem completely. 

In the above equation, the CS is generated either from the read 
or the write signal. If the CS line is generated before the WE is 
asserted low, the memory will drive the data lines, which are 
already being driven by the processor, causing bus contention. 
There is still no way to ensure that CS is generated only afterthe 
WE signal to the memory is asserted. Some delay can be added 
by controlling the CS line from the WE signal, by gating the W 
signal with the WE signal (for CS) (Figure 3) as follows: 



Typical SRAMs also include a read/write input signal which 
selects between a read and a write operation. This signal is 
usually directly connected to the processor read/write signal. 
PROMs are "read only", and thus do not require this signal. 

PROM chip select signals should only be asserted for read 
cycles. Attempting a write to a PROM can cause bus contention 
problems where the memory and processor data signals attempt 
to drive the same bus lines simultaneously. This can be easily 
avoided by ensuring that the chip selects are not asserted for a 
processor write cycle. The PROM chip select signals are gated 
with the read signal (R) to insure a chip select only when R is low. 

/CS_PROM = (X) * /R ; (X is a 

combination of address signals) 

Most of the "second generation" SRAMs have output enable (OE) 
and write enable (WE) pins in addition to the chip select (CS) pins. 
A separate and independent OE line eliminates the data bus 
contention problem entirely. The OE line is disabled before 
writing to the SRAM, allowing the data pins to be driven by the 
external bus. The OE line is enabled only during the read cycles. 
Most microprocessors provide adatasignal which can be directly 
connected to the OE lines, providing the proper timing to avoid 
bus contention. 





CS = (X)*W*WE WE = W*CS- 

Figure 3. Waveform for SRAM Control Without OE 



/CS = (X) * /R + (X) * /W 



/WE 



This ensures that the WE signal of the memory is low before the 
CSisasserted. Thissolvesonlypartofthe problem. Ontheother 
edge, WE should be removed only after CS is removed. This can 
be accomplished by controlling WE from the CS by gating CS 
along with the WE signal: 

/WE = /w * /CS 

This ensures the assertion of WE as long as the CS is asserted 
and W is low. As soon as the CS is unasserted, the WE of the 
memory will be unasserted. These signals CS and WE used for 
controlling each other are simply the feedback from the appropri- 
ate I/O pins. The feedback path provides the extra delay. 



Figure 2. Controlling an SRAM Without OE 
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Access Time Considerations 

PROMs and SRAMs with data access time wliich match the 
processor speeds are commercially available. Sometimes, 
however, there is a speed mismatch. Systems with different 
types of memories encounter this problem the most. For such 
systems, different memories generate their own data available 
(DAV) and write complete signals and inform the processor of 
cycle completion. This allows processors to prolong their cycles 
and add wait states, to match the memory speeds. These signals 
are generated by using PAL devices which can generate timing 
delays. Registered PAL devices are ideal candidates for such 
timers. The application note on page 2-184 illustrates one such 
memory design where a PAL device generates the memory 
control timing signals. This design also implements the memory- 
to-processor interface handshake logic. 

DRAM Systems 

In any system that uses dynamic RAMs, special circuitry in the 
address and control lines must be introduced to drive the memory 
chips and link them to the CPU (see Figure 4). Acircuit known as 
a DRAM controller performs several functions: 

• Multiplexes the row and column addresses 

• Provides access timing 

• Provides refresh timing and addresses 

• Arbitrates between access and refresh 

• Drives the DRAMs' capacitive load inputs 

DYNAMIC 
MEMORY 



<: 



^ 



CONTROL 



DYNAMIC RAM 

INTERFACE 
AND CONTROL 



ADDRESS 



H 



a 



RAS 
CAS 
WE 



> 



Figure 4. Functions of a DRAM Controller 



Multiplexing processor addresses into row and column ad- 
dresses is required because the DRAM architecture divides the 
addresses into rows and columns. This reduces the number of 
pins required for addressing the DRAMs and consequently the 
package size. The external DRAM controller multiplexes the 
addresses and also generates the RAS (Row Address Strobe) 
and CAS (Column Address Strobe) signals to the DRAMs to 
distinguish between these two addresses. Every DRAM access, 
for reading or writing data, requires a predetermined timing 
sequence of RAS, CAS and WE (Write Enable) signals. A DRAM 



controller's main task is to generate these read and write cycle 
timings. In addition to the regular read and write cycles, many 
DRAMs are available which provide higher data bandwidth read 
and write cycle. Such nibble mode, ripple mode or page mode 
cycles require different sequences of RAS and CAS signals. 
Depending upon the data requirements, these schemes are used 
for high speed data throughput in various systems. 

DRAMs also require intermittent refresh cycles to charge the 
capacitors for every row of data. This refresh is provided by a 
specific sequence of RAS and CAS signals. There are three 
different refresh cycles, one of which is selected according to the 
system requirements. These are: 

• Regular RAS only refresh cycle 

• CAS before RAS refresh cycle 

• Hidden refresh cycle. 

The regular RAS-only refresh cycle requires an external refresh 
address generator which sequences through all the row ad- 
dresses of a DRAM. The CAS before RAS refresh cycle is only 
provided by certain DRAMs. These DRAMs are usually costlier, 
since they require internal circuitry for row address generation. 
The hidden refresh cycle DRAMs retain the last read/write cycle 
data at the outputs during the refresh cycles, which is helpful in 
certain designs. 

In addition to the type of refresh cycle, the frequency of refresh 
cycles also varies in different systems. Some systems provide 
intermittent refresh when refresh cycles are executed at fixed 
time intervals. The DRAM accesses are sometimes delayed 
pending an intermittent refresh cycle. With burst mode refresh, 
a number of refresh cycles are completed simultaneously, allow- 
ing the processor an uninterrupted access for read and write 
cycles later. Time critical systems usually find burst mode more 
useful. 

DRAM Controllers and PLDs 

Many DRAM systems make use of a VLSI DRAM controller. 
Most VLSI DRAM controllers perform the DRAM control func- 
tions which do not change greatly form one system to another. 
Excellent representatives of this approach are the 2964, 2968 
and the 673103 Dynamic RAM Controller/Drivers (Figure 5). 

Most VLSI DRAM controllers do not support all of the various 
types of read/write cycles, refresh cycles and refresh timings. 
Custom DRAM controllers are required forthe following reasons: 

• Higher throughput bandwidth 

• Different memory access times 

• Different refresh cycles and mechanisms 

• Processor related requirements. 

Depending upon the specific system requirements, DRAM con- 
trollers can be made from PLDs which support the specific cycles 
required by a design. 

Fast registered PLDs are used as DRAM controllers in many 
designs. Functions of a DRAM controller are usually split into a 
number of PLDs. The application note on page 2-187 shows a 
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Figure 5. A VLSI DRAM Controller 
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DRAM controller for a 68020 microprocessor which implements 
a custom data sizing and alignment function required by the 
microprocessor. 

VLSI DRAM controllers use PALdevicesfor a variety of functions. 
These functions depend upon the capabilities of individual VLSI 
controllers. Three detailed application notes are included in the 
subsequent sections which illustrate the interface of these VLSI 
DRAM controllers to the various microprocessors. The applica- 
tion note en page 2-202 shows an AMD 8088 microprocessor 
interfaced to a 2968 DRAM controller using a PALI 6R4. The ap- 
plication note on page 2-210 shows a 68000 microprocessor 
interface tothe 2968. The interface between the DRAM controller 
and the CPU accommodates the specific access protocols and 
speeds of the CPU as well as the access and refresh timing of the 
dynamic RAMs. This interface between the controller and the 
rest of the system is best implemented using fast programmable 
logic devices such as the PAL16R8D family devices. 

Error Detection and Correction 

Compared to PROMs and SRAMs, the physical dimensions, 
signal levels, and the stored charge of the DRAM cells are greatly 
reduced to allow higher density. As the stored charge in the 
DRAM cells decreases, the device is more susceptible to soft 
errors caused by alpha particles as well as by environmental 
noise. 

Soft errors are temporary, random, and they can be corrected by 
rewriting intothe erroneous cell. Therefore, if thesystem has the 
ability to locate the bit in error, the soft error can be corrected. One 
method to locate the bit in error is to append a fixed number of 
check-bits to the data word and store these check-bits along with 
data. This increases the data width of the DRAM system. Upon 
reading, both the data and check-bits are read into the EDAC. 
The check-bits are regenerated from the read data, and these 
newly generated check-bits are compared with the read check- 
bits. The comparison produces the syndrome bits. The syn- 
drome bits constitute a binary numberthat points to the bit in error. 

This encoding scheme of generating the check-bits and the 
syndrome bits is called a Hamming code, after its inventor. The 
modified Hamming code has one more check-bit than the original 
Hamming code, but it can detect all double-bit errors as well as 
detect and correct all single-bit errors. In addition it can detect a 
substantial number of multiple-bit errors. The modified Hamming 



code has proved particularly useful for its relatively low overhead 
in today's wide-data-word systems and its capability to detect and 
correct single-bit errors and detect all double-bit errors. 

A number of XOR logic functions are required for generating 
Hamming codes. PALdevices which provide XOR gates are ideal 
candidates for such designs. Details of Hamming codes and a 
sample eight-bit error detection and correction application note 
which uses the PAL1 6X4 is included on page 2-229. 

Cache Memory Systems 

Fast access time SRAM caches are used for very high-speed 
processors. These allow high system performance while retain- 
ing the low cost of DRAM bulk memory. Based upon how the 
SRAM cache addresses map to the DRAM addresses, three 
major cache schemes are used. 

For fully associative caches, data in a DRAM location may be in 
any cache location. For direct mapped caches, data may be in 
one of the many locations determined by higher order address 
bits. For set associative caches, data may be in one particular 
location of every set. Depending upon the performance required 
and other such system constraints such as cost, board space etc. 
one of the three schemes is used. Details of cache system design 
are included in the application note on page 2-239. 

A major task in cache memory systems is replacement, which 
requires transferring data between the cache and the DRAMs. 
This is done in an effort to keep the cache filled with only those 
blocks of memory which are in use by a program. The replace- 
ment task requires a controller with intelligence, which can read 
cache data and write to the DRAMs and vice versa. This requires 
a state-machine-based controller. Since most cache designs are 
custom designs, due to various system constraints, limited VLSI 
solutions are available. State-machine-based controllers such 
as the Am29PL141 and the PMS14R21 can perform these 
functions very effectively. 

Other uses of PLDs in cache memory systems include general 
glue logic which is required for controlling SRAM caches or tag 
buffers. Such PLDs are usually very high-speed combinatorial 
PLDs. Monolithic Memories also provides a number of ECL 
PLDs which can be used in ECL cache memory systems. These 
ECL PAL devices allow 6 ns access times. 
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A typical control logic problem is the memory-to-processor 
handshake on memory transfer used in many computer ar- 
chitectures. The processor makes a transfer request by activat- 
ing a request line (REQ) and specifies a read or write operation 
on a Read/Write line (R/W). 

During a read operation, the processor waits for a Data Avail- 
able signal at which time the data bus is sampled and the re- 
quest line lowered, thus completing the cycle. During a write 
operation, the processor places data on the bus and waits for a 
Write Complete signal after the write cycle is finished. Upon 



write complete, the request line is lowered, hence completing 
the cycle. Figure 2 shows the state assignments and the appro- 
priate outputs. The state diagram is shown in Figure 1 . Also the 
handshaking operation is illustrated in the timing diagram of 
Figure 3. 

The memory-board logic to implement this function may be de- 
signed with gates and edge-triggered flip-flops as shown in Fig- 
ure 4. This particular design would require about five SSI/IVISI 
packages, but the same design can be implemented by a single 
PAL16RP6. The PAL device design specification using state 
equations is shown on page 2-186. 
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Figure 1. State Diagram— Memory Handshal<e Logic 
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Figure 3. Memory Handshake Timing 
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Figure 4. Memory Handshal<e Logic 
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PAL Device Design Specification 



Title 


Memory Handshake 


Logic 










Pattern 


Mcraoryl.pds 












Revision 
Author 


B 

Kelvin Chow/Nlck 


Schmitz 










Company 


AMD/MMI 












Date 


2/28/85 












CHIP Memory PAL16RP6 












CLK ADRl 


ADR2 ADR3 ADR4 


REQ 


RW 


INIT 


NC 


GND 


/OE NC 


WC CI CO 


WE 


DA 


DOUT 


NC 


VCC 



STATE MEALY_MACHINE 
DEFAULT OUTPUT /DOUT /DA 
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C 5 
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C_4 -> WAIT 


READl. o 
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-Q 


* /RW * ADRl * ADR2 * ADR3 * 


C 3 = ( 
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/ADR2 + /ADR3 + 


C 4 = ( 


REQ + 


/ADRl + 


/ADR2 + /ADR3 + 


C 5 = It 
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+ C 5 -> WAIT 



ADR4 * /INIT 

ADR4 * /INIT 
/ADR4) * /INIT 
/ADR4 + /RW) * /INIT 



SIMULATION 

TRACE ON CLK REQ RW DOUT 



DA WE WC CI CO 




SETF INIT /REQ OE RW ADRl ADR2 ADR3 ADR4 
CLOCKF CLK CLOCKF CLK 
CLOCKF CLK CHECK wait /DOUT 



SETF REQ /INIT 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 



CHECK readl 
CHECK read2 
CHECK read3 
CHECK read3 



SETF INIT 

CLOCKF CLK CHECK wait 
CLOCKF CLK CHECK wait 
CHECK /DOUT /DA 



SETF /INIT REQ /RW 



CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 

TRACE OFF 



CHECK writel 
CHECK count 1 
CHECK count 2 
CHECK count 3 
CHECK wrlte2 
CHECK write3 
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Ever increasingly dense DRAMs and their interface to the emerg- 
ing sophisticated 32-bit microprocessors provide an increasing 
numberof system options to the designer. In orderto utilize these 
advanced DRAM features and to satisfy the individual system de- 
sign requirements, custom DRAM controllers are required. PAL 
device sequencers, such as the PAL23S8, provide effective con- 
trol sequencing power, flexibility, customizability and ease of use. 
They offer a very desirable alternative to the dedicated function- 
ality of VLSI DRAM controllers. 

The following megabit DRAM controller design based on the 
PAL23S8 sequencer is illustrative of a design which is easily 
customized to a system requirement. It supports features such as 
data bus sizing and data alignment which are required for the 
emerging 32-bit microprocessors, but which are not yet easily and 
adequately supported by the VLSI controllers. 

A DRAM controller needs to support at least the following func- 
tions: 

• Flexible Refresh Generation 

• Flexible Arbitration 

• Processor Handshake 

• Processor Cycle Timing and Control 

• Refresh Cycle Timing and Control 

Flexible Refresh Generation 
Mechanisms 

For refresh generation, different system designs may require 
either burst mode, intermittent mode or both kinds of refresh. In 
burst mode, refresh requests occur once very 4 milliseconds to 
meet the dynamic RAM's speed. For a megabit RAM, which 
needs 512 cycles for a complete refresh, all 512 cycles will be 
performed consecutively. The major disadvantage of the Burst 
refresh approach is that memory is "out of service"for a long time. 
The intermittent refresh scheme minimizes the out of service time 
by stealing single cycles between processor cycles. This cycle 
stealing is performed by a pre-defined arbitration mechanism for 
prioritization of different cycle requests. This design incorporates 
the arbitration in a manner which allows both intermittent and 
burst mode refreshes. 

Flexible Arbitration Scheme 

The arbitration in most designs involves prioritizing memory 
accesses for: read cycles, write cycles, read-modify-write cycles, 
and refresh cycles. 

The DRAM controller must allow both asynchronous memory and 
refresh cycle requests, it should decide whether a memory cycle 
is an access cycle or a refresh cycle. When refresh and processor 
accesses are requested simultaneously, the refresh is given 
priority. If a refresh cycle is in progress and a memory cycle 



request is made, then the memory access is inhibited until the 
refresh is complete. The memory request in this case is kept 
pending by asserting a processor wait signal. 

For a typical intermittent refresh scheme, the cycle requests are 
derived from the oscillator, which operates asynchronously with 
respect to the system clock. The arbiter grants a refresh request 
when no other memory cycle is either occurring or pending. If a 
memory cycle is in progress, the arbiter inhibits the refresh cycle 
until the current cycle is completed. However, it needs to 
rememberlhat a refresh request was made and grant that request 
on completion of the processor cycle. The PAL23S8's buried 
registers are very useful in performing such arbitration functions, 
even between asynchronous events. These registers can also be 
used for remembering refresh cycle requests and initiating ac- 
tions later for execution of the refresh cycles. 

However, some designs may require additional functions, such 
as prioritizing different processor requests. Some may require a 
specific kind of handshake mechanism between the processor 
and the DRAM controller. All such functions can be performed in 
the PAL sequencer by using more of chip resources such as I/O 
pins, buried registers and product terms. On the other hand some 
systems may not require functions such as read-modify-write 
cycles, in which case the PAL sequencer resources can be saved 
and used for other system functions. 

Flexible Handshake Protocol 

The PAL23S8 sequencer can provide a customizable interface to 
the system processor without requiring additional parts. The logic 
required for interface protocols for the APX family or the 68000 
family microprocessors can be very easily built using the internal 
resources of the PAL23S8. The design example below is for 
68020 processor interface signals. 

Processor Cycle Execution 

Different DRAMs can have different processor cycles. Some just 
support the basic read/write cycles, whereas others support 
extended functions such as page mode, ripple mode or static 
column mode cycles. Some DRAMs also support additional 
functions for extended applications such as the transfer cycle in 
VRAMs (DRAMs optimized for video applications). PAL devices 
allow user-definable functionality to be implemented in the de- 
sign, which optimizes resource utilization and improves system 
performance. Another advantage offered by such designs is the 
customizability of the design to the DRAM liming requirements, 
which can vary between -12, -15, and -20 parts significantly. All 
these timing functions can be easily implemented and modified 
using buried state registers. 

Our design implements the basic read, write, and read-modify- 
write cycles required in most designs. The timing required has 



Monolithic 



m 



Memories 



t\ 



2-187 



Customize a DHAM Controller Using Advanced PAL Devices 



been calculated for a -12 memory part. The processor cycle 
execution involves the following functions: 

• Address multiplexing 

• Timing and control signal generation 

Am29841 ten-bit buffers are used for address multiplexing. The 
multiplexing of the row or column address is underthe control of 
the DRAM controller. 

Generation of the timing and control signals RAS, CAS, WE and 
DACK is also the core function of the controller. The controller 
multiplexes the address and produces RAS, CAS, WE in a 
sequence that is understandable by the DRAMs. It also gener- 
ates a handshake signal "DACK" for the CPU, to indicate whether 
or not the memory is ready to be accessed. 

Refresh Cycle Execution 

DRAMs support different types of refresh cycles. Some DRAMs 
support CAS before RAS refresh. These DRAMs have the 
refresh address counter embedded along with the memory, thus 
eliminating the need for an external counter. Some designs may 
require "hidden refresh", or the standard RAS only refresh. 
Depending upon the system requirements, the designer can 
optimize the design and resource utilization. For this example, 
the PALdevice has been designed to execute a RAS only refresh. 
However, it can be easily modified to support CAS-before-RAS 
refresh, or hidden refresh also, if needed. 

Megabit DRAM Controller with Data 
Alignment and Dynamic Bus Sizing 

in many cases the currently available DRAM controllers do not 
support the esoteric requirements of different system designs. 
The designer is left to make a custom DRAM controller. The 
PAL23S8, with its embedded extra buried state registers, pro- 
vides an excellent tool for designing such a flexible, fast DRAM 
controller. It also offers the right speed for interfacing to the 
emerging 8 and 16 MHz processors, allowing superior synchro- 
nous designs with few or no wait states. 

Some ofthe emerging 32-bit microprocessors, such as the 68020 
and the 80386, support the concept of dynamic bus sizing and 
mis-aligned data transfers. Presently, there is no DRAM control- 
ler which utilizes the advantages offered by these two features 
effectively. 

Dynamic Data Bus Sizing 



Dynamic bus sizing allows the processor to handle variable width 
data buses (8, 1 6 or 32 bits) on a cycle-by-cycle basis. Dynamic 
bus sizing makes the size of the RAV port or channel transparent 
to the software designers. 

The 68020 allows operand transfers to or from 8, 16, and 32-bit 
ports by dynamically determining the port size during each bus 
cycle. Similarly, 80386 allows operand transfers to or from 1 6 and 
32-bit ports by determining the port size during each bus cycle. 
During an operand transfer, the slave device signals its port size 
and transfer status (complete or not) to the processor using the 



DSACK inputs for the 68020, and the Ready & BS1 6 signals for 
the 80386. 



DSACK1 


DSACKO 




H 
H 

L 

L 


H 
L 

H 

L 


Insert Wait States 
Complete Cycle 

Data bus port size 8 bits 
Complete Cycle 

Data bus port size 1 6 bits 
Complete Cycle 

Data bus port size 32 bits 



The design implemented shows a 68020 interface, but can be 
easily modified to accommodate the 80386. The 68020 always 
attempts to transfer the maximum possible amount of data (32 
bits) in a single cycle. If the port responds with asizesmallerthan 
32 bits, the processor repeats the cycles with the remaining bytes 
of data. For each cycle, the 68020 informs the slave of the number 
of bytes it is attempting to transfer by two signals SIZO, S1Z1 , as 
follows: 



SIZO 


SIZ1 


SIZE 




1 

1 




1 



1 




Byte 
Word 
3 Bytes 
Long Word 



Due to various considerations, memories with different data width 
can exist on the same system. For example, it is often preferable 
to have smaller external bus sizes, while retaining a 32-bit internal 
memory size on a board. Such instances are common when 
using DMA in microprocessor based systems. Varied memory 
widths are also required in graphics and image processing 
systems. PALdevices allow customizable dynamic memory data 
bus sizing based on various system requirements — such as 
address range (address signals) or jumper selectable static 
control for increasing memory width. 

Mis-Aligned Transfers 

Dynamic sizing is used in conjunction with address bitsAO and A1 
to also support mis-aligned data transfers; for example, a word 
write to a long word data bus size with an address offset of 1 byte 
(A0/A1=01). The 68020 and the 80386 contain internal bus 
multiplexers to route the data to the correct data lines. Both of 
these also generate extra cycles necessary to complete a mis- 
aligned transfer. However, external circuitry is still needed to 
generate the correct write enables and chip selects to support 
both these functions. PALdevices, however, can be customized 
to support both these functions easily. 



A1 


AO 


OFFSET 




1 
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Bytes 

1 Byte 

2 Bytes 

3 Bytes 
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By decoding AO and A1 , the two transfer data size signals SIZEO, 
SIZE1 , and the size of the port being addressed, PAL devices can 
easily support dynamic bus sizing and generate the appropriate 
chip select signals for the four banks of memory. 

The memory organization selected (Figure 1) allows access to 
individual data bytes underthe control of the CAS signal. The four 
banks of memory are addressed by the four RAS signals. 



The DRAM Controller 

The D RAM controller consists of two PAL23S8s: the first is called 
the Timing and Arbitration PAL device, and the second device is 
called the Data Sizing and Alignment PAL device (Figure 2). 



rA<;o CAS1 CAS2 CAS3 EACH BANK OF 

^"^^^ ■ ■ ■ 1 M X 32 BITS 



RASO- 
BANKO 



N.S.S.N 




y1_ 20 BIT ROW AND 
C COL ADDRESS TO 
N- ALL FOUR BANKS 



D31-D24 D23-D16 D15-D8 D7-D0 



Figure 1. Memory Organization 




♦• RAS0-RAS3 
♦■ CAS0-CAS3 



•- DACK0-DACK1 



Figure 2. A Megabit DRAM Controller 
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The Timing and Arbitration Controiier 

The timing and arbitration PAL device performs the following 
functions: 

• Arbitration between current cycle, Refresh cycle & CPU cycle 

• Read and Write Cycle Execution 

• Read-Modify -Write Cycle Execution 

• Refresh Cycle Execution 

• Interface Signaling 

The timing and arbitration PAL device arbitrates between the 
processor request fvlREQ signal and the refresh request RFCLK, 
giving the highest priority first to the current cycle, next to the 
refresh request and finally to the processor cycle. This allows 
support for both intermittent refresh and burst mode refresh. It 
also uses a mechanism to store the refresh requests when the 
processor cycle is in progress for servicing later. 

This device also generates the appropriate ROWADD and 
COLADD signals to multiplex the row and column addresses onto 
the memory address bus respectively. During refresh cycles it 
asserts the RFSH signal enabling the refresh row address onto 
the DRAMs. 

This PAL device executes the read and write cycles as well as 
read-modify-write cycles. The selection between these two is 
underthe control of the processor signal RMC. The timing of the 
different cycles is under the control of internal registers. 

The timing requirements of the different cycles are shown in 
Figure 3. The operation of the state machine to perform all these 
functions is also shown in Figure 4. This state diagram has been 
derived directly from the timing diagrams and the arbitration 
requirements of the design. This state machine is implemented 
by five of the six available buried state registers on the device, and 
is used mainly for timing and arbitration functions. The sixth 
buried state register implements a flag function and is repre- 
sented as a small independent state machine, as described 
below. The state description (operation) has been written in 
CURL software which generates the JEDEC file directly for 
programming the device. 

Arbitration Between Current Cycies, Refresli Cycie 
& CPU Cycie 

The request for a refresh is received by signal line RFCLK. The 
CPU requests a cycle by asserting MREQ line. If a cycle is in 
progress as indicated by states other than state 0, no arbitration 
for a fresh cycle is performed. In such a case, the PAL device just 
continues to execute the current cycle. This effectively implies 
that the current cycle is given the most priority and no other cycle 
is initiated until its completion. 

When the current cycle is complete (indicated by state 0) the PAL 
device arbitrates between the Mf^EQ signal and the buried 
register B5 flag. This buried register flag is used to latch the 
request for refresh (RFCLK) which is usually only one clock cycle 
duration. The operation of this flag is controlled by a second small 
state machine (two states only) in the same device. When RFCLK 
is high this state machine toggles to state 1 (B5=1 ) and stays in 
that state till the refresh cycle is executed (indicated by RFSH 
signal low), and then it toggles back to state (B5=0) preparing 
itself for the next refresh request RFCLK. 



For the first state machine, the arbitration begins on state 0. If 05 
is high, a refresh cycle is given priority and the state machine 
jumps to state 1 6 to execute a refresh cycle. If on the other hand 
there is no refresh request pending (indicated by the 05 flag being 
equal to 0) and MREQ is asserted, a processor cycle is started by 
jumping to state 1. If none of these conditions exists the state 
machine stays in state 0, polling for any of these events to occur. 
Such a scheme can support both burst mode and the intermittent 
refresh. 

Processor Read Write Cycie Execution 

When the processor cycle wins the arbitration, indicated by state 
1 , the timing and control signals required for the processor cycle 
are initiated. 

There are four memory banks in the design, each with a capacity 
of 4 Mbytes of memory organized as 1 M X 32 bits (Figure 1 ). The 
Timing & Arbitration PAL device generates the 4 RAS0-RAS3 
signals, one for each memory bank. The selection of one of these 
four RAS signals depends upon the two input signals ADSELO 
and ADSEL1 . In most systems these would constitute the high 
bits of the system processor address bus. 

After assertion of the appropriate RAS signal the state machine 
jumps to state 2 where it manipulates its address handling 
signals. In state 2 it removes the ROWADD signal and asserts the 
COLADD signal to allow assertion of the column address to the 
memory. At the same time it asserts another signal, SIG, which 
is used to inform the Data Sizing and Alignment PAL device of a 
processor cycle request, and allows the Data Sizing and 
Alignment PAL device to generate the appropriate CAS signal to 
the memory. 

The state machine then sequences to state 4, allowing for the 
data access time. In state 4 it decides between a read write cycle 
or a read-modify-write cycle, based on the processor signal RMC; 
it jumps either to state 5 for a read or write cycle or to state 8 (for 
read-modify-write cycle. For read or write cycle it then removes 
the RAS signals and waits for another clock period for the RAS 
precharge time before completing the cycle. Simultaneously it 
also removes COLADD signal and asserts ROWADD signal 
preparing the memory for the next processor cycle. The SIG 
signal to the data sizing and alignment PAL device is also 
removed. The DSACK signal (acknowledge to the processor) 
and the WE signals (for a write cycle) are generated if and when 
required by the Data Sizing and Alignment PAL device. 

Processor Read-iVIodify-Write Cycle Execution 

When the processor requests a read-modify-write cycle by as- 
serting the RMC signal, the Timing and Arbitration PAL device 
automatically increases the length of the cycle. It keeps the RAS 
asserted for the appropriate duration of a read-modify-write 
cycle. At the sametimetheDataSizing and AlignmentPALdevice 
generates the appropriate WE and DSACK signals. After this 
extended RAS assertion the Timing and Arbitration PAL device 
completes the cycle, similar to a read or write cycle. 

Refresh Cycle Execution 

When the arbitration is won by the refresh cycle, the timing and 
arbitration control PAL device executes a RAS-only refresh cycle. 
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Figure 3. The Timing Diagram 
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Figure 4. State Diagram for Timing and Arbitration PAL Dovico 
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On state 16 it removes the ROWADD signal and asserts the 
RFSH signal which applies the refresh row address to the 
memory address bus. In state 17 it generates all four 
RAS0-RAS3 signals, refreshing all the banks of the memory. It 
then counts for a few states (depending upon the refresh timing 
requirements of the memory) before removing the RAS and 
RFSH signals. On removal of the RFSH signal the external 1 0- 
bit refresh row address counter (PAL22V1 0) is also incremented 
for the next row. 

Interface Signaling 

One additional function performed by this PAL device is to inform 
the Data Sizing and Alignment PALdevice of the execution timing 
reference of a processor cycle. This it does by asserting /SIG 
signal low for all processor cycles. This signal is basically used 
as a synchronizing signal between these two PAL devices doing 
two independent functions. 

Data Sizing and Alignment PAL Device 

The DataSizingandAIignment PALdevice performs thefollowing 
functions: 

1 . CAS generation for dynamic sizing & alignment. 

2. DSACK generation for dynamic sizing & alignment. 

3. WE generation for write and read-modify-write cycles. 



Its state diagram is shown in Figure 5. This state machine is 
synchronized to the operation of the state machine of the Timing 
and Arbitration PALdevice by using signal SIG. This PALdevice 
has no function for a refresh cycle. For a processor cycle it asserts 
the appropriate CAS0-CAS3 and WE signals and generates the 
cycle complete signals DSACK0-DSACK1. The state machine 
functions are implemented in the four buried state registers on 
this device. 

CAS Generation 

The state machine starts at the initial polling state 0, and remains 
there until it receives the SIG signal from the timing and arbitration 
PAL device indicating the start of a processor cycle. The maxi- 
mum allowable data width for the 68020 processor is 32 bits. The 
data bus is split into four byte wide segments. The CAS signals 
forthesefourbyte segments are independently controlled, allow- 
ing independent byte-wide read and write capability. These four 
CAS signals allow read and write for low-low, low-middle, high- 
middle and high-high bytes of a 32-bit data port. For a 1 6-bit data 
port size, CAS 2-3 provide the low and high byte strobes, 
respectively. For an 8-bit port, only CAS3 is used. 

In state 1 the CAS signals are asserted. The CAS signals 
asserted depend upon the PORT size available, the size of data 
being transferred, andthealignmentofthetransfer. As explained 
before, the attempted transfer data size is encoded on two SIZO, 
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Figure 5. State Diagram of Data Sizing and Alignment PAL Device 
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SIZ1 bits by the processor, and the alignment is indicated by the 
least significant address bits AO, A1 of the processor. 

The port size is very application dependent. Different systems 
might require either static or dynamic selection of this port size. 
For example, the designer can define different address ranges 
where the port size may be different. Alternatively, jumpers can 
also define the port size. A general-purpose means for this has 
been selected, in the form of two input pins PRTO & PRT1 , which 
define the size of the port. 

Port Size 

The size of the data bus is decided based on the input pins PRT1 , 
PRTO. 



PRTO 


PRT1 


SIZE 


L 
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8 bits 


L 
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16 bits 


H 


L 


16 bits 


H 


H 


32 bits 



For 32-bit ports, all four CAS signals need to be asserted. For a 
16-bit port only two CAS signals need to be asserted. For an 
eight-bit port only CAS3 is asserted. As mentioned before, the 
processor always attempts to transfer the maximum possible 
amount of data. Its SIZO-1 signals encode the amount of data 
being transferred. The table in Figure 6 illustrates the relationship 
of the data size, port size and alignment which is decided by 
address bits A1, AO. 



The data from this table is directly used in the state description 
syntax to generate the CAS signals for state 1. The software 
compiles the Boolean equations for this logic directly. 

Once asserted, the CAS signals are held by the state machine 
until the appropriate read, write, or read-modify-write cycles are 
complete. 

DSACK Acknowledge to CPU 

The amount of data receivable by any device is dependent upon 
its port size. Based on the port size (PRTO-1 ) the encodings for 
DSACKO-1 are generated to acknowledge the CPU and inform it 
of the port size. The timing of acknowledge is synchronized by the 
state machine. For read and write cycles, the DSACK signals are 
generated in state 1 and removed on cycle completion in state 4. 

For a read-modify-write cycle, a complication exists. The read- 
modify-write cycle requires two DSACK signals; one for read and 
one for a write. The first DSACK is generated in state 1 and 
removed in either state 4 or state 5. The second is generated in 
state 8 and removed in state 11 . 

WE Generation 

In state 1 the WE is generated for normal write cycles. For read- 
modify-write cycles, a delayed WE signal is generated based 
upon the internal four-bit state counter in state 8 (Figure 3). 
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Figure 6. Data Bus Activity for Byte, Word and Long Word Ports 
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NAME 


MEGABIT DRAM TIMING & ARBITRATION CONTROLLER; 


PARTNO 


AmPAL 23S8 ; 


DATE 


03/24/86 ; 


REV 


01 ; 


DESIGNER 


KAPIL SHANKAR ; 


COMPANY 


ADVANCED MICRO DEVICES ; 


ASSEMBLY 


NONE ; 


LOCATION 


SUNNYVALE, CA ; 


DEVICE 


P23S8 ; 



NODE [B0,B1,B2,B3,BA,B5] ; 

/** Declarations and Intermediate Variable Definitions **/ 



/* */ 

/* V 

/* */ 

/******************************************************************/ 

/* Allowable Target Device Types: THE ONLY ONE */ 

/** Inputs **/ 



PIN 


2 


= MREQ , 




PIN 


3 


= ADSELO ; 


PIN 


4 


= ADSEL1 ; 


PIN 


5 


= RMC ; 


PIN 


6 


= RFCLK ; 


PIN 


7 


= INIT ; 


/** 


Outputs 


*•/ 


PIN 


19 


= ROUADD ; 


PIN 


18 


= IRASO ; 


PIN 


17 


= ICOLADO ; 


PIN 


16 


= IRAS1 




PIN 


15 


= IRAS2 




PIN 


14 


= IRFSH 




PIN 


13 


= IRAS3 




PIN 


12 


= !SIG ; 





FIELD STMA = [B0,B1,B2,B3,B4] 

FIELD STMB = [B5] ; 

FIELD BANK = [ADSEL1, ADSELO] 



BANKO = 


BANK : 


; 


BANK1 = 


BANK : 


1 ; 


BANK2 = 


BANK : 


2 ; 


BANK3 = 


BANK : 


3 ; 


(DEFINE 


SAO 


'b' 00000 


SDEFINE 


SA1 


'b'00001 


SDEFINE 


SA2 


■b'00010 


(DEFINE 


SA3 


'b'00011 


(DEFINE 


SA4 


'b'00100 


(DEFINE 


SA5 


'b'00101 


(DEFINE 


SA6 


'b'00110 


(DEFINE 


SA7 


'b'00111 


(DEFINE 


SA8 


'b'01000 


(DEFINE 


SA9 


'b'01001 


(DEFINE 


SA10 


'b'01010 


(DEFINE 


SA11 


'b'01011 


(DEFINE 


SA12 


'b'01100 


(DEFINE 


SA13 


'b'01101 


(DEFINE 


SA14 


'b'01110 


(DEFINE 


SA15 


'b'01111 


(DEFINE 


SA16 


'b'10000 


(DEFINE 


SA17 


'b'10001 


(DEFINE 


SA18 


'b'10010 


(DEFINE 


SA19 


'b'10011 


(DEFINE 


SA20 


'b'10100 


(DEFINE 


SA21 


'b'10101 


(DEFINE 


SA22 


'b'10110 



/** STATE MACHINE A **/ 
/»* STATE MACHINE B **/ 
/** MEMORY BANK ADDRESSED **/ 



/** STATE **/ 
/** STATE 1 **/ 
/** STATE 2 **/ 
/** STATE 3 **/ 
/** STATE 4 **/ 
/** STATE 5 **/ 
/** STATE 6 **/ 
/** STATE 7 **/ 
/** STATE 8 *•/ 
/** STATE 9 *•/ 
/** STATE 10 **/ 
/** STATE 11 *V 
/** STATE 12 **/ 
/** STATE 13 **/ 
/** STATE 14 **/ 
/** STATE 15 •*/ 
/** STATE 16 **/ 
/** STATE 17 **/ 
/** STATE 18 **/ 
/** STATE 19 **/ 
/** STATE 20 **/ 
/** STATE 21 **/ 
/** STATE 22 **/ 
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PRESENT SA2 



/** STATE **/ 
/** STATE 1 •*/ 



SOEFINE SBO 'b'O 
SDEFIHE SB1 'b'1 

/** Logic Equations **/ 

/** THE FIRST STATE MACHINE **/ 

SEQUENCE STMA <: 

/** POLLING FOR PROCESSOR OR REFRESH CYCLES **/ 

PRESENT SAO IF !B5 & IMREQ & BANKO NEXT SA1 OUT RASO 

IF !B5 & IMREQ & BANIC1 NEXT SAl OUT RAS1 

IF !B5 & IMREQ & BANIC2 NEXT SAl OUT RAS2 

IF IBS & IMREQ & BANK3 NEXT SAl OUT RAS3 

/** PROCESSOR CYCLE **/ 

/*• JUMP TO STATE SAl **/ 

/** ASSERT THE RAS FOR THE MEMORY BANK ADDRESSED ••/ 

IF B5 NEXT SA16 OUT ROUADD 

OUT RFSH ; 
/** REFRESH CYCLE **/ 
/** JUMP TO STATE SA16 •*/ 
/** REMOVE ROW ADDRESS AND SET UP REFRESH ADDRESS *•/ 



DEFAULT 
/*• ELSE POLL AGAIN **/ 
/** STAY IN STATE SAO **/ 



/** THIS IS THE PROCESSOR CYCLE **/ 



NEXT SAO 



PRESENT SAl 



IF BANKO 

IF BANK1 

IF BANK2 

IF BANIC3 



NEXT SA2 
NEXT SA2 
NEXT SA2 
NEXT SA2 
NEXT SA2 



OUT RASO 
OUT RAS1 
OUT RAS2 
OUT RAS3 
OUT ROUADD 
OUT COLADD 
OUT SIG ; 



/** REMOVE ROW ADDRESS AND SET UP COLUMN ADDRESS **/ 

/•* ASSERT SIG TO INFORM DATA SIZING & ALIGNMENT PAL OF PROCESSOR CYCLE **/ 



IF BANKO 

IF BANK1 

IF BANK2 

IF BANKS 



/** MEMORY ACCESS TIME DELAY **/ 



PRESENT SA3 



IF BANKO 

IF BANK1 

IF BANK2 

IF BANK3 



NEXT SA3 
NEXT SA3 
NEXT SA3 
NEXT SA3 
NEXT SA3 



NEXT SA4 
NEXT SA4 
NEXT SA4 
NEXT SA4 
NEXT SA4 



OUT RASO 
OUT RAS1 
OUT RAS2 
OUT RAS3 
OUT ROUADD 
OUT COLADD 
OUT SIG ; 



OUT RASO 
OUT RAS1 
OUT RAS2 
OUT RAS3 
OUT ROUADD 
OUT COLADD 
OUT SIG ; 



PRESENT SA4 IF BANKO & RMC NEXT SAS OUT IRASO 

IF BANK1 & RMC NEXT SAS OUT IRAS1 

IF BANK2 & RMC NEXT SAS OUT IRAS2 

IF BANK3 & RMC NEXT SAS OUT IRAS3 

IF RMC NEXT SAS OUT ! ROUADD 

OUT ! COLADD 

OUT SIG ; 

/** DISTINGUISH BETUEEN READ URITE OR READ MODIFY URITE CYCLES •*/ 

/•* IF RMC HIGH EXECUTE AS READ URITE CYCLE •*/ 

/** JUMP TO STATE SAS •*/ 

/** ALSO REMOVE COLUMN ADDRESS AND APPLY ROU ADDRESS **/ 

/** ALSO REMOVE THE RAS SIGNAL •*/ 



IF BANKO & IRMC 

IF BANK1 & IRMC 

IF BANK2 & IRMC 

IF BANKS & IRMC 

IF IRMC 



NEXT SAS 
NEXT SAS 
NEXT SAS 
NEXT SAS 
NEXT SAS 



OUT RASO 
OUT RAS1 
OUT RAS2 
OUT RAS3 
OUT I ROUADD 
OUT I COLADD 
OUT SIG ; 



/*• IF RMC LOW EXECUTE READ MODIFY URITE CYCLE **/ 

/** JUMP TO STATE SAS **/ 

/** ALSO REMOVE COLUMN ADDRESS AND ASSERT ROU ADDRESS **/ 
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D 



PRESENT SA5 

/** RAS PRECHARGE TIME DELAY **/ 



PRESENT SA6 



PRESENT SA7 

/** PROCESSOR READ WRITE CYCLE COMPLETE **/ 



NEXT SA6 OUT !SIG ; 

NEXT SA7 ; 
NEXT SAO ; 



/** THIS IS THE READ MODIFY WRITE CYCLE **/ 



PRESENT SA8 IF 


BANKO 


NEXT SA9 OUT RASO ; 


IF 


BANK1 


NEXT SA9 OUT RAS1 ; 


IF 


BANK2 


NEXT SA9 OUT RAS2 ; 


IF 


BANKS 


NEXT SA9 OUT RAS3 ; 
NEXT SA9 OUT SI G ; 


/** ACCESS TIME 


DELAY FOR BOTH READ AND WRITE •*/ 


PRESENT SA9 IF 


BANKO 


NEXT SA10 OUT RASO ; 


IF 


BANK1 


NEXT SA10 OUT RAS1 ; 


IF 


BAN)C2 


NEXT SA10 OUT RAS2 ; 


IF 


BANK3 


NEXT SA10 OUT RAS3 ; 
NEXT SA10 OUT SIG ; 


PRESENT SA10 IF 


BANKO 


NEXT SA11 OUT RASO ; 


IF 


BANK1 


NEXT SA11 OUT RAS1 ; 


IF 


BANK2 


NEXT SA11 OUT RAS2 ; 


IF 


BANKS 


NEXT SA11 OUT RASS ; 
NEXT SA11 OUT Sig ; 


PRESENT SA11 IF 


BANKO 


NEXT SA12 OUT RASO ; 


IF 


BANK1 


NEXT SA12 OUT RAS1 ; 


IF 


BANK2 


NEXT SA12 OUT RAS2 ; 


IF 


BANKS 


NEXT SA12 OUT RASS ; 
NEXT SA12 OUT SIG ; 


PRESENT SA12 IF 


BANKO 


NEXT SA13 OUT RASO ; 


IF 


BANK1 


NEXT SA1S OUT RAS1 ; 


IF 


BANK2 


NEXT SA1S OUT RAS2 ; 


IF 


BANKS 


NEXT SA13 OUT RASS ; 
NEXT SA1S OUT SIG ; 



/** THIS IS THE READ WRITE CYCLE **/ 

PRESENT SA13 IF BANKO 
IF BANK1 
IF BANK2 
IF BANKS 

/** REMOVE THE RAS SIGNAL **/ 

PRESENT SA14 

/*• RAS PRECHARGE TIME DELAY **/ 



NEXT SAU OUT IRASO 

NEXT SAU OUT IRAS1 

NEXT SAU OUT IRAS2 

NEXT SAU OUT IRASS 

NEXT SAU OUT SIG : 



NEXT SA15 OUT !SIG ; 



PRESENT SA15 NEXT SAO ; 

/** PROCESSOR READ MODIFY WRITE CYCLE COMPLETE **/ 



/** THIS IS THE MEMORY REFRESH CYCLE •*/ 



PRESENT SA16 



/*• ASSERT ALL FOUR RAS SIGNALS **/ 
PRESENT SA17 



/** REFRESH TIME DELAY **/ 
PRESENT SA18 



NEXT SA17 OUT RASO 

OUT RAS1 

OUT RAS2 

OUT RASS 

OUT ROWADD 

OUT RFSH ; 



NEXT SA18 OUT RASO 
OUT RAS1 
OUT RAS2 
OUT RASS 
OUT ROWADD 
OUT RFSH ; 



NEXT SA19 OUT RASO 
OUT RAS1 
OUT RAS2 
OUT RASS 
OUT ROWADD 
OUT RFSa ; 
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PRESENT SA19 NEXT SA20 OUT IRASO 

OUT IRAS1 

OUT IRAS2 

OUT IRAS3 

OUT IROWADD 

OUT IRFSH ; 

/** REMOVE ALL RAS SIGNALS **/ 

/** ALSO REMOVE REFRESH ADDRESS AND ASSERT ROW ADDRESS **/ 



PRESENT SA20 

/** RAS PRECHARGE TIME DELAY **/ 

PRESENT SA21 

PRESENT SA22 

/** TERMINATE MEMORY REFRESH CYCLE **/ 

/** GO BACK TO POLLING STATE **/ 

> 

/** THE FIRST STATE MACHINE COMPLETE **/ 

/** THE SECOND STATE MACHINE **/ 

SEQUENCE STMB { 

PRESENT SBO IF RFCLK 

DEFAULT 
/** JUMP TO STATE SB1 **/ 
/** REMEMBER REFRESH REQUEST **/ 

PRESENT SB1 IF RFSH 
DEFAULT 
/** JUMP TO STATE SBO **/ 
/** REFRESH CYCLE EXECUTED THUS RETURN **/ 



NEXT SA21 ; 

NEXT SA22 ; 
NEXT SAO ; 



NEXT SB1 ; 
NEXT SBO ; 



NEXT SBO ; 
NEXT SB1 ; 



/** EQUATIONS FOR SYSTEM INITIALIZATION AFTER RESET **/ 
[B0..B5].ar = UNIT ; 



RASO.AR = UNIT 

RAS1.AR = UNIT 

ROUADD.AR = !INIT 

COLADD.AR = UNIT 

RAS2.AR = UNIT 

RAS3.AR = UNIT 

RFSH.AR = UNIT 

SIG.AR = i.INIT 



/* THESE ARE THE OUTPUT ENABLES */ 



RASO.OE 


= 


'B'1 


RAS1.0E 


= 


'B'1 


ROWADD.OE 


= 


'B'1 


COLADD.OE 


= 


'B'1 


RAS2.0E 


= 


'B'1 


RAS3.0E 


= 


'B'1 


RFSH.OE 


= 


'B'1 


SIG.OE 


= 


'B'1 



/** THE SECOND STATE MACHINE COMPLETE **/ 
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NAME HEGABYT DRAM DATA SIZING & ALIGNMENT CONTROLLER; 

PARTNO AmPAL 23S8 ; 

DATE 03/24/86 ; 

REV 01 ; 

DESIGNER KAPIL SHANKAR ; 

COMPANY ADVANCED MICRO DEVICES ; 

ASSEMBLY NONE ; 

LOCATION SUNNYVALE, CA ; 

DEVICE P23S8 ; 

^**************************«***************************************y 
/* */ 

/* */ 

/* */ 

/******************************************************************/ 

/* Allowable Target Device Types: THE ONLY ONE */ 

^ik*****************************************************************^ 

/** Inputs **/ 



PIN 


2 


= 


SIG 


PIN 


3 


= 


AO ; 


PIN 


4 


= 


A1 ; 


PIN 


5 


= 


RW ; 


PIN 


6 


= 


RMC 


PIN 


7 


= 


SIZO 


PIN 


8 


= 


SIZ1 


PIN 


9 


= 


PRTO 


PIN 


11 


= 


PRT1 


PIN 


12 


= 


INIT 



/** Outputs **/ 



PIN 


19 


= 


!WE ; 


PIN 


18 


= 


ICAS2 ; 


PIN 


17 


= 


!CASO ; 


PIN 


16 


= 


ICAS3 ; 


PIN 


15 


= 


ICAS1 ; 


PIN 


14 


= 


IDSACK1 


PIN 


13 


= 


IDSACtCO 



NODE [B0,B1,B2,B3] ; 

/** Declarations and Intermediate Variable Definitions **/ 



FIELD STM = [B0,B1,B2,B3] ; 



FIELD PORTS I ZE = [PRT1, PRTO] 



P0RT8 = PORTSIZE : 
PORT 16 = PORTSIZE : 2 
P0RT32 = PORTSIZE : 3 



/** DEFINE THE STATES **/ 



$OEFINE 


SAO 


'b'OOOO 


SDEFINE 


SA1 


'b'0001 


$OEFINE 


SA2 


'b'OOlO 


$DEFINE 


SA3 


'b'0011 


SOEFINE 


SA4 


'b'0100 


$OEFINE 


SA5 


'b'0101 


$DEFINE 


SA6 


'b'0110 


SOEFINE 


SA7 


'b'0111 


$DEFINE 


SA8 


'b'1000 


(DEFINE 


SA9 


'b'1001 


$OEFINE 


SA10 


'b'1010 


$DEFINE 


SA11 


'b'1011 



/** THE STATE MACHINE **/ 



/** SIZE OF THE PORT **/ 

/** INTERFACED TO PROCESSOR **/ 

/** 8 BIT WIDE PORT **/ 

/** 16 BIT WIDE PORT **/ 

/** 32 BIT WIDE PORT **/ 



/** 


STATE 





**/ 


/** 


STATE 


1 


**/ 


/*• 


STATE 


2 


**/ 


/** 


STATE 


3 


**/ 


/** 


STATE 


4 


•*/ 


/** 


STATE 


5 


**/ 


/** 


STATE 


6 


**/ 


/** 


STATE 


7 


**/ 


/** 


STATE 


8 


•V 


/*• 


STATE 


9 


**/ 


/** 


STATE 


10 **/ 


/** 


STATE 


11 


**/ 



/** Logic Equations **/ 

/*• THE STATE MACHINE *•/ 

SEQUENCE STM { 

/** POLLING FOR SIG INDICATING A PROCESSOR CYCLE **/ 
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/** PORT DATA 

PRESENT SAO 

/** FOR PORTS I 2E 8 **/ 
IF !SIG & PORTS 

/** GENERATE ACKNOWLEDGE **/ 
IF !SIG & PORTS 



OFFSET STATE 



NEXT SA1 OUT CAS3 



NEXT SA1 OUT OSACKO ; 



/** FOR PORTSIZE 16 **/ 

IF !SIG & P0RT16 & !A0 NEXT SAl OUT CAS3 

IF !SIG & P0RT16 & ISIZO & SIZ1 & AO NEXT SA1 OUT CAS2 



/** GENERATE ACKNOWLEDGE **/ 
IF ISIG & P0RT16 



NEXT SAl OUT DSACK1 



/** FOR PORTSIZE 32 **/ 

/** BYTE TRANSFER SIZE **/ 

IF ISIG & PORT32 & IA1 & lAO NEXT SAl OUT CAS3 

IF ISIG & PORT32 & lAI & AO NEXT SAl OUT CAS2 

IF ISIG & PORT32 & IA1 & ISIZO NEXT SAl OUT CAS2 

IF ISIG & PORT32 & IA1 & SIZ1 NEXT SAl OUT CAS2 

IF ISIG & P0RT32 & A1 & lAO NEXT SAl OUT CASI 

IF ISIG & P0RT32 & IA1 & ISIZ1 & ISIZO NEXT SAl OUT CASI 

IF ISIG & PORT32 & IA1 & SIZ1 & SIZO NEXT SAl OUT CASI 

IF ISIG & PORT32 & IA1 & AO & ISIZO NEXT SA1 OUT CASI 

IF ISIG & PORT32 & A1 & AO NEXT SAl OUT CAS3 

IF ISIG & PORT32 & ISIZ1 & ISIZO NEXT SAl OUT CAS2 

IF ISIG & P0RT32 & AO & SIZ1 & SIZO NEXT SAl OUT CASI 



/** GENERATE ACKNOWLEDGE **/ 
IF ISIG & P0RT32 



NEXT SAl 



OUT DSACKO 
OUT DSACKl 



/** JUMP TO STATE SA1 **/ 

/** ASSERT THE CAS FOR THE RIGHT PORTSIZE DATASIZE AND ADDRESS OFFSET **/ 



/** ELSE WAIT FOR THE PROCESSOR CYCLE **/ 

/** POLL AGAIN •*/ 

/** STAY IN STATE SAO **/ 

/** THIS IS THE PROCESSOR CYCLE START **/ 



PRESENT SAl 
IF CASO 
IF CASI 
IF CAS2 
IF CAS3 
IF IRW & RHC 
IF PORTS 
IF P0RT16 
IF P0RT32 



NEXT SA2 
NEXT SA2 
NEXT SA2 
NEXT SA2 
NEXT SA2 
NEXT SA2 
NEXT SA2 
NEXT SA2 

NEXT SA2 



OUT CASO 
OUT CASI 
OUT CAS2 
OUT CAS3 
OUT WE ; 
OUT DSACKO 
OUT DSACKl 
OUT DSACKO 
OUT DSACKl 



/** HOLD CAS GENERATED IN THE LAST STATE **/ 

/** IF WRITE CYCLE AND NOT READ MODIFY WRITE CYCLE ASSERT WE *•/ 

/** JUMP TO STATE SA2 *•/ 



PRESENT SA2 
IF CASO 
IF CASI 
IF CAS2 
IF CAS3 
IF IRW & RHC 
IF PORTS 
IF P0RT16 
IF P0RT32 



NEXT SA3 OUT CASO 
NEXT SA3 OUT CASI 
NEXT SA3 OUT CAS2 
NEXT SA3 OUT CAS3 
NEXT SA3 OUT WE ; 
NEXT SA3 OUT DSACKO 
NEXT SA3 OUT DSACKl 
NEXT SA3 OUT DSACKO 
OUT DSACKl 
NEXT SA3 ; 



/** HOLD CAS GENERATED IN THE LAST STATE **/ 
/** HOLD WE FOR ACCESS DURATION **/ 
/** JUMP TO STATE SA3 **/ 



IF ISIG & IRW & RHC NEXT SAl 

/** GENERATE WE FOR READ WRITE CYCLE ONLY **/ 



OUT WE 



PRESENT SA3 
IF RHC 



NEXT SAO ; 
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NEXT SA4 OUT I CASO 
OUT I CASI 
OUT ICAS2 



OUT !CAS3 
OUT !WE 
OUT lOSACKO 
OUT IDSACK1 ; 

/** FOR READ WRITE CYCLE REMOVE CAS AND WE FOR CYCLE COHPLETION **/ 
/** ACKNOWLEDGE CYCLE COMPLETION BY ACKNOT **/ 
/** JUMP TO STATE SA4 **/ 



CONTINUE THE READ MODIFY WRITE CYCLE **/ 



PRESENT SA6 
IF CASO 
IF CAS1 
IF CAS2 
IF CAS3 



NEXT SA7 OUT CASO ; 
NEXT SA7 OUT CASI ; 
NEXT SA7 OUT CAS2 ; 
NEXT SA7 OUT CAS3 ; 
NEXT SA7 ; 



IF CASO & !RMC 

IF CASI & !RMC 

IF CAS2 & !RMC 

IF CAS3 & !RMC 

IF IRMC 



NEXT SA5 
NEXT SA5 
NEXT SA5 
NEXT SA5 
NEXT SA5 



OUT CASO 
OUT CASI 
OUT CAS2 
OUT CAS3 
OUT IDSACKO 
OUT I0SACK1 



/** CONTINUE THE READ MODIFY WRITE CYCLE 



/** FOR READ MODIFY WRITE CYCLE CONTINUE TO ASSERT CAS SIGNALS **/ 
/** ACKNOWLEDGE COMPLETION OF READ PORTION OF THE CYCLE BY ACKNOT **/ 
/** JUMP TO STATE SA5 **/ 



/** REMAINING PORTION OF THE READ WRITE CYCLE **/ 

PRESENT SA4 NEXT SAO ; 

/** GO BACK TO POLLING STATE SAO **/ 



PRESENT SA7 
IF CASO 
IF CASI 
IF CAS2 
IF CAS3 
IF PORTS 
IF P0RT16 
IF P0RT32 



NEXT SA8 
NEXT SA8 
NEXT SA8 
NEXT SA8 
NEXT SA8 
NEXT SA8 
NEXT SA8 

NEXT SA8 



OUT CASO ; 
OUT CASI ; 
OUT CAS2 ; 
OUT CAS3 ; 
OUT DSACKO 
OUT DSACK1 
OUT DSACKO 
OUT DSACKl 
OUT WE ; 



/** THIS IS THE WRITE PORTION OF THE READ MODIFY WRITE CYCLE **/ 
/** GENERATE THE WE SIGNAL FOR ENABLING WRITE **/ 

/** GENERATE THE CORRECT ACKNOWLEDGE FOR THE WRITE CYCLE PORTION ♦*/ 
/** CONTINUE TO ASSERT CAS **/ 



/*• COMPLETION OF READ WRITE CYCLE **/ 



/** REMAINING PORTION OF THE READ MODIFY WRITE CYCLE **/ 



PRESENT SA5 
IF CASO 
IF CASI 
IF CAS2 
IF CAS3 



NEXT SA6 
NEXT SA6 
NEXT SA6 
NEXT SA6 
NEXT SA6 ; 



OUT CASO 
OUT CASI 
OUT CAS2 
OUT CAS3 



PRESENT SA8 
IF CASO 
IF CAS1 
IF CAS2 
IF CAS3 
IF PORTS 
IF P0RT16 
IF P0RT32 



NEXT SA9 
NEXT SA9 
NEXT SA9 
NEXT SA9 
NEXT SA9 
NEXT SA9 
NEXT SA9 

NEXT SA9 



OUT CASO ; 
OUT CASI ; 
OUT CAS2 ; 
OUT CAS3 ; 
OUT DSACKO 
OUT DSACKl 
OUT DSACKO 
OUT DSACKl 
OUT WE ; 



/** THIS IS THE WRITE PORTION OF THE READ MODIFY WRITE CYCLE **/ 
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NEXT 


SA10 OUT CASO ; 


NEXT 


SA10 OUT CAS1 ; 


NEXT 


SA10 OUT CAS2 ; 


NEXT 


SA10 OUT CAS3 ; 


NEXT 


SA10 OUT DSACKO 


NEXT 


SA10 OUT DSACIC1 


NEXT 


SA10 OUT DSACKO 




OUT DSACKI 


NEXT 


SAID OUT WE ; 



PRESENT SA9 
IF CASO 
IF CAS1 
IF CAS2 
IF CAS3 
IF PORTS 
IF P0RT16 
IF PORT32 



/** THIS IS THE WRITE PORTION OF THE READ MODIFY WRITE CYCLE **/ 

PRESENT SA10 

NEXT SA11 OUT !CASO 

OUT ICAS1 

OUT ICAS2 

OUT ICAS3 

OUT ! DSACKO 

OUT JDSACKI 

OUT !WE ; 

/** COMPLETION OF THE WRITE PORTIOtJ OF THE READ MODIFY WRITE CYCLE **/ 
/** REMOVE ALL CAS WE AND ACKNOWLEDGE COMPLETION **/ 

/** REMAINING PORTION OF THE READ MODIFY WRITE CYCLE *V 

PRESENT SA11 NEXT SAO ; 

/** GO BACK TO POLLING STATE SAO **/ 

/*• COMPLETION OF THE READ MODIFY WRITE CYCLE *•/ 

> 

/** THE STATE MACHINE COMPLETE **/ 



/** EQUATIONS FOR SYSTEM INITIALIZATION AFTER RESET ••/ 
[B0..B3].ar = UNIT ; 



CASO.AR 


= UNIT 


CAS1 .AR 


= ilNIT 


DSACKO. AR 


= UNIT 


DSACK1 .AR 


= UNIT 


CAS2.AR 


= UNIT 


CAS3.AR 


= !INIT 


WE.AR 


= UNIT 



/* THESE ARE THE OUTPUT ENABLES */ 



CASO.OE 


= 


•B'l 


CAS1 .OE 


= 


'B'1 


DSACKO. OE 


= 


■B'l 


DSACK1 .OE 


= 


'B'l 


CAS2.0E 


= 


'B'1 


CAS3.0E 


= 


'B'l 


WE.OE 


= 


'B'1 



Figure 8. Source Code for Data Sizing and Alignment PAL Device (Cont'd.) 



8088 To Am2968 Interface 



Introduction 

This application note describes the implementation of a timing 
generator which interfaces between the 8088 and the Am2968 
Dynamic Memory Controller (DMC) using programmable logic 
and a delay line. The implementation does not account for any 
error detection and correction (EDC) circuitry. 

The timing generator, like the Am2970, is needed in memory 
systems utilizing a dynamic memory controller. It serves as an 
interface between the processor and the controller and gener- 
ates the necessary control signals for the controller. 

A dynamic memory controller, in brief, interfaces between a pro- 
cessor and the dynamic memory array It steers the appro priate 
addr ess in puts and Row and Column Address Strobes (RAS 
and CAS) required in the selected memory operations (i.e. 
refreshing, read/write). 

The following sections will introduce and illustrate the implemen- 
tation of the timing generator using 20-pin PAL devices, and also 
show its interface to AMD's Am2968 (DMC) and 8088 proces- 
sor. Figure 1 shows the block diagram of the interface. Program- 
mable logic was chosen because it is readily available, simple to 
use and reduces the number of devices required to implement 
specific functions. 

Interface Overview 

The implementation of this timing generator is not a general 
purpose application. It is dedicated specifically to a particular 
processor, the Intel 8088, and is limited to accessing only four 
banks of memory. The range of the four banks of memory is 



therefore 1 Mbyte — with 256K in each bank. This limitation is set 
by the fact that no additional decoding circuitry has been added. 
Decoding is done with the two address lines (A18-A19) of the 
8088, thus allowing only four banks to be selected. 

The clock to the 8088 and the two PAL devices is provided by 
the 8284A. The CLK signal from the 8284A is connected directly 
to the CLK input of the 8088 and also to CLK of RASER 
(PAL1 6R4). The PCLK of the 8284A connects to the PCLK input 
of HLDR (PAL16R6) to operate the 6-bit counter. Note that the 
clock operates on a 1/3 and 2/3 duty cycle. 

Description of PAL Device Function 

The function of RASER (PAL16R4) is to create RASI (Row Ad- 
dress Strobe) and CAS! (Column Address Strobe) at the appro- 
priate time. RASI is generated directly from the device, whereas 
CASI is generated from MRASI via a delay line. Figure 2 shows 
the timing delays between the controller and processor interface 
signals. The two modes of operation which the timing is focused 
on is Refresh w/o Scrubbing and Read/Write. By toggling the 
state of the mode control pin MC1 to either a LOW or HIGH 
respectively, and with MCO tied LOW (at the Am2968), the 
desired mode is generated. The mode control pin MC1 is 
generated base d on the state of the processor's interface pins: 
10/M, DT/R and SSO. The combination of these signals decodes 
the processor's current bus cycle and indicates the ongoing 
memory activity The MUX Select pin (MSEL) is also generated 
from the delay line. It determines whether a Row or Column 
address is sent to the memory address input based on the mode 
control inputs, MCO and MC1 . 
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Figure 2. Interface Timing 



A two-bit counter implemented internal to the (PAL16R4) 
RASER is used to keep track of the internal state of RASI during 
the Read/Write or Refresh operation s. In a normal memory 
Read/Write operation, the appropriate RASn outputs from the 
DMC will be activated in response to RASI going HIGH, as 
shown in the memory timing in Figure 3, during this time ALE 
initi ates th e cycle. In Refresh mode, receiving RASI will force all 
the RASn outputs of the DMC to go Low. FALE initiates the 
refresh cycle, since no ALE occurs during this time, as shown in 
refresh timing of Figure 3. The counter is also configured such 
that, during memory opera tions, sufficient time has been allotted 
within the cycle for RAS precharge (required in DRAMs) to 



b) HLDR(PAL16R6) 

The function of the {PAL16R6) HLDR is to generate the two 
signals— FALE (False Address Latch Enable) and HLD (Hold). 
FALE controls the internal latches (ALS and ALR) of the 
(PAL16R4) RASER and initiates the refresh cycle independent 
of the processor's ALE (Address Latch Enable) signal. 



The (PAL16R6) HLDR is essentially a 6-bit counter that gener- 
ates 64 T-Clock cycles for the 8088 processor. When a hold 
request (HLD) is made and a hold acknowledge (HLDA) is 
received from the processor, FALE will be generated for the 
(PAL16R4) RASER to initiate the refresh cycle. Refresh is per- 
formed every 8 jis which is double the required frequency. This 
time is derived in the Refresh Calculations section of this appli- 
cation note. The 8088 processor allows request only at the end 
of the bus cycle. A bus cycle duration is 4 T-Clocks, thus refresh 
is performed at cycles "60-63" as noted in the function table in 
the HLDR design specification. Figure 2 refresh timing shows 
this critical portion of the timing activity when RASI is generated 
for refresh. During CLK (0 - 0*), (this corresponds to PCLK count 
value 62), the processor acknowledge is received and RASI 
becomes active. While RASI is HIGH, the internal 2-bit counter 
will cycle through its count sequence (0* - 3*); at the end of the 
sequence as indicated by PCLK count value 63 (i.e., CLK 3*), 
RASI will become inactive. 
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Figure 3. Refresh Timing 



Refresh Timing Calculations 

The timing calculations are based on 8088 processor being in 
minimum mode (8 MHz) and a cell array of 128 rows. 

The 8088 clock period is 8 IVIHz, thus 

t = 1/f = 1/8 M = 125 ns. 

Refreshing for DRAMs are performed on 128 rows every 2 ms. 
The calculations for the above memory array is determined 
to be: 

Required refresh time = 2 ms/128 = 15.6 jjls. 

Thus the required time for 256 row cell array will be: 

Required refresh time = 4 ms/256 = 15.6 |xs. 



From the above calculations, refreshing needs to be performed 
at least every 15 |j.s. But for this application, memory is being 
refreshed every 8 |j,s as shown below: 

Refresh time = 64 T-clock cycles x 125 ns = 8 jjls. 

Delay Line Taps Computation 

The calculations for the assignment of the delay line taps is 
based on the parameters CASI and MSEL relative to RASI 
making a transition to the active state. See the timing in Fig- 
ure 4. 
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Figure 4. Delay Line Timing Diagram 
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MSEL may switch some time after T2. T2 is calculated to be as 
follows: 

T2 = tRAH + t24 = 20 + 1 1 = 31 ns. 

where T2 = delay time from RASI to MSEL 

tRAH = Row Address Hold Time o f the DRAM 
t24 = skew time for On to RASn of the DMC 

Thus, after 31 ns MSEL should initiate its active transition. 

CASI is calculated to make its change; T3 ns relative to RASI. 
The calculated value is as follows: 

T3 = T2 + t25 + tAsc = 31 + 33 + = 64. 

where T3 = delay from RASI to CASI 

t25 = skew time for On to CASn of the DMC 
Use = column setup time of the DRAM 



Thus, 64 ns after RASI initiates its transition, CASI may begin to 
go active. 

For simplicity, delay line taps may be assigned as follows: 



RASI MSEL 

RASI CASI - 



- 30 ns 
60 ns 



Counter and Mode Function Table 

The following function tables describe the activity of the 
RASI/MRASI during the speci fic m odes of the processor's inter- 
face signals (lO/M, DT/R and SSO) and the state of the internal 
counter which controls RASI/MRASI. Note: RASI will be active 
during the counter sequences between 1 to 3 as shown below in 
Figure 5. 
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COMMENTS 












1 3 
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Note: This design does not allow the user to insert Wait States 
during memory operations! Wait States may be inserted in 
interrupt or I/O cycles. 
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TITLE Ara2968 TO 8088 INTERFACE 

PATTERN RASER 

REVISION 01 

AUTHOR G. SPEARS 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/06/87 

CHIP RASER PAL16R4 

CLK RES ALE FALE lOM DTR SSO CLK2 NU2 GND 
TRI MRASI RASI /MCI NC /B /A /ALR /ALS VCC 



EQUATIONS 

ALS = FALE*/ALE*/ALR 
+ RES 

ALR = /A*B*/ALS 
+ A*/ALS 

/RASI = /A*/B*/RES 
+ RES*CLK2 

/MRASI = IOM*/DTR 
+ IOM*DTR 
+ /IOM*DTR*SS0 
+ /A*/B 

A := /A*/RES*/ALS 
+ /A*/RES*ALE 

B := A*/B*/RES*/ALS 
+ /A*B*/RES*/ALS 

MCI := IOM*/DTR*ALE*/RES 
+ DTR*SSO*ALE*/RES 
+ IOM*DTR*/SS0*ALE*/RES 
+ MC1*/ALE*/RES 

; SIMULATION NOT INCLUDED 



Figure 6. Source Listing for tiie 8088 to Am2968 interface (AmPAL16R4) 
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TITLE Am2968 TO 8088 INTERFACE 

PATTERN HLDR 

REVISION 01 

AUTHOR G. SPEARS 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/06/87 . 

CHIP HLDR PAL16R6 

PCLK RASI MRASI HLDA RES CLK MCI NU2 NU3 GND 
TRI /FALE /F /E /D /C /B /A HLD VCC 

EQUATIONS 

A := /A*/RES*MC1 

B := A*/B*/RES*MC1 
+ /A*B*/RES*MC1 

C := A*B*/C*/RES*MC1 
+ /B*C*/RES*MC1 
+ /A*C*/RES*MC1 

D := A*B*C*/D*/RES*MC1 
+ /C*D*/RES*MC1 
+ /B*D*/RES*MC1 
+ /A*D*/RES*MC1 

E := A*B*C*D*/E*/RES*MC1 
+ /D*E*/RES*MC1 
+ /A*E*/RES*MC1 
+ /B*E*/RES*MC1 
+ /C*E*/RES*MC1 

F := /A*F*/RES*MC1 
+ /C*F*/RES*MC1 
+ /B*F*/RES*MC1 
+ /D*F*/RES*MC1 
+ A*B*C*D*E*/F*/RES*MC1 
+ /E*F*/RES*MC1 

/HLD = /C + /D + /E + /F 

FALE = HLDA*CLK*RASI*/MRASI 

; SIMULATION NOT INCLUDED 



Figure 7. Source Listing for the 8088 to Am2968 Interface (AmPAL16R6) 
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Introduction 

This application note siiows how the general-purpose timing 
generator can be configured to interface between MC68000 and 
the Am2968. Figure 1 shows the block diagram of the interface. 
The correct interface signals must be adapted from the proces- 
sor to the TimGen (PAL16L8) and the Am 2968. Note , fro m Fig- 
ure 2, that AS (Address Strobe) replaces CYCREQ, and U PS 
(Upper_pata Strobe) and LDS (Lower Data Strobe) replace DS 
and B/W. 

Asserting the Address Strobe signal (AS), which is connect ed to 
the LE in put, latches the valid address into the Am2968. UDS 
and LDS are controls indicating the flow of data on the data bus 
(D00-D15) during memory read/write op eratio n s. By asserting 
the correct polarity on these two signals (UDS & LDS), either a 
byte or a word may be accessed. 

The following sections will show the timing requirements which 
are being considered — that is, the desired processor operating 
frequency and the appropriate DRAM. Timing diagrams are also 
included to show the status of the interface signals during 
read/write operations for various processor frequencies. 

MC68000 Timing Requirements 

in generating the hardware for the timing interface for the 
MC68000 to the Am2968, the overall system timing require- 
ments must be considered to guarantee that the desired 
memory access time can be met. 

The following general equation formulates the parameters which 
must be considered in generating the read cycle time for the 
68000 processor. The write cycle time may be similarly 
generated. 

tpEAD = 2tcYC + tcH ~ tCHSLx ~ ^dicl " 2tpAL " t2 - tLATCH 
+ NtcYC 



tpEAD = read cycle time 

'cYc = clock period of the processor 

tcH = clock width High of the processor 

tcHsu = clock High to AS, DS Low of the processor 

tpiCL = data in to clock Low (setup time) of the processor 

tpAL = programmable logic access time 

t2 = delay from RASI to RASn of the DMC 

^LATCH = delay through latch 

N = number of inserted Wait States. 

For simplicity, zero Wait States have been assumed in selecting 
the appropriate DRAMs. If Wait States had been asserted, the 
appropriate DRAM must be selected to meet the overall tREAD 
cycle time. Note that the access time of the DRAM (tAcc) f^ust 
be less than or equal to tREAD cycle time or else access to 
memory will be missed. The following evaluations show the cal- 
culated tAcc based on the various processor operating frequen- 
cies: 8 MHz, 10 MHz and 12.5 MHz. 
MC68000 W/WAIT STATES 

8 MHz 

tREAD = (250 + 55-60-15-30 
= (164 + 125N) ns 

DRAMtAcc = 150 ns 
10 MHz 



23-13 + N*125)ns 



tREAD = (200 + 45-55- 10 
= (114 + 100N) ns 

DRAM tAcc = 1 00 ns 
12.5 MHz 

tREAD = (160 + 35-55-10-30 
= (64 + 80N) ns 



30- 23 - 13 + N* 100) ns 



23 - 13 + N*80)ns 
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Figure 1. 68000 System Diagram 



CS 
fiS 
FR 



DTACK '*- 
SUP - 

rm — 
UDS — 
LDS — 



TIMGEN 
PAL16L8 



RASI (TO) 



DELAY °'' 
LINE 20x 



80x 
100x 



-•-RASI 
-•-MSEL 
-*-CASI 



Figure 2. Interface Diagram 
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Figure 3. Read and Write Cycle Timing Diagram 



For 12.5 MHz frequency, asserting at least one Wait State will 
allow tREAD = 144 ns, thus DRAM tAcc = 120 ns. 

Figure 3 shows the read and write cycle timing diagrams relative 



to the processor's cycle time. The 8 MHz and 10 MHz are shown 
on the same diagram because of their similarity Figure 4 shows 
the read and write timing for the processor operating at 12.5 
MHz with Wait States asserted during these operations. 
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TITLE GEN INTERFACE FOR 68000 

PATTERN TIMGEN 

REVISION 01 

AUTHOR LEE/YEE 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/06/87 

CHIP TIMGEN PAL16L8 

/CS /AS /FR /SUP RW T80 /UDS /LDS TlOO GND 
NC /DTACK TO FRH /RFRQ /RWRQ /WEL /WEH /OE VCC 

EQUATIONS 

/FRH = FR*/FRH + TO*RFRQ 

RWRQ = CS*AS*/RWRQ*/T100 + RWRQ*/T100 

RFRQ = FR*FRH*/RWRQ*/T100 + RFRQ*/T100 

/TO = /RWRQ*/RFRQ + T80 

DTACK = CS*RWRQ*/RFRQ*T80 

DTACK. TRST = CS*RWRQ*/RFRQ*T80 

WEH = /SUP*RWRQ*CS*/UDS*/RW 

WEL = /SUP*RWRQ*CS*/LDS*/RW 

OE = CS*RW*UDS + RW*CS*LDS 

; SIMULATION NOT INCLUDED 

Figure 5. Source Listing for the 68000 Interface 
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Introduction 

This application note shows the implementation of a general- 
purpose dual-port arbiter interfacing three processors, a 
Dynamic Memory Controller (Am2968A DMC), and dynamic 
memory. The arbiter, implemented with two programmable array 
logic PAL devices, and a delay line, can provide a simple and 
complete arbitration solution operating in asynchronous mode. 

The interface consists of two PAL devices (TIMGEN2A and 
T!MGEN2B), a timer, a delay line, and two multiplexers. Figure 1 
shows the block diagram of the system which includes the ar- 
biter logic implemented with the PAL devices. These two 
devices, TIMGEN2A and TIMGEN2B, provide the control sig- 
nals to the DMC as well as to the dynamic memory. The purpose 
of the arbiter is to mediate when two processors request for 



memory cycles, or when refresh cycles need to be performed. 
The first PAL device, TIMGEN2A, generates the signals for the 
DMC and the Address MUX; it also contains the arbitration logic. 
The second device, TIMGEN2B, generates the control signals 
for the memory and the data bus latches. 

The Am2968A Dynamic Memory Controller (DMC) interfaces 
between the processors and the dynamic memory array The 
DMC provides the required addresses either for the memory or 
the refresh cycles. For memory cycles, the addresses are 
generated by the processor and latched into the DMC. For 
refresh cycles, the addresses are generated by a 20-bit counter 
internal to the DMC. When the DMC receives the correct control 
signals, it generates the approp riate ad dres s (Qn) and the Row 
and Column Address Strobes (RASn and CASn) that are re- 
quired to perform memory read/write and refresh. 




CSA 
RDA 

B HENA 

5DP 
CSB 
RDB 
WHB 
B HENB — » 



D0OA-D15A D00B-D15B 



Figure 1. Block Diagram 
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Interface Overview 

The dual-port arbiter can be interfaced to various processors, 
using tlie proper control signals from the particular processor. 
Typically, t he processor interface signals are CS ( Chip Select), 
CYCREQ (Cycle Request), RD (Read), and WR (Write), see 
Figure 1 . 

The two PAL devices, TIMGEN2A and TIMGEN2B are designed 
to operate in the following modes: 

(1) generate proper control signals when cycle request is made 

(2) arbitrate between two processor requests 

(3) arbitrate between a refresh request and processor request 

(4) arbitrate between a refresh request and two cycle requests 



Requests by the processor are made when CS and CYCREQ 
are valid and the memory address is present. During dual-port 
arbitrati on, th e occurrence of sim ultaneous r equests, for In- 
stance, CSAJChip Select Port A), CYCREQA (Cycle Request 
Port A) and FR (Forced Refresh), will result in the granting of a 
refresh cycle and the appropriate address being generated 
accordingly by the DIVIC. 

TII\/!GEN2A generates the appropriate control signals to the 
data and address multiplexers, the delay line, and the memory 
controller. These control signals are: RASI (Row Address Strobe 
Input), SEL (Processor Address MUX Select), and MCI (Mode 
Control Input). RASI is generated for all read/write and refresh 
cycles. It is also used as the delay line input from which MSEL 
(MUX Select for the DMC) and CAS! (Column Address Strobe 
input) are generated. SEL allows the correct port address, A or 
B, to be the input to the DMC depending upon which port is 
acknowledged. MCI specifies the operating mode of the DMC, 
read/write or refresh (see Am2968A data sheet). 

The arbitration logic prioritizes and grants the requests for 
read/write cycles from either port, A or B, and refresh cycles. 
The Mode Table (Figure 2) shows the order of precedence. 
When simultaneous request for a read/write or refresh cycle 
occurs , the refresh cycle will be given priority The connection of 
RFRQ (Refresh Request) from TIMGEN2A to MCI of the DMC, 
as shown in the block diagram, implicitly gives refresh the 
precedence. When simultaneous read/write requests are made 
by processor A and processor B, processor A will be given 
priority over B. Ag ain, the o rder of precedence is set implicitly by 
the connection of RWRQA (Read/Write Request Port A) to the 
select input signal of the processor address MUX. 
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FR (refresh) 
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FR (refresh) 
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The second programmable logic device, TIMGEN2B generates 
theOE (Output Enable) signal for the data bus latches and the 
WE (Write Enable) sign al for memo r y. The inputs to TIMG EN2B 
are the co ntrol signals — CSA, CSB, RDA (Read A), RDB (Read 
B), WRA (Write A), and WRB (Write B). Signals such as BHEN 
(Byte High Enable) and AOO, the least significant bit of the ad- 
dress, have been included to show their function (if available). 
These two signals are provided by the processor and together 
they determine whether a word or byte transfer is to be per- 
formed during memory operations. 



6 555 3 

TIMER 



0.01 (iF 



Figure 3. Refresh Timer Diagram 



Refresh Timer and Calculations 

The refresh timer below shows one method of implementing an 
external refresh clock. This section is optional if an alternate 
source is used. The refresh timer, implemented with a 555 timer, 
needs to generate an active-LOW edge-triggered signal at least 
once every 15 ns. This_rneans that a refresh occurs when a 
LOWgoing edge on the FR signal is detected. The 15 jis value 
is derived from the dynamic memory refresh timing require- 
ments; refresh is normally required, over 256 rows, every 4 ms. 
This equates to a required refresh cycle every 15.6 p-s. In this 
application, the refresh timer generates FR approximately once 
every 9.8 |xs. Figure 3 shows the circuit for the forced refresh 
(FR) timer. The timer is a commercially-available 555 timer. The 
delay equation shown below must be solved for R1 , R2, and C 
values. This satisfies the refresh requirement of less than, or 
equal to, 15.6 |xs 

T = 0.693 (R1 + 2R2) C < 4 ms/256 = 15.6 |xs 
For the following values of R1 , R2 and C 
R1 = R2 = 10K and C = 470 pF 

the timer generates a refresh every 9.8 (i,s which allows some 
margin in the refresh requirement. The specified values for R1 , 
R2, and C allow for 30% total discrete component tolerance. 



Figure 2. Mode Select Table 
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TIMGEN2A 

The function of TIMGEN2A is to generate RASI (Row Address 
Strobe input) and arbitrate between processor and refresh 
cycles. Based on the arbitratio n, TiN/IG EN2A also generates 
RWRQ (Read/Write Request), or RFRQ (Refresh Request), that 
determines the type of cycle to be performed. TIMGEN2A can 
arbitrate between refresh and processor requests and also arbi- 
trate between Port A and Port B processor requests. 

When simultaneo us refr esh and processor requests are 
generated, refresh (RFRQ) will be give n priority. The priority is 
Implicit by design because the RFRQ output of TI MGEN2 A is 
connected to MCI of the Am2968A DMC. When RFRQ be- 
comes active (LOW), IV1C1 forces the Df\/IG into refresh mode. 



When simultaneous processor requests are made, RWRQA 
(Read/Write Request of Port A) will be granted before RWRQB 
(Read/Write Reque st of Port B). This priority is also implicit by 
design because the RWRQA output is c onnected to SEL of the 
processor's address multiplexer. When RWRQA becomes ac- 
tive, address from Port A will be selected for input to the Row 
and Column Address latches of the Am2968A, also, the data 
from Data Bus A will be allowed to flow to the inputs of the 
DRAMs. In either case, RASI must be generated to initiate any 
of the cycles. Once RASI is generated, the control signals, 
MSEL and CASI, will be generated from the delay line relative 
to RASI. 

TIMGEN2B 

The function of TIMGEN2B is to generate WE (Write Enable) for 
memory and also OE (Output Enable) to control the flow of data 
through the data bus latche s. Th e inputs to TIMG EN2B are, 
CSA, RDA (Read Stro be A) , WRA (Write Strobe A), CSB, RDB 
(Re ad Strobe B), and WRB (Write Strobe B). In ad dition, there 
are BHENA (Byte High Enable A) and BHENB (Byte High 
Enable B), AOOA (least significant address bit for Port A) and 
AOOB (least significant address bit for Port B). These signals are 
generated by the processor and are used to specify either byte 
or word transfers. 

The f ollow ing de scription outlines the operation of TIMGEN2B. If 
CSA and RDA become active (LOW), TIMGEN2B will cause 
OEA to become active (LOW), enabling the data bus latches 
and allowing data corresponding to address A to flow onto the 
system data bus (read operation). The identical procedure oc- 
curs for Port B when the control inputs for Port B is active. The 
block diagram shown in Figure 1 shows a multiplexer for the 
system data bus. This multiplexer controls the data flow into 
memory from either processor A or processor B , during w rite 
operations. The select to the MUX is controlled by RWRQA, the 
same signal that controls the select to the processor address 
MUX, allowing address Port A to implicitly have higher priority 
during process memory access. 

The following sections provide general information about 
Interfacing the arbiter to various major types of processors; they 
are: the iAPX-type, the AmZSOOO, and the MC68000-type 
processors. The PAL devices for the three types of processors 
will be identified as follows: TIMGEN2A and TIMGEN2B are for 
the iAPX-type; TIMGEN3A, TIMGEN3B for the AmZSOOO; and 
TIMGEN4A and TIMGEN4B are for the MC68000-type. When 
mixing different processor in an interface, some of the control 
signals mentioned for the particular groups may need to be 



modified to be applicable to the processor under consideration. 
The designer can tailor the design to meet specific needs with 
simple modification to the PAL device logic equations provided. 

Interfacing the iAPX-Type Processors 

The control signals requir ed f or TIMGEN2A are shown in Fig- 
ure 4. These signals are: CS, C YCREQ, F R and the delay line 
outputs (TAP1_-TAP5). CS and CYCREQ are provided by the 
processor. FR is the refresh request from the refresh timer. 
Sources to this input can be provided either by the timer 
described in this application note or other appropriate sources. 
TAP1-TAP5 are the timing delay outputs used to regulate many 
of the signal generation. The outputs gene rated from this PAL 
device are TAPO (RASI), XACKA, XACKB, and the grants for 
processor and refresh requests. 

The signals for TIMGEN2B are RD, WR, BIHE, and AOO. For 
byte/word transfers, the designer should consult the appropriate 
proce ssor data sheet. In general, this processor group uses 
BHE in conjunction with AOO to define the transfer function of the 
data bus. The outputs from this PAL device are: Write Enables, 
Latch Enables, and Output Enables. 

Interfacing the AmZSOOO 

The control signals required for this interface are shown in Fig- 
ure 5. For T IMGEN3A, the general controls are CS and 
CYCREQ , FR and the delay line outputs (TAPjN;TAP5). CS and 
CYCREQ are generated by the processor. FR is the refresh 
request from the refresh timer. TAP1-TAP5 are the delay line 
outputs used to regula te the s ig nal gen eration. The outputs from 
this PAL device are WAITA, WAITB, TAPO (RASI), and the 
grants for processor and refresh requests. 

The control signals for TIMGEN3B are R/W, DS, AOO and B/W 
For byte/word transfers, the designer should consult the proces- 
sor data sheet for correct signal generation. The outputs from 
this PAL device are: Write Enables, Latch Enables and Output 
Enables. 

Interfacing the MC68000-TVpe 
Processors 

The control signals required for this interface are shown in Fig- 
ure 6. For T IMGEN4A, the general controls are CS and 
CYCREQ , FR and the delay line outputs (TAP1-TAP5). CS and 
CYCREQ are generated by the processor. FR is the refresh 
request from the refresh timer. TAP1-TAP5 are the delay line 
outputs used to regulate signal generation. The o utputs 
generated from this PAL device are: DTACKA, DTACKB, TAPO 
(RASI) and the grants for processor and refresh request. 

The control signals for TIMGEN4B are R/W, UDS, and LDS. For 
byte/word transfers, the designer should consult the respective 
processor data sheet for correct signal generation. For example, 
to obtai n eith e r byt e or word transfers with the MC68000, three 
signals, UDS, LDS and R/W, must provide the correct levels to 
guarantee the correct data transfers, see MC68000 Data Bus 
Control Table in the User's Manual. The outputs generated from 
this PAL device are: Write Enables, Latch Enables, and Output 
Enables. 

The programmable logic device equations for all three interfaces 
are shown in Figures 7, 8, and 9. 
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TITLE INTERFACE TIMING PAL DEVICE #1 

PATTERN A3-249A 

REVISION 01 

AUTHOR J. ENGINEER 

COMPANY ADVANCED MICRO DEVICES 

DATE 12/15/87 

CHIP TIMGEN2A PAL16L8 

/CSA /CYREQA /CSB /CYREQB TAPl TAP2 TAP3 TAP4 TAPS GND 
/FR /XACKA /XACKB TAPO FRH /RFRQ /RWRQA /RWRQB NC VCC 

EQUATIONS 

/FRH = FR*/FRH + TAPO*RFRQ 

RFRQ = FR*FRH*/RWRQA*/RWRQB*/TAP5 + RFRQ*/TAP5 

RWRQA = CSA*CYREQA*/RFRQ*/RWRQB*/TAP5 + RWRQA*/TAP5 

RWRQB = CSB*CYREQB*/RFRQ*/RWRQA*/TAP5 + RWRQB*/TAP5 

/TAPO = /RFRQ*/RWRQA*/RWRQB + TAP4 ;PAL device for 8086,80186 

;and 80286 processors 
XACKA = CSA*RWRQA*/RFRQ*TAP4 
XACKA. TRST = CSA*RWRQA*/RFRQ*TAP4 

XACKB = CSB*RWRQB*/RFRQ*/RWRQA*TAP4 
XACKB. TRST = CSB*RWRQB*/RFRQ*/RWRQA*TAP4 



TITLE INTERFACE TIMING PAL DEVICE #2 

PATTERN A3-2 4 9B 

REVISION 01 

AUTHOR J. ENGINEER 

COMPANY ADVANCED MICRO DEVICES 

DATE 12/15/87 

CHIP TIMGEN2B PAL22V10 

/CSA /RDA /WRA /BHENA AOOA /BHENB /WRB /RDB /CSB NC TAPS GND 
AOOB /OEA /OEB /SUP /RFRQ /RWRQA /RWRQB /WEL /WEH LEB LEA VCC 
GLOBAL 

EQUATIONS ;PAL DEVICE FOR 8086 PROCESSORS 

WEH = /SUP*RWRQA*/RFRQ*CSA*WRA*BHENA 

+ /SUP*RWRQB*/RFRQ*/RWRQA*CSB*WRB*BHENB 

WEL = /SUP*RWRQA*/RFRQ*CSA*WRA*/AOOA 

+ /SUP*RWRQB*/RFRQ*/RWRQA*CSB*WRB*/AOOB 

OEA = CSA*RWRQA*/RFRQ*RDA 

OEB = CSB*RWRQB*/RFRQ*RDB 

LEA = CSA*RWRQA*/RFRQ*RDA*TAP3 

LEB = CSB*RWRQB*/RFRQ*RDB*TAP3 



Figure 7. PLPL Specification for the Example of Figure 4 



Monolithic 



ic Rfill Memories ^ 2-221 



General-Purpose Dual-Port Arbiter 



TITLE INTERFACE TIMING PAL DEVICE #1 

PATTERN A3-250A 

REVISION 01 

AUTHOR J. ENGINEER 

COMPANY ADVANCED MICRO DEVICES 

DATE 12/15/87 

CHIP TIMGEN3A PAL18P8 

/CSA /CYREQA /CSB /CYREQB TAPl TAP2 TAP3 TAP4 TAPS GND 
/FR /WAITA /WAITB TAPO FRH /RFRQ /RWRQA /RWRQB NC VCC 

EQUATIONS 

/FRH = FR*/FRH + TAPO*RFRQ 

RFRQ = FR*FRH*/RWRQA*/RWRQB*/TAP5 + RFRQ*/TAP5 

RWRQA = CSA*CYREQA*/RFRQ*/RWRQB*/TAP5 + RWRQA*/TAP5 

RWRQB = CSB*CYREQB*/RFRQ*/RWRQA*/TAP5 + RWRQB*/TAP5 

/TAPO = /RFRQ* /RWRQA* /RWRQB + TAP4 ;PAL device for Z8000 

/processor 
/WAITA = CSA*RWRQA*/RFRQ*TAP4 
WAITA. TRST = CSA*RWRQA*/RFRQ*TAP4 

/WAITB = CSB*RWRQB*/RFRQ*/RWRQA*TAP4 
WAITB. TRST = CSB *RWRQB* /RFRQ* /RWRQA* TAP 4 

TITLE INTERFACE TIMING PAL DEVICE #2 

PATTERN A3-250B 

REVISION 01 

AUTHOR J. ENGINEER 

COMPANY ADVANCED MICRO DEVICES 

DATE 12/15/87 

CHIP TIMGEN3B PAL22V10 

/CSA RWA /DSA BWA AOOA BWB /DSB RWB /CSB NC TAPS GND 
AOOB /OEA /OEB /SUP /RFRQ /RWRQA /RWRQB /WEL /WEH LEB LEA VCC 
GLOBAL 

EQUATIONS ;PAL DEVICE FOR Z8000 PROCESSOR 

WEH = /SUP*RWRQA*/RFRQ*CSA*/RWA*/BWA 

+ /SUP*RWRQA*/RFRQ*CSA*/RWA*BWA*AOOA 

+ /SUP*RWRQB*/i\FRQ*/RWRQA*CSB*/RWB*/BWB 

+ /SUP*RWRQB*/R:.'?.Q*/RWRQA*CSB*/RWB*BWB*AOOB 

WEL = /SUP*RWRQA*/RFRQ*CSA*/RWA*/BWA 

+ / SUP *RWRQA*/RFRQ*CSA*/RWA*BWA* /AOOA 

+ /SUP*RWRQB*/RFRQ*/RWRQA*CSB*/RWB*/BWB 

+ / SUP *RWRQB* /RFRQ* /RWRQA*CSB*/RWB*BWB* /AOOB 

OEA = CSA*RWRQA*/RFRQ*RWA*DSA 

OEB = CSB*RWRQB*/RFRQ*RWB*DSB 

LEA = CSA*RWRQA*/RFRQ*RWA*TAP3 

LEB = CSB*RWRQB*/RFRQ*RWB*TAP3 

Figure 8. PLPL Specification for tlie Example of Figure 5 
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TITLE INTERFACE TIMING PAL DEVICE #1 

PATTERN A3-251A 

REVISION 01 

AUTHOR J. ENGINEER 

COMPANY ADVANCED MICRO DEVICES 

DATE 12/15/87 

CHIP TIMGEN4A PAL16L8 

/CSA /CYREQA /CSB /CYREQB TAPl TAP2 TAP3 TAP4 TAPS GND 
/FR /DTACKA /DTACKB TAPO FRH /RFRQ /RWRQA /RWRQB NC VCC 

EQUATIONS 

/FRH = FR*/FRH + TAPO* RFRQ 

RFRQ = FR*FRH*/RWRQA*/RWRQB*/TAP5 + RFRQ*/TAP5 

RWRQA = CSA*CYREQA*/RFRQ*/RWRQB*/TAP5 + RWRQA*/TAP5 

RWRQB = CSB*CYREQB*/RFRQ*/RWRQA*/TAP5 + RWRQB*/TAP5 

/TAPO = /RFRQ*/RWRQA*/RWRQB + TAP4 ;PAL device for 68000 

/processor 
DTACKA = CSA*RWRQA*/RFRQ*TAP4 
DTACKA. TRST = CSA*RWRQA*/RFRQ*TAP4 

DTACKB = CSB*RWRQB*/RFRQ*/RWRQA*TAP4 
DTACKB. TRST = CSB*RWRQB*/RFRQ*/RWRQA*TAP4 



TITLE INTERFACE TIMING PAL DEVICE #2 

PATTERN A3-251B 

REVISION 01 

AUTHOR J. ENGINEER 

COMPANY ADVANCED MICRO DEVICES 

DATE 12/15/87 

CHIP TIMGEN4B PAL22V10 

/CSA RWA /UDSA /LDSA NC5 /LDSB /UDSB RWB /CSB NC TAPS GND 
NCIO /OEA /OEB /SUP /RFRQ /RWRQA /RWRQB /WEL /WEH LEB LEA VCC 
GLOBAL 

EQUATIONS ;PAL DEVICE FOR 68000 PROCESSOR 

WEH = /SUP*RWRQA*/RFRQ*CSA*/RWA*UDSA 

+ /SUP*RWRQB*/RFRQ*/RWRQA*CSB*/RWB*UDSB 

WEL = /SUP*RWRQA*/RFRQ*CSA*/RWA*LDSA 

+ /SUP*RWRQB*/RFRQ*/RWRQA*CSB*/RWB*LDSB 

OEA = CSA*RWRQA*/RFRQ*RWA*(UDSA + LDSA) 

OEB = CSB*RWRQB*/RFRQ*RWB*{UDSB + LDSB) 

LEA = CSA*RWRQA*/RFRQ*RWA*TAP3 

LEB = CSB*RWRQB*/RFRQ*RWB*TAP3 

FIguro 9. PLPL Specification for tho Example of Figure C 
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An example of a control path application for a PAL device is in a 
memory system. Most large memory systems use MOS 
dynamic RAMs. Their high density allows packing a large 
memory size irsto a small board area. Dynamic RAM prices also 
make them very cost effective. 

Dynamic RAMs require external logic for address multiplexing, 
timing generation and refresh control. This application note 
shows the use of a PAL16R8A and an Am2964B to provide the 
necessary external logic for a typical dynamic memory system. 
The PAL device is used as a state sequencer for timing genera- 
tion and the Am2964B provides specialized control circuitry and 
reduces timing skew between control signals. This implementa- 
tion replaces about 20 SSI/MSI packages. 



Design Requirements 

A system block diagram is shown in Figure 1 . The control bus 
provides most of the inputs to the PAL state se quencer . These 
include : Mem ory Request (MREQ), READ/WRITE (RW), 
RESET (RST), Refresh Clock (RFCK), and Read-Modify-Write 
(RMW). Two upper address lines of the address bus se rve as 
board selects (BSi , BSq), and one local signal, SLOW/FAST 
Memo ry (FAST ), allows use of either s low or fast memory. A 
READ/WRITE sequence is initialized by MREQ ANDed with the 
proper board select conditions and a refresh sequence Is in- 
itialized by RFCK. If both sequences are requested at the same 
time, a refresh sequence is performed. RW when HIGH selects 
a READ operation and when LOW selects a WRITE operation. 
RMW when HIGH selects a Read-Modify-Write cycle. 
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Figure 1. Dynamic Memory Controller 
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The outputs of the PAL device provide the timing and contro l 
inputs to the Am2964B. These are: Row Address Strobe (RAS), 
Addre ss Multiplexer Selec t (MS), Column Address Strobe 
(CAS), and Refresh (RFSjj). In addition, the PAL device pro- 
vides the Write Enable (WE) to the Memory Array. Figure 1 
shows the timing for fast R EAD/WRITE cycles. The memory 
cycle is initiated b y MR EQ going LOW. The PAL device re- 
sponds by bringing RAS LOW at to, followed by MS going LOW 
at ti , and finally bringing CAS LOW at ta. If RW is LOW, WE is 



also_brought LOW at ta- WE is held LOW until t4. RAS, MS and 
CAS are brought HIGH at ts. The rising edge of any of these 
three signals may be used to latch output data during a Read 
operation. The state sequencer then disabled for three states to 
allow for memory precharge. 

By holding the FAST input LOW, an extended memory cycle is 
available to accommodate slower RAMs. The timing appears in 
Figure 3. 
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Figure 2. Fast READ/WRITE Cycle 
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Figure 3. Extended Memory Cycle 
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RAS-Only refresh cycle timing is shown in Figure 4. The refresh 
cycle is initiated when RFC K go es HIGH. The RFSH output 
goes LOW at to, followed by RASat ti . The Am2964B supplied 
the necessary refresh address. RAS is brought back HIGH atts 
and precharge is then timed out. An extended refresh cycle for 
slower memory is available also. Burst refresh can be accom- 
plished by leaving RFCK HIGH for as many refresh cycles as 
desired. 



Read-Modify-Write cycle timing is activated by setting RMW 
HIGH. This is especially valuable in systems with Error Detec- 
tion/Correction (EDC) capability Data can be read, modified by 
the EDC circuitry (Am2960), and if necessary, written back to 
memory in a single memory cycle. Read-Modify-Write cycle 
timing is shown in Figure 5. Note that WE goes LOW at the end 
of the cycle. 
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Figure 4. RAS-Only Refresh Cycle 
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Figure 5. Read-Modify-Write Cycle 



2-226 



M 



MonolUMo imrUJ Memories 



Dynamic Memory Control State Sequencer 



Design Approach 

The first step in the state sequencer design process is to define 
the timing waveforms for all of the functions desired. Figures 2, 
3, 4, and 5 are the result. Next, characteristics of the resulting 
wa veforms are examined. In itially, t he sequencer i s wait ing on 
the MREQ or RFCK input. If MREQ goes LOW, the R AS to M S 
to CA S sequence is initiated. If RFCK goes HIGH, the RFSH to 
RAS sequence is initiated. Both sequences are equivalent to a 
simple "shift" function. Once the shift sequence is completed 
and the signals are asserted, they must stay asserted for a 
specific time depending on the selected function. To time the 



length that signals must stay asserted requires a "counting" 
function. The precharge sequence at the end of all cycles also 
requires "counting". This partitions most of the design into two 
smaller functional blocks; a shifter and a counter. The remaining 
function select and control logic is partitioned into a "multiplexer- 
like" functional block. Figure 6 shows the PAL device partitioned 
into functional blocks. By dividing the design into blocks, its 
implementation becomes simple. 

Figure 7 shows PALdevice equations for the PAL1 6R8A dynamic 
memory state sequencer. 
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TITLE DYNAMIC MEMORY STATE SEQUENCER 

PATTERN AMP3-285 

REVISION 01 

NAME JOE ENGINEER 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/25/87 

CHIP MEMORYl PAL16R8 

CK RFCK /RST RW /MREQ RMW FAST BSl BSO GND 
/E /QO /Ql /Q2 /RFSH /WE /CAS MS /RAS VCC 



EQUATIONS 

QO := /RST*/MS*/O0 

+ /RST*RFSH*RAS*/QO 

+ /RST*/FAST*/Q0*Q2 

+ /RST*/FAST*/Q0*Q1 

+ /RST*/FAST* Q1*Q2 

+ /RST* FAST*/RMW*Q0*/Q1 

+ /RST* FAST*/RMW*Q0*/Q2 

Ql := /RST*RAS*/Q0* Ql 
+ /RST*RAS* Q0*/Q1 
+ /RST*RAS* QO* Ql 
+ /RST*RAS*/QO* Q2 

Q2 :^ /RST*RAS* Q2 
+ /RST* Q0*Q2 
+ /RST*RAS*QO* Ql 

RFSH := /RST*RFCK*/Q2*/Q1*/Q0*/RAS 
+ /RST*RFSH*RAS 
+ /RST*RFSH*/FAST* Ql 
+ /RST*RFSH* Q2 

WE := /RST*/RW*/MS*/RFSH*/RMW*/Q0*/Q2 
+ /RST*/RW*/MS*/RFSH*/RMW*/Q1*/Q2 
+ /RST*/RW*/MS*/RFSH* RMW*/Q0* Ql *Q2 
+ /RST*/RW*/MS*/RFSH* RMW* Q0*/Q1 *Q2 

CAS := /RST*/RPSH*/MS*/QO 
+ /RST*/RFSH*/MS*/Q1 
+ /RST*/RFSH*/MS*/Q2 

/MS := /RST*/RFSH*RAS*/QO 
+ /RST*/RFSH*RAS*/Q1 
+ /RST*/RFSH*RAS*/Q2 



RAS := /RST*/RFCK*/Q0*/Q1*/Q2*MREQ*/BS1*/BS0 
+ /RST*/RFSH*/Q0*/Q1*/Q2*MREQ*/BS1*/BS0 
+ /RST* RFSH*/Q0*/Q1*/Q2 
+ /RST*RAS*/QO 
+ /RST*RAS*/Q1 
+ /RST*RAS*/Q2 



Figure 7. Source Listing for Dynamic iVlemory Control State Sequencer 
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Single bit error detection and correction for an 8-bit data word 
requires 4 checl< bits, mailing a 12-bit code word. The simplest 
code to design is a 12-bit Hamming code. To arrive at the code, 
we set up the following matrix: 

B7 B6 B5 84 83 82 81 80 I C3 C2 CI CO 

S3 X X X X I X 

S2 X XXX I X 

SI XX XX X I X 

SO X X X X X I X 



The vertical columns are in a counting pattern, excluding 
the single bit values of 8, 4, 2, 1. The single bit values are 
assigned to the check bits C3-C0. By reading horizontally 
across the rows of the matrix, we get the check equations 
by exclusive OR'ing the data bits with X's in that row and 
equating that to the check bit with an X in that row: 

C3 = 87 ® 86 ® 85 ® 84 

C2 = 87 ® 83 ® 82 ® 81 

CI = 86 ® 85 ® 83 ® 82 ® 80 

CO = 86 ® 84 ® 83 ® 81 ® 80 



The Hamming matrix and resultant sets of check bit and 
syndrome equations are selected so that when a single bit error 
occurs, the syndrome gives the position of that bit (either data 
or check) in the 12-bit message format. 



Example B7 

random data word: 1 
check bits: 



10 111 



BO 





C3 = 87 ® 86 ® 85 ® 84 

= 1 ® 1 ® ® 1 =1 

C2 = 87 ® 83 ® 82 ® 81 

= 1 ® 1 ® 1 ® = 1 

CI = B6 ® 85 ® 83 ® 82 ® 80 

= 1®0®1®1®0 = 1 

CO = 86 ® B4 ® 83 ® 81 ® 80 

= 1®1®1®0®0 = 1 

message: 

M12M11 M10 M9 M8 M7 M6 M5 M4 M3 M2 Ml 
110111101011 
87 86 85 84 C3 83 82 81 C2 80 CI CO 



The check bits are stored along with the data bits in the 
following message format: 

M12 Mil M10 M9 m m m M5 m m M2M1i 
87 86 85 84 C3 83 82 81 C2 80 CI CO 

When a read occurs, the check bits are recalculated and 
compared with the stored bits to generate the 4 bit syndrome: 

S3 = 87 ® 86 ® 85 ® 84 ® C3 

S2 = 87 ® 83 ® 82 ® 81 ® C2 

SI = 86 ® 85 ® 83 ® 82 ® 80 ® CI 

SO = 86 84 ® 83 ® 81 ® 80 ® CO 



The 4 sysdrome bits indicate the location of any single bit errors 
in the 12-bit message format which may than be corrected by 
inversion. 



The Hamming code works by introducing enough other code 
words to create a difference of exactly 3 bits between legal code 
words. All other code words are illegal. If, in storage, one bit 
flips, the result is an illegal word. In addition, there is only one 
word in the set of legal code words from which it could have 
come, hence the correction. 
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assume no error: 

S3 = B7 ® B6 ® B5 ® B4 ® C3 

= 1 ® 1 ® ® 1 ® 1 =0 

S2 = B7 ® B3 ® B2 ® B1 ® C2 

= 1®1®1®0®1 = 

SI = B6 ® B5 ® B3 ® B2 ® BO ® CI 

= 1 ® ® 1 ® 1 ® ® 1 

SO = B6 ,® B4 ® B3 ® B1 ® BO ® CO 

= 1 ® 1 ® 1 ® ® ® 1 



1 


= 


1 




1 


= 


1 







® 


1 


= 





® 


1 


= 



syndrome 0000 indicates no error 
assume data bit B7 flips (1 — 0): 

S3 = ® 1 ® ® 1 

S2 = ® 1 ® 1 ® 

SI = 1 ® ® 1 ® 1 

SO = 1 ® 1 ® 1 ® 
syndrome 1100 indicates Ml 2 or B7 is in error 
assume check bit 03 flips (1 - 0): 

S3=1®1®0®1®0 = 1 

S2=i®iei®o®i=o 

S1=1®0®1®1®0®1 = 
S0= 1®1®1 ®0®0®1 = 
syndrome 1000 indicates M8 or C3 is in error 
Wiiile the checl< bits can be generated with a 256x4 PROM if 
the tolerances are loose, high performance systems will need 
to latch or register the check bits to meet cycle time require- 
ments. Similarly, the syndrome bits can be generated with a 
4096x4 PROM but in both cases the PALI 6X4 is the high 
performance choice. 

The worst case equations are SI and SO with a 6 term exclusive 
OR. We use 2 properties of the exclusive OR to fit the equations 
into the 16X4: 

1. Associativity 

A® B® C = ( A® B )® C 

2. A ® B ® C = ABC + ABC + ABC + ABC 

Since a 3 term exclusive OR can be realized with a 4 product 
sum in sum of products form, a 6 term exclusive OR can be 
realized by exclusive OR of 2 4 product sums. This is exactly 
the 16X4 configuration. 

Check Bit Equations 

C3 = B7 ® 86 ® B5 ® B4 

= (B7 • B6 + §7 • 86) ® (B5 • §4 + B5 • B4) 

02 = 87 ® 83 ® 82 ® 81 

= (87 • 83 + 87 • 83) ® (82 • 81 + 82 • 81) 

01 = 86 ® 85 ® 83 ® 82 ® 80 

= (86 • 85 • 83 + 86 • B5 • 83 + 86 • 85 • 83 + 86 • 85 • B3) 
® (82 • 80 + 82 • BO ) 

CO = 86 ® 84 ® B3 ® 81 ® 80 

= (86 • 84 • 83 + 86 • P • 83 + 86 • 84 • 83 + 86 • 84 • 83) 
®(81 • 80+ B1 • 80) 



Syndrome Bit Equations 

S3= 87® 86® B5® 84® 03 

= (87 • 86 • B5 + §7 • B6 • 85 + 87 • 86 • 85 + 87 • §6- §5) 
® (B4 • C3 + §4 • 03) 

S2 = 87 ® 83 ® 82 ® 81 ® 02 

= (87 • 83 • 82 + B7 • 83 • 82 + §7 • 83 • B2 + 87 • 83 • §2) 
® (81 • 02 + bT • 02) 

SI = 86 ® B5 83 ® 82 ® 80 ® 01 

= (86 • 85 • 83 + 86 • 85 • 83 + B6 • 85 • 83 + 86 • §5 • B3) 
® (§2 • BO • 01 + 82 • 80 • 01 + 82 • BO • 01 + 82 • BO) 

SO = B6 ® 84 83 ® 81 ® 80 ® CO 

= (86 • 84 • 83 + B6 • B4 • 83 + B6 • 84 • B3 + 86 • 84 • 83) 
® (81 • 80 • CO + bT • BO • CO + bT • 80 • CO + 81 • 80 • CO) 



The error correction block decodes the 4 syndrome bits, and, if 
they are not 0000, inverts the indicated bit in the message 
format. The equations: 



M12(=B70) = S3-S2 

= S3-S2 

= S3 • S2 

• 87 + SO 
Mil (=860) = S3 -§2 

•86 -SO 
M10(=85C) = S3-S2 

•85+ SO 
M9 (=B4C) = S3^S2 

•84+SO 
M8 (=03C) = S3^S2 

•03 + SO 
M7 (=B30) = S3^S2 

•83+SO 
M6 (=820) = S3^S2 

•82+ SO' 
M5 (=810) = S3 -52 

•81 + SO 
M4 (=02C) = S3^S2 

•02+ SO 
M3 (=800)= S3 -82 

•80+ SO 
M2 (=C1C) = S3^S2 

•01 + SO 
Ml (=C0C) = S3^S2 

• CO-SO 
ERROR =S3^S2 



SI 



50®87 
SO ~ 
SO 



•87+ (S3 


S2 • SI • 


• 87 + S3 


87+ S2- 


•B6^ S3 


86 • S2 • 


•B5+ S3 


85 + S2^ 


•P+ S3 


B4+S2- 


•^+ S3 


03+ S2^ 


•B3+ S3 


B3+S2« 


•B2+ S3 


82 + S2^ 


•B1+ S3 


B1 + S2^ 


•02+ S3 


•C2 + S2^ 


•80+ S3 


•80+S2^ 


•01+ S3 


•C1 + S2.- 


•C0+ S3 


•00+ S2^ 



SO) • 87 
87 +S1 

86 +ST 

85 +S1 

84 +S1 

03 +S1 

B3 +ST 

82 +ST 

B1 +S1 

02 +S1 

80 +ST 

01 +si 

CO +S1 



SO 



ERROR is an active high error indicator available for error 
logging along with the 4 syndrome bits. 

To use 2 PAL16L8's for the error correction block, we need 
only invert the message bits to get active true outputs. 
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Description 

This PAL device generates 4 checl< bits in a 12 bit hamming code word to provide error detection 
and correction on an 8 bit data word. 



PAL16X4 

CHKCLK [T - — >— 1 2o| VCC 
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Description 

This PAL device generates the syndrome bits for a 12 bit hamming code word as a function of the 
8 data bits and the 4 check bits to point to any single bit in error. 



PAL16X4 
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Description 

This PAL device performs error correction of bits B2-B7 based on tfie 4 bit error syndrome S0-S3. 



PAL16L8 
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Description 

This PAL device performs error correction of bits B0-B1 and checks bits C0-C3 based on the 4 bit 
error syndrome S0-S3. 



SYNDROME 
BITS 



PAL16L8 

53 [T 1 20j vcc 



sIjT- 

S1^ - 

so [7 

Bl[T 
BO [7 

^lU 

C2[7 

cT[T- 

gnd[io 




AND 

OR 

GATE 

ARRAY 



^;^-j— Tt] C3C 
— B>»-| — ]<1 C2C 



—^>^ — js] C1C 



2 LSB ERROR 
CORRECTED BITS 



CORRECTED 
CHECK 
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PAL16X4 PAL DESIGN SPECIFICATION 

CBG B. BRAFMAN 02/16/81 

CHECK BIT GENERATOR 

MMI FIELD APPLICATIONS ENGINEER YORBA LINDA, CALIFORNIA 

CHKCLK B7 B6 NC NC NC NC B5 B4 GND 

/OC B3 B2 /CO /CI /C2 /C3 Bl BO VCC 



C3 


S B 


B7*/B6 




+ 


/B7* B6 




+ 


GND 




+ 


GND 




; + t 


B5*/B4 




+ 


/B5* B4 


C2 


.s 


B7*/B3 




+ 


/B7* B3 




+ 


GND 




+ 


GND 




; + ! 


B2*/B1 




+ 


/B2* Bl 


CI 


.B 


B6* B5* B3 




+ 


/B6*/B5* B3 




+ 


/B6* B5*/B3 




+ 


B6*/B5*/B3 




: + : 


B2*/B0 




+ 


/B2* BO 


CO 


: = 


B6* B4* B3 




+ 


/B6*/B4* B3 




+ 


/B6* B4*/B3 




+ 


B6*/B4*/B3 




: + : 


B1*/B0 




+ 


/Bl* BO 



;B7 :+: B6 

; : + : 

;D0 NOT BLOW THIS PRODUCT LINE 
;D0 NOT BLOW THIS PRODUCT LINE 
;B5 :+: B4 



;B7 :+: B3 

: + ; 
;D0 NOT BLOW THIS PRODUCT LINE 
;D0 NOT BLOW THIS PRODUCT LINE 
;B2 :+: Bl 



;B6 :+: B5 :+: B3 

;B2 :+: BO 

;B6 :+: B4 :+: B3 

: + : 

;B1 :+: BO 
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PAL16X4 

SBG 

SYNDROME BIT GENERATOR 

MMI FIELD APPLICATIONS ENGINEER YORBA LINDA, CALIFORNIA 

SYNCLK D7 D6 BO Bl B2 B3 D5 D4 GND 

/OC D3 D2 A3 A2 Al AO Dl DO VCC 



PAL DESIGN SPECIFICATION 
B. BRAFMAN 03/13/81 



IN THE ABOVE PIN LIST, THE FOLLOWING SUBSTITUTIONS HAVE BEEN 
MADE TO ACCOMODATE THE SPECIFIC FORMAT (FIXED SYMBOLS) FOR THE 
ARITHEMETIC PAL DEVICES IN PALASM SOFTWARE : 



D7 


MEANS 


B7 


BO 


MEANS /C3 


(CHECK BIT 3) 




D6 


MEANS 


B6 


Bl 


MEANS /C2 


(CHECK BIT 2) 




D5 


MEANS 


B5 


B2 


MEANS /CI 


(CHECK BIT 1) 




D4 


MEANS 


B4 


B3 


MEANS /CO 


(CHECK BIT 0) 




D3 


MEANS 


B3 


AO 


MEANS /S3 


(SYNDROME BIT 


3) 


D2 


MEANS 


B2 


Al 


MEANS /S2 


(SYNDROME BIT 


2) 


Dl 


MEANS 


Bl 


A2 


MEANS /SI 


(SYNDROME BIT 


1) 


DO 


MEANS 


BO 


A3 


MEANS /SO 


(SYNDROME BIT 


0) 



B0-B7 ARE THE BITS OF THE DATA WORD. 

THE SUBSTITUTIONS APPLY BELOW WITH THE EXCEPTION OF COMMENTS. 



/AO. 


•3 


D7* D6* D5 


;B7 


: + : 


B6 


; + : 


B5 




+ 


/D7*/D6* D5 














+ 


/D7* D6*/D5 


; 


; + : 










+ 


D7*/D6*/D5 














: + : 


D4*(/B0) 


;B4 


: + : 


C3 








+ 


/D4*( BO) 












/Al 


+ 
+ 
+ 


D7* D3* D2 
/D7*/D3* D2 
/D7* D3*/D2 

D7*/D3*/D2 


;B7 


: + : 
: + : 


B3 


: + : 


B2 




: + : 


D1*(/B1) 


;Bl 


: + ; 


C2 








+ 


/Dl*( Bl) 












/A2 


+ 
+ 
+ 


D6* D5* D3 
/D6*/D5* D3 
/D6* D5*/D3 

D6*/D5*/D3 


;B6 


; + : 
: + : 


B5 


: + : 


B3 




: + : 


D2* D0*( B2) 


;B2 


: + ; 


BO 


: + : 


CI 




+ 


/D2*/D0*( B2) 














+ 


/D2* D0*(/B2) 














+ 


D2*/D0*{/B2) 












/A3 


+ 
+ 
+ 


D6* D4* D3 
/D6*/D4* D3 
/D6* D4*/D3 

D6*/D4*/t)3 


;B6 


: + : 

: + : 


B4 


: + : 


B3 




:+: 


Dl* D0*( B3) 


;Bl 


: + : 


BO 


: + : 


CO 




+ 


/D1*/D0*( B3) 














+ 


/Dl* D0*(/B3) 














+ 


D1*/D0* (/B3) 
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TITLE 
PATTERN 



ERROR CORRECTION UNIT NO. 
ECUl 



REVISION 01 

AUTHOR B . BRAFMAN 

COMPANY MONOLITHIC MEMORIES, INC. 

DATE 11/06/87 

CHIP ECUl PAL16L8 

/S3 /S2 /SI /SO B7 B6 B5 B4 B3 GND 
B2 NC NC B2C B3C B4C B5C B6C B7C VCC 



EQUATIONS 

/B7C = S3* S2*/S1*/S0* B7 
+ /S3*/B7 
+ /S2*/B7 
+ Sl*/B7 
+ S0*/B7 

/B6C = S3*/S2* SI* SO* B6 
+ /S3*/B6 
+ S2*/B6 
+ /Sl*/B6 
+ /S0*/B6 

/B5C - S3*/S2* S1*/S0* B5 
+ /S3*/B5 
+ S2*/B5 
+ /Sl*/B5 
+ S0*/B5 

/B4C = S3*/S2*/S1* SO* B4 
+ /S3*/B4 
+ S2*/B4 
+ Sl*/B4 
+ /S0*/B4 

/B3C = /S3* S2* SI* SO* B3 
+ S3*/B3 
+ /S2*/B3 
+ /Sl*/B3 
+ /S0*/B3 

/B2C = /S3* S2* S1*/S0* B2 
+ S3*/B2 
+ /S2*/B2 
+ /Sl*/B2 
+ S0*/B2 

; SIMULATION NOT INCLUDED 



CORRECTION OF B7 
NO CORRECTION 
NO CORRECTION 
NO CORRECTION 
NO CORRECTION 

CORRECTION OF B6 
NO CORRECTION 
NO CORRECTION 
NO CORRECTION 
NO CORRECTION 

CORRECTION OF B5 
NO CORRECTION 
NO CORRECTION 
NO CORRECTION 
NO CORRECTION 

CORRECTION OF B4 
NO CORRECTION 
NO CORRECTION 
NO CORRECTION 
NO CORRECTION 

CORRECTION OF B3 
NO CORRECTION 
NO CORRECTION 
NO CORRECTION 
NO CORRECTION 

CORRECTION OF B2 
NO CORRECTION 
NO CORRECTION 
NO CORRECTION 
NO CORRECTION 
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TITLE ERROR CORRECTION UNIT NO. 2 

PATTERN ECU2 

REVISION 01 

AUTHOR B . BRAFMAN 

COMPANY MONOLITHIC MEMORIES, INC. 

DATE 11/06/87 

CHIP ECU2 PAL16L8 

/S3 /S2 /SI /SO Bl BO /C3 /C2 /CI GND 
/CO NC ERROR COC CIC C2C CSC BOC BIC VCC 

EQUATIONS 



/BIC = /S3* S2*/S1* SO* Bl 
+ S3*/B1 
+ /S'2*/B1 
+ S1*/B1 

/BOC = /S3*/S2* SI* SO* BO 
+ S3*/B0 
+ S2*/B0 
+ /S1*/B0 
+ /SO*/BO 

/C3C = S3*/S2*/S1*/S0* C3 
+ /S3*/C3 
+ S2*/C3 
+ Sl*/C3 
+ S0*/C3 

/C2C = /S3* S2*/S1*/S0* C2 
+ S3*/C2 
+ /S2*/C2 
+ Sl*/C2 
+ S0*/C2 

/CIC = /S3*/S2* S1*/S0* CI 
+ S3*/C1 
+ S2*/C1 
+ /S1*/C1 
+ S0*/C1 

/COC = /S3*/S2*/S1* SO* CO 
+ S3*/C0 
+ S2*/C0 
+ S1*/C0 
+ /SO*/CO 

/ERROR = /S3*/S2*/S1*/S0 

; SIMULATION NOT INCLUDED 



CORRECTION OF Bl 
NO CORRECTION 
NO CORRECTION 
NO CORRECTION 

CORRECTION OF BO 
NO CORRECTION 
NO CORRECTION 
NO CORRECTION 
NO CORRECTION 

CORRECTION OF C3 
NO CORRECTION 
NO CORRECTION 
NO CORRECTION 
NO CORRECTION 

CORRECTION OF C2 
NO CORRECTION 
NO CORRECTION 
NO CORRECTION 
NO CORRECTION 

CORRECTION OF CI 
NO CORRECTION 
NO CORRECTION 
NO CORRECTION 
NO CORRECTION 

CORRECTION OF CO 
NO CORRECTION 
NO CORRECTION 
NO CORRECTION 
NO CORRECTION 

NO ERROR! 
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Introduction 

The growing sophistication of computer systems has dictated a 
corresponding increase in memory storage capacity. Unfortu- 
nately, the speed of these large memories has not kept pace with 
that of their associated processors. Fast memories are still 
expensive; in order to reduce this cost, a cache memory system 
can be used to provide information to the processor without sac- 
rificing speed. Located between the central processing unit and 
the main memory a cache memory stores only the blocks of 
memory currently in use by a program. This provides fast access 
to program and data resulting in higher system performance. 
Once a new memory location is desired, blocks of cache and main 
memory are swapped to update the cache with the memory 
blocks currently in use. Thus a cache memory can dramatically 
increase the performance of the entire computer system while 
maintaining minimal costs. A cache system with a 35 ns access 
time and a 50 ns cycle time is described below. 

The use of a fuse-programmable controller (FPC), Am29PL141, 
simplifies the design of such a cache memory system. The simple 
internal architecture and instruction set of the FPC make this 
controller easy to implement in systems, and an on-chip micro- 
programmablefuse array provides a high level of integration. The 
cache controller algorithm is programmed and stored in this array. 



Because the FPC operates at 20 MHz, as opposed to the 1 MHz 
clock frequency of other currently available controllers, it is 
especially well suited for high-performance applications. 

Cache Memory Systems 

Almost any system that requires a high-speed memory interface 
can benefit from the use of a cache-memory system. The system 
bottleneck is typically the memory access time. Large dynamic 
RAMs that are currently available have access times of 1 00-1 80 
ns with cycle times of about 180-400 ns. The smaller, faster 
memory devices that can be used for a cache have access times 
of 25-30 ns and cycle times of around 40-50 ns. In order to 
achieve these lower access times and still have the memory 
capacity of the larger DRAM memories, a cache memory hierar- 
chy is used. 

Any application can make effective use of caches provided that 
the concept of "locality of reference" is operative. This concept 
states that, at any particular time, the addressing pattern of a 
program tends to be localized within a block or area of the total 
available address space. In other words, if data at a given 
address is accessed, it is likely that the next memory access will 
also be in that same block of addresses. In general, for systems 
where this concept does not apply, the use of a cache system 
would be difficult. 



DRAM 
MAIN MEMORY 

DATA ADDRESS CONTROL 



ADDRESS 

TAG BUFFER 
DATA 

CONTROL 



C 



7\ 



A 



]T 



ADDRESS BUS 



\7 



DATA BUS 



xz iz 



ADDRESS DATA 

CONTROL 
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CPU 

ADDRESS 



CONTROL 



FUSE PROGRAMMABLE 
CONTROLLER 

CACHE SYSTEM 
CONTROLLER 



PAL CONTROL 
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CONTROLLER 



Figuro 1. Block Diagram of Cacho Momory System 
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Sections of main memory data that can be identified asfrequently 
used can be stored in cache memoriesto ailowdecreased access 
times, thereby increasing the overall performance of the system. 
As different sections of a program become frequently used, they 
can be transferred to the cache memory, displacing other sec- 
tions that are no longer needed. Some of the application areas 
where caches are used to increase the system performance are 
for mini/micro-computers, signal processing and image process- 
ing systems. 

The main components of the cache memory system are the 
cache data memory, the cache tag buffer, the replacement logic, 
and the cache controller (see Figure 1). 



When a tag search is performed in the tag buffer, either a hit or 
miss condition results. A hit indicates that the desired word is in 
the cache; a miss indicates that the desired word must be ac- 
cessed from the main memory or a transfer from main memory to 
cache is needed. The performance of the cache system is 
dependent on the ratio of the number of hits to the total number 
of memory references made, which is called the hit ratio. The hit 
ratio depends on the size of the cache memory selected and the 
application of the system using the cache memory. A high hit ratio 
implies that most of the data accessed by the CPU was in the fast 
data-cache memory when requested. The performance of the 
entire system gets degraded if a poor hit ratio occurs. This is why 
"locality of reference" is important for performance. 



Cache Data Memory 

The cache data memory is the small, fast memory that contains 
the most often used data, copied from the slower main memory. 
The cache data memory size directly affects the performance of 
the entire system. The larger the cache data memory, the more 
data that can be accessed at the faster speeds. However, if the 
size of the memory device gets very large, the performance of the 
memory decreases. The larger the memory devices are, the 
longer are their access times, and, consequently, their cycle 
times. The optimum data cache size will vary depending on the 
application/algorithm being executed. 

Tag Buffer 

An identifier (or tag) accompanies every block (a contiguous set 
of data words) that can be transferred to or from main memory. 
This tag is stored in the tag buffer. This tag identifies where the 
data is located in main memory. To determine whether a block of 
data is already in the cache memory, the tags stored in the tag 
buffer are compared to the desired tag (Figure 2). Various 
methods exist for searching the tag buffer for a block of data. 
Three of the most popular approaches are: 

1. Fully associative 

2. Direct mapped 

3. Set-associative 

In fully associative caches, any memory word can be found in any 
of the cache-data blocks. The memory address is divided into two 
sections: the TAG and the BLOCK fields. The TAG is used to 
determine if the block of data is in the cache and the BLOCK 
indicates a data word within that block. 

For a direct-mapped cache the memory word can be found in only 
orie of the cache blocks. The memory address is sectioned into 
three fields: the TAG, the INDEX, and the BLOCK. The BLOCK 
indicates the data word in the block. The INDEX selects the 
program block that contains the word, and the TAG is used to 
determine if the block is in the cache. 

The third way of searching the cache for data words is the set- 
associative method. This divides blocks into sets and allows the 
cache block to be located in any one of these sets. Typically, the 
numberof sets used are two, four, or eight. The TAG, INDEX and 
the BLOCK fields are similar to those in the direct mapped 
techniques. Figure 2 shows the three different address tech- 
niques and their fields. 



Replacement Algorithms 

The replacement algorithm is used to decide the cache location 
to replace with new data when a miss occurs. Three of the most 
common data replacement algorithms currently in use are: 

1. First-in First-out (FIFO) 

2. Least Recently Used (LRU) 

3. Random Replacement (RR). 

The choice between these three techniques is dependent upon 
the nature of the data being transferred to and from the cache. 
The FIFO algorithm simply selects the oldest block of data written 
into the cache to be replaced. After the cache buffer is full, data 
blocks are replaced sequentially. In this manner, even a data 
block which is used very often, will be replaced. 

The LRU technique selects the data block to be replaced by 
determining which block in the cache was least used as com- 
pared to the other blocks at that time. With this strategy, the 
chances that the block just replaced in the cache will be required 
again soon is assumed to be relatively small. 



FULLY ASSOCIATIVE CACHE 
MEMORY SYSTEM 



TAG 


BLOCK 



TAG IDENTl- DATA WORD 
RES IF BLOCK IN CACHE 
IS IN CACHE BLOCK 



DIRECT MAPPED MEMORY 



TAG 


INDEX 


BLOCK 



TAG IDENTl- INDEX FOR 
FIES IF BLOCK CACHE 
IS IN CACHE 



DATA WORD 

IN CACHE 

BLOCK 



SET-ASSOCIATIVE 
CACHE MEMORY 



TAG 


INDEX 


BLOCK 



TAG IDENTl- INDEX FOR DATA WORD 
FIES IF BLOCK CACHE IN CACHE 

IS IN CACHE BLOCK 

Figure 2. Cache Organizations 
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The third method, the random replacement approach, randomly 
chooses the block of data to be replaced. The easiest to 
implement, this technique is optimum for some applications. 

Another important design consideration for cache memory sys- 
tems is the method used for keeping the cache data and main 
memory updated. When writing to the cache memory, the data 
change must be echoed to the main memory before that data 
word is replaced by new data in the cache block. One way this is 
handled is the "store-through" method. On a write command, the 
data is written to the cache as well as the main memory simulta- 
neously. In this way, the main memory data is kept current. This 
also saves the task of writing back old cache data into main 
memory when replacing a new data block into cache. 



Cache Controller 

All of the signals that are required to keep track of the various 
functional blocks involved in the cache memory system are 
controlled by the cache controller. This controller monitors the 
stale of the tag buffer search, the CPU, main memory and cache 
data memory condition, as well as handling all memory transfers 
for hits or misses and read and write operations. The high- 
performance and integrated functions available on a single chip 
make the design of a cache controller using the Am29PL141 
simple. The FPC provides 1 6 control signals for off-chip control 
and seven input test pins for monitoring external conditions. The 
flexible instruction set permits easy microprogramming of the 
desired function. 
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Figure 3. FPC Detailed Functional Diagram 
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The Am29PL14l Fuse-Programmable Controller 

The Ann29PL141 is a high-performance, single-chip, fuse- 
programmable controller (FPC). This chip is designed to allow 
implementation of complex state machines and controllers by 
programming the appropriate sequence of microinstructions in 
the on-chip microprogram memory. With its intelligent micropro- 
gram address sequencer, high-speed 64 x 32 bit microprogram 
memory, pipeline register, and an on-chip diagnostics register, 
this chip offers the benefits of low chip count, fast operation, easy 
development, and testability. This device is available with a 20 
MHz clock rate (50 ns cycle time) in a 28 pin dual-in-line package. 
A microprogram address sequencer is tlie heart of the FPC. The 



Am29PL141 has 29 high-level microinstructions which include 
jumps, loops, subroutine calls, and multiway branching. These 
microinstructions can be conditionally executed based on the test 
inputs. As a single chip solution, the FPC is designed to ease the 
implementation of distributed microprogrammed systems. It can 
be used to off-load the central controller by serving as an 
intelligent distributed controllerforvariousfunctionai units. Its on- 
chip diagnostics register is used to control and observe the 
parallel pipeline registerduring diagnostics mode. This capability 
provides both controllability and observability for the functional 
blocks connected to the FPC control outputs. A block diagram of 
the FPC is shown in Figure 3 and the micro-instruction format is 
shown in Figure 4. 
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Figure 4. Microinstruction Format 
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System Overview 

The cache scheme selected for implementation is a set-associa- 
tive cache with four sets (see Figure 5). The 18-bit physical 
memory address word is divided into 8 bits of tag, 8 bits of index, 
and 2 bits of block address. Four 256-byte deep, index-ad- 
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Figure 5. A Set-Associative Cache Scheme 



dressed tag buffers are used in this scheme. These tag buffers 
contain four possible sets of tags for the same index location. 
Corresponding to each tag bufferthere are four 1 6-bit wide cache 
memories, which contain four (block) words for each index 
location. The physical block address is also used to address 
these cache memories. 

Figure 6 shows the actual implementation where the main system 
controller is the Am29PL1 41 FPC. There is additional glue logic 
in the form of one PALI 8P8 and one PALI 6R6. The PALI 8P8 is 
used in the cache access path and is very fast. It generates the 
cache output enable, cache selects, and write signals for the 
cache. The PALI 6R6 is used to generate the Ready signal to the 
system CPU. It is also used to inform the FPC of the status of 
read/write cycles. All the PAL devices are controlled by the FPC 
for generation of the system control signals. 

System Operation 

There are four main system cycles possible: read and hit, read 
and miss, write and hit and write and miss. The response of the 
controller is different for each system operation cycle. For read 
& hit cycle the FPC performs no action. Only on detection of write 
cycle, or read & miss cycle, the PALI 6R6 asserts the CYCLsignal 
initiating FPC action. 

Read and Hit Cycle 

When the system CPU requests a read cycle, it sends out an 1 8- 
bit address. The most significant eight bits (Figure 6) 
TAGADDO-7 are the eight bit tag which is compared to the tags 
stored in the four tag buffers to detect a hit or a miss. The next 
eight bits IDXADDO-7 for index, address the correct tag buffer 
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Figure 6. Control Section for Cache System uses one FPC and two PAL Devices. 
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locations. These eight bits are also used to access the four cache 
sets simultaneously. The least significant BLOCKO-1 block 
address are directly used to address the four cache sets. A 
cache CACS (chip select) is generated by the PALI 8P8 to access 
the four cache sets simultaneously. Once the tag buffer match 
occurs, the selected tag buffers asserts the hit signal. The hit 
signal is used to enable the outputs of the corresponding cache 
set, in order to provide the correct cache data to the CPU. The 
flowchart is shown in Figure 9. 

Read and Miss Cycle 

if the proper tag does not appear in any of the four tag buffers, a 
miss signal is generated by the tag buffers on H/MO-3 signal lines 
(Figure 7 and 8). On detection of the four miss signals the 
PAL1 6R6 removes the "RDY" signal to CPU (for CPU wait cycles) 
and informs the FPC of the MISS by asserting the CYCL & HIT/ 
MISS signals. When the FPC detects this read and miss, it 
latches the current physical address by asserting the ADLATCH 
signal, and initiates a DRAM (System Memory) cycle by asserting 
the DREQ signal, to provide the data to the CPU from the system 
memory. When this data is available signalled by the DRAM data 
ready signal DRDY, the FPC signals the assertion of the RDY 
signal, allowing the CPU to read the requested data. Simultane- 
ously the FPC asserts the INHIBIT signal which disallows further 
CPU access to the cache. 

The FPC then performs an update algorithm which brings the 
data from the system memory to the cache. It asserts the CTRST 
signal which disables the CPU data lines to the cache system. It 
also disables CPU block addresses by asserting BLDBL signal 
and enables its own output address lines BADO-1 in order to 



address different memory words within a block itself, it then 
performs read cycles on the system memory by using DREQ 
(DRAM memory request), DRW (read or write) and DRDY 
(DRAM data ready) signals and write cycles on all the blocks of 
the corresponding cache set by asserting CACWRT signal. The 
replacement set is determined by the LRU control block which 
selects one of the four sets. 

Upon completion of the update of new cache data, the FPC 
removes the INHIBIT signal to allow further CPU access to the 
cache. 

Write and Hit Cycie 

The write cycle operation is different from the read cycle, since the 
main memory has to be updated along with the cache. This is 
called the "write-through" strategy. Since typical programs re- 
quire only 1 0-1 5% of the processor time for write cycles, this does 
not impact the overall performance. For every write cycle for 
which the tag buffer signals a hit, the CPU writes the data to the 
cache. In addition, the FPC initiates a DRAM (system memory) 
cycle and updates it with the same data (see Figure 10). 

Write and l\/liss Cycle 

For a write and miss cycle, the function of the system is similar to 
the read and miss cycle, except that a write of the main memory 
is performed Instead of a read. After the write to the main memory 
is completed, the FPC executes the update algorithm to bring the 
contents of that memory block into the cache. The flow chart for 
the microcode executed by the FPC for the write cycle is shown 
in Figure 10. 
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Figure 7. Memory Organization of Cache System 
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Figure 10. Write Control Flow Chart 
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FPC Operation 

The System Controller has five main functions: 

1 . Test for a cycle request (CYCL) 

2. Test for different cycle types 

3. Execute appropriate action for each cycle 

4. Execute update algorithm if required 

5. Generate update addresses if required. 

Test for Cycle Request 

The FPC samples the CYCL signal continuously. The CYCL 
signal is generated only for read & miss, write and hit, and write 
and miss cycles. The system requires FPC intervention only for 
these three cycles. When CYCL is asserted, the FPClests for the 
three cycle types. 

Test for Cycle Type 

This involves sampling the CPU, memory request MREQ, and 
read or write (RHWL) signals. On the basis of these signals the 
FPC decides between the initiation of read or write cycles. It then 
samples the HIT/MISS signal generated by the PAL16R6 to 
decide on the appropriate action to be taken. 

Action for Each Cycle 



The replacement algorithm then executes read cycles on the 
memory and write cycles on the cache for all the four block 
addresses. After the replacement is complete, the control returns 
for sampling of new CPU cycles. The design file for the Cache 
controller is shown in Figure 12. 

The replacement set selection can depend upon various selec- 
tion algorithms as described before. This set selection logic has 
not been shown in this design. However, the algorithm used here 
is LRU (Least Recently Used) method, and only its interface with 
the cache controller is shown. 

Timing and Performance 

For a cache read cycle when a hit occurs, the worst case access 
time for the cache system is the tag buffer delay (20 ns) along with 
the cache output enable delay (15 ns). The worst case access 
time for this cache implementation is 35 ns. The cycle time for the 
system will be a little longer than the access time. 

The 50 ns cycle time is a very high speed cache performance, 
considering that the usual cycle time for DRAMs is in the range of 
280-400 ns. Further improvements in the cycle time can be made 
by using PAL devices with a faster propagation delay of 1 ns. 

CALL REPLACEMENT 
SUBROUTINE 



Once the FPC determines the CPU cycle type and whether the 
location exists in the cache (HIT or IVIISS), it then performs 
appropriate actions such as dynamic RAM (System Memory) 
cycle, and Update algorithm. The functions performed by the 
FPC for each cycle type were detailed earlier. 

Update Algorithm 

On detection of a CPU cycle and a MISS, the FPC performs the 
update algorithm. It consists of reading data from the system 
memory (DRAM) and writing that data to the appropriate cache 
set. The address forthis replacement is the one on which the CPU 
cycle MISS occurs, which in turn is latched by the FPC on 
detection of the MISS. The selection of the appropriate cache set 
is dependent upon the information provided by the replacement 
logic block. The update is done for all four blocks (words) of data. 
During this update, the FPC isolates the CPU data bus from the 
cache data bus by asserting the CTRST signal. 

Update Address Generation 

The replacement address is the CPU address at which the MISS 
occurs. However, all the four blocks for the same memory 
address should be replaced in the cache. To generate these four 
addresses, two of the FPC's eight three-state lines are used. The 
two least significant bits of the CPU's physical address (Block 
Address bits) are disabled by the FPC by asserting the BLDBL 
signal, and the two output bits of the FPC (three-state control 
lines) are enabled onto the address bus. This is done in a manner 
to allow the system CPU to continue to work with the address and 
data bus, isolated from the cache system, during the execution of 
the replacement algorithm. 
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Figure 11. Fiow Chart for Cache Update Scheme 
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device (am29pll41) 
default = ; 

define 

mreq = tO 

rhwl = tl 

hitm = t2 

drdy = t3 

nocdn = t4 "No condition" 

intword = 07d#h; "Initialize output word" 

begin 

init : od intword, continue ; "Initialize Outputs" 

start : od intword, if (mreq) then goto pl(start); 

"Check for memory cycle" 
od intword, if (not rhwl) then goto pl(write); 

"Check for read or write" 

read : od intword, if (hitm) then goto pl(start); 

"Read & hit goto start" 

latch : od 079#h, if (not drdy) then goto pl(latch); 

"Read & miss latch address" 
od 069#h, continue; "DRAM req for CPU data" 

od 063#h, continue; "DRAM output enb & CPU cycle" 

od 063#h, if (nocdn) then call pl(rplce); 

"Goto replacement subroutine" 
od intword, if (nocdn) then goto pl(start); 

"Read cycle complete" 

write : od intword, if (not hitm) then goto pl(wrtms); 

"Write hit or miss" 

wrtht : od Ofd#h, if (not drdy) then goto pl(wrtht); 

"Hit write to cache" 
od 04d#h, continue; "Write to DRAM" 

od 04f#h, continue; 
od 04f#h, if (nocdn) then goto pl(start); 

"Write hit complete" 

wrtms : od intword, if (not drdy) then goto pl(wrtms); 

"Miss check DRAM for write" 
od 049#h, continue; 

od 04b#h, continue; "Latch address & DRAM write" 

od 04b#h, if (nocdn) then call pl(rplce); 

"goto replacement algorithm" 
od intword, if (nocdn) then goto pl(start); 

"Write miss complete" 

rplce : od 07b#h, if (not drdy) then goto pl(rplce); 

"This is replace routine" 

oe 022#h, continue; "Isolate CPU databus" 

oe 022#h, continue; "DRAM data enb for cache write' 

oe 0a2#h, continue; "Write to cache at address 00" 

waitl : od 13a#h, if (not drdy) then goto pl(waitl); 

"Change Block address" 

oe 122#h, continue; "Repeat as before address 01" 

oe 122#h, continue; 

oe la2#h, continue; "Write to cache at address 01" 

wait2 : od 23a#h, if (not drdy) then goto pl(wait2); 

"Change Block address" 

oe 222#h, continue; "Repeat as before address 02" 

oe 222#h, continue; 

oe 2a2#h, continue; "Write to cache at address 02" 

wait3 : od 33a#h, if (not drdy) then goto pl(wait3); 

"Change Block address" 

oe 322#h, continue; "Repeat as before address 03" 

oe 322#h, continue; 

oe 3a2#h, continue; "Write to cache at address 03" 

od 03a#h, if (nocdn) then ret; "Return from replace routine" 



end. 



Figure 12. Cache Controller Design File 
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PAL22RX8A Provides Control and 
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Two components used in the temporary storage of data are 
LIFO (Last In, First Out), and FIFO (First In, First Out) buffers. 
Usually, a FIFO would be used to store data between asyn- 
chronously operating devices. For example, data sent from 
one microprocessor system to a second might reside tempo- 
rarily in a FIFO until the second microprocessor is ready to 
read the data. Sometimes the FIFO is known as an asynchron- 
ous temporary buffer store. 

The LIFO also offers temporary storage and will store data in 
the same w&y as a FIFO but the sequence of reading the data 
will be reversed. LIFO applications in digital processing sys- 
tems would usually be to store return addresses and essential 
registers when the processor is executing a branch to subrou- 
tine or interrupt service routine. The processor will retrieve this 
information when it executes return instructions. The LIFO is 
an essentia! component in the processor's ability to perform 
context switching. 

The LIFO is sometimes called a stack, and the process of writ- 
ing to the stack is called a PUSH operation and reading data is 
a POP operation. In the central processor dedicated instruc- 
tions for PUSH and POP operations are decoded and executed 
to save and restore return addresses, contents of accumula- 
tors, status registers, and index registers. Other LIFO applica- 
tions could be found in data buffering where the order of data 
reception should be reversed. 

The concept of the stack is used in performing Reverse Polish 
Notation (RPN) mathematics, and it is claimed by some to be 
superior in performance to the conventional algebraic process- 
ing technique. With RPN, instructions and operands are 
pushed onto the stack, and evaluated as they are popped off; 
the result is usually left in the general purpose register. 

When a memory is used as a stack, the evaluation process 
may be repeated because the initial evaluation did not destroy 
the RAM contents. This non-destructive reading of data is a 
feature of RAM-based FIFO and LIFO architectures, and is not 
the case for register-based architectures. FIFOs and LIFOs 
made from register arrays rely on data shuffling through the 
array. Data in the registers are overwritten during each cycle. 
The RAM control for a LIFO essentially consists of an Up/Down 
counter to address locations in the memory, and a control 
sequencer to drive the memory's chip select (CS) and write 
(WR) inputs. 



The new PAL22RX8A has been programmed to perform a 
control function for a small 32-location-deep stack. A two-chip 
solution is achieved by using an Am 9128 static RAM and a 
PAL22RX8A (see Figure 1), for bytewide applications. 
Additional RAM devices may be added in parallel for 16- and 
32-bit wide applications. A deeper LIFO may be configured by 
cascading a second PAL22RX8A to create a higher-order Up/ 
Down counter. 
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Figure 1. 32-Locatlon-Deep LIFO Controller Uses PAL22RX8A 



Figure 2 shows the block diagram of the PAL22RX8A device. 
Fourteen dedicated inputs drive the single fuse array through 
true/complement buffers. One of those inputs includes the 
clock, so logic operations may be performed on the clock or its 
complement. 
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Figure 2. Block Diagram 



PLCC pin numbers are indicated in parentheses. 
PLCC pins 1,8,15 and 22 are not connected. 



There exist eight macrocells in the device. The schematic view 
of the macrocell architecture is shown in Figure 3. The D-type 
register is the storage element in the device and is clocked 
from a single clock input. The D input is fed from an exclusive- 
OR (XOR) gate, the two inputs of which are driven from one 
product term and a sum of eight product terms. This XOR gate 
can be used to control the polarity at the output pin; the single 
product term input can be used to create polarity Inversion. 




Figure 3. PAL22RX8A Macrocell 

With regard to registered output configurations, the XOR gate 
may be used to configure alternative registered functions. It 
can be shown that a J-K flip-flop may be derived by using feed- 
back, the sum-of-product inputs, and the XOR input, as shown 
in the equations below. Three out of the five equations use the 
XOR gate to create the J-K function. 



1.Q 
2. Q 
3.0 
4.0 
5.0 



= Q*J + Q*K 
= Q*J+Q*K 
= :+: (Q*J + Q*K) 
= Q:+:(Q*J + 0*K) 
= Q:+:(Q*J + 0*k') 



Also, S-R and other subset functions may be configured in the 
macrocell. 

There exist two multiplexers in the macrocell which select 
either a registered or combinatorial output path. If the control 
fuse is intact, the output is registered, and the feedback into 
the fuse array is through a buried feedback path from the O 
node. A combinatorial route may be selected by programming 
the fuse; the register is flushed and feedback to the array is 
from the output pin. Programmable I/O may be achieved in this 
configuration by controlling the three-state condition of the in- 
verting output buffer. A single product term can enable the out- 
put buffer with a logic High or disable it with a logic Low. When 
in a three-state condition the output pin may be used as an in- 
put. 

When using the flip-flop in the macrocell it is possible to use 
the asynchronous Reset or Preset feature. A single input pin 
may be dedicated to a Reset or Preset function, and driven ac- 
tive to initialize the register in the device. 

Toggle Function as a J-K Subset 

A frequently used subset of the J-K function used in binary 
count applications is the Toggle (T) function. If J and K are tied 
together and driven with a logic High then the flip-flop output 
will Toggle when clocked; if driven Low the output will Hold. For 
a given flip-flop (N) in the count sequence the toggle equation 
will be: 

6. ON := ON :+: ON-1*QN-2*...Q1*Q0. 

This will cause the flip-flop to Hold until the product term of all 
the lesser significant registers goes to a logic High; then, the 
flip-flop will toggle polarity. This equation is suitable for an Up 
count. A Down count is given by: 

7. ON := ON :+:QFri*QN^*.. 01*00. 
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Both equations can be combined with an Up/Down control to 
effect an Up/Down counter. With regard to the LIFO applica- 
tion, the Up/Down control for the address input to the CMOS 
RAM will be decoded from the PUSH or POP instruction. The 



sequence will be "Increment then Write" for a PUSH operation 
and "Read then Decrement" for a POP operation. The timing 
waveforms (Figure 4) show the sequence for PUSH and POP 
cycles. Note that the LIFO starts from memory address zero. 




m^^^^^^^^^ 



Active Write Cycle - PUSH 

1 . PUSH input goes active High at tsu or greater 
before the clock edge. 

2. First clock edge samples the active PUSH input. 
2a. WR goes active Low. 

2b. Address increments. . 

3. Second clock edge; PUSH still active High. 



i^ 



3a. OS goes active Low. 

4. Third clock edge; PUSH still active High. 
4a. OS goes inactive High. 

5. Fourth clock edge; PUSH still active High. 
5a. WR goes inactive High. 

6. PUSH instruction is removed. 




ADDR 



Active Read Cycle - POP 

1 . POP instruction goes active High at tsu or 
greater before the clock edge. 

2. First clock edge samples the active POP input. 
2a. CS goes active Low to select the RAM. 

3. Second clock edge; POP still active High. 
3a. CS goes inactive High. 



4. Third clock edge; POP still active High. 

4a. WR goes active Low; the memory is deselected 
so no Write will take place. 

4b. Address lines decrement. 

5. POP instruction is removed. 
5a. WR goes inactive High. 



Figure 4. LIFO Control Timing Diagrams 
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The control outputs to the RAM are CSX and WR. An additional 
register (XO) is required to enable all states to be reached; two 
registers alone are not enough. From the timing waveforms it 
can be seen that an active write cycle is initiated after an ac- 
tive PUSH input has been sampled by the system clock. The 
WR signal then toggles, causing the output pin to go active 
Low. This condition is maintained until the control register XO 
goes High, at the end of the PUSH cycle. 

The equations after the XOR symbol in the design specification 
(page 2-255) cause the toggle function; when these product 
terms become active High a toggle o perat ion is effected in the 
associate d flip -flop. For example, the CSX flip-flop is in a Hold 
slate until WR goes Low and while XO i s High during a P USH 
sequ^ce; during a POP sequence, CSX holds unless both WR 
and XO are High. The equations are written for each flip-flop by 
using one product term after the XOR label to effect the 
change of state or toggle condition to fit the requirements of 
the state machine. Without the XOR gate, an Up/Down counter 



would require more than the eight product terms available in 
standard PAL devices. 



Instruction Decoding 

Although not shown in the design specification, it is possible 
for the PAL device to decode instructions for PUSH and POP. 
Assuming the unused input pins are converted to 10-17, which 
come from an instruction register in a processor system, then 
decoding could be achieved in the fuse array. If an additional 
input VI qualif^es^avalid instruction and the code for PUSH is 
given as I7*I6*I5*I4*I3*I2*I1*I0*\/I, then this equation could 
be added to the string declaration statement of the design 
specification. The PAL device could decode POP operations 
similarly. In this way, external decoding logic may be incorpo- 
rated into the PAL device, which should increase the operation 
speed. 
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TITLE 

PATTERN 

REVISION 

AUTHOR 

COMPANY 

DATE 



LIFO RAM CONTROLLER 

01. 

04. 

CHRIS JAY. 

MMI SANTA CLARA, CA 

23 APRIL 1987 



CHIP LIFOCONT PAL22RX8 

THE PAL2 2RX8 HAS BEEN DESIGNED TO CONTROL 32 RAM 
LOCATIONS AS A LIFO RAM CONTROLLER. THE PAL DEVICE 
DESIGN SPECIFICATION CONSISTS OF TWO STATE MACHINES. 
ONE STATE MACHINE IS AN UP/DOWN BINARY COUNTER THAT 
DRIVES THE ADDRESS LINES OF THE RAM, AND THE SECOND 
DRIVES THE /WR AND /CS LINES OF THE RAM TO CONTROL 
WRITE AND READ OPERATIONS. DURING A PUSH OPERATION 
DATA IS WRITTEN TO THE LIFO; THE PROCEDURE IS INC- 
REMENT ADDRESS COUNTER AND WRITE DATA. FOR A POP 
OPERATION THE READ CYCLE IS PERFORMED FOLLOWED BY 
AN ADDRESS DECREMENT. THE XOR GATE IN EACH MACRO- 
CELL ALLOWS INCREMENT AND DECREMENT OPERATIONS 
TO BE EFFECTED WITHOUT UTILIZING A LARGE NUMBER OF 
PRODUCT TERMS. 



PIN 



1 
CLK 



2 

PUSH 



3 
POP 



4 
NC 



5 
NC 



6 
NC 



;PIN 
;PIN 
;PIN 
GLOBAL 



7 
NC 

13 
NC 

19 
/Q3 



NC 

14 
NC 

20 
/Q2 



9 
NC 

15 
/CSX 

21 
/Ql 



STRING INC •/CSX*/WR*/XO*PUSH' 
STRING DEC '/CSX*/WR* X0*POP' 



10 
NC 

16 
/XO 

22 
/QO 



11 
/RST 

17 
/WR 

23 
NC 



12 
GND 

18 
/Q4 

24 
VCC 



GLOBAL TERM 
ENABLES THE 
GLOBAL RESET 

INCREMENT LABEL 
DECREMENT LABEL 



Figure 6. LIFO RAM Controller Design File 



2-254 



Monolithic 



icM 



Memories 



PAL22RX8A 



EQUATIONS 
GLOBAL. RSTF = RST 



QO 



Ql 



Q2 



Q3 



Q4 



CSX 



WR 



xo 



: + ; 

+ 



QO 

INC 

DEC 

Ql 

QO*INC 
/QO*DEC 

Q2 

QO* Q1*INC 
/Q0*/Q1*DEC 

Q3 

QO* Ql* Q2*INC 
/Q0*/Q1*/Q2*DEC 

Q4 

QO* Ql* Q2* Q3*INC 
/Q0*/Q1*/Q2*/Q3*DEC 

CSX 

WR*/XO*PUSH 
/WR*/X0*POP 

WR 
/WR*/CSX*/XO* PUSH 

WR*/CSX* XO* PUSH 
/WR*/CSX* XO* POP 

WR*/CSX*/X0*/POP*/PUSH 

XO 

WR* CSX*/X0* PUSH 
/WR*/CSX* XO*/PUSH 
/WR* CSX*/XO* POP 
/WR*/CSX* XO* POP 



RESET AT POWER ON 

QO HOLD 

INC ADDRESS FOR PUSH 

DEC ADDRESS FOR POP 

Ql HOLD 
INC Ql 
DEC Ql 

Q2 HOLD 
INC Q2 
DEC Q2 

Q3 HOLD 
INC Q3 
DEC Q3 

Q4 HOLD 
INC Q4 
DEC Q4 

CHIP SELECT 
TO RAM DEVICE 



WRITE CONTROL 
TO RAM DEVICE 



CONTROL REGISTER 
FOR READ- WRITE 
STATE MACHINE. 



Figure 6. LIFO RAM Controller Design Fiie (Continued) 
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SIMULATION 

TRACE_ON CLK QO Ql Q2 Q3 Q4 

/CSX /WR /XO PUSH POP 

RST 
SETF /CLK RST /PUSH /POP 
CLOCKF CLK 
SETF /RST 
SETF PUSH 
FOR I := 1 TO 4 DO 
BEGIN 
CLOCKF CLK 
END 

SETF /PUSH 
CLOCKF CLK 
SETF PUSH 

FOR I := 1 TO 4 DO 
BEGIN 

CLOCKF CLK 
END 

SETF /PUSH 
CLOCKF CLK 
SETF POP 

FOR I := 1 TO 4 DO 
BEGIN 

CLOCKF CLK 
END 

SETF /POP 

FOR I := 1 TO 3 DO 
BEGIN 

CLOCKF CLK 
END 
TRACE OFF 



SIMULATION 

SECTION. 

TRACE ESSENTIAL 

SIGNALS. 

RESET PAL DEVICE. 

CLOCK RESET TO 

REGISTERS. 

ENABLE PUSH INPUT. 

PERFORM ONE PUSH 

CYCLE. 

PUSH AND POP MUST 

BE INACTIVE BETWEEN 

CYCLES. SET PUSH 

INACTIVE. 

SET PUSH ACTIVE 

AND PERFORM SECOND 

PUSH. 

SET PUSH INACTIVE 
AND CLOCK DEVICE 
THEN SET POP 
ACTIVE. PERFORM 
POP CYCLE. 



SET PUSH AND POP 
INACTIVE. CLOCK 
SYSTEM FOR A PASSIVE 
MODE. 



Figure 6. LIFO RAM Controller Design File (Continued) 
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Graphics systems are commonly used in most computers for 
displaying pictures and text. Functions of a graphics system 
include creation, manipulation and storage of picture databases 
and text. Usually, graphics systems also allow users to dynami- 
cally control a picture's content, size or colors. 

Graphics systems are closely related to Image processing sys- 
tems. While graphics systems manipulate images which already 
exist as a database, image processing systems analyze images 
taken from cameras by converting them to a graphical database 
and then processing that database. The hardware used forthese 
systems is very similar. In fact, it is so similar that many systems 
offer integrated graphics and image processing functions. 

In this section we will examine the possible uses of PLDs in a 
graphics and image processing system and their implementation. 
We will also determine which PLDs are suited for different 
applications in graphics and image processing systems. Laterwe 
will also see some design examples of PLD applications. 

A simple graphics system consists of five components, as shown 
in Figure 1. 

The first component is the graphics processor, which draws the 
image (referred to as "image rendering"). It also provides an 
interface to the main system processor (also known as the host 
processor). The second component is the display controller, 
which is an interface between the graphics processor and the 
graphics frame buffer. The display controller is essentially a 
modified DRAM controller. The third component is the graphics 



frame buffer, which is typically a memory that stores the image 
data. The data from the frame buffer is then serialized by data 
serializers which make up the fourth component of a graphics 
system. The data serializers are essentially modified shift regis- 
ters. The serial data is then converted to an analog signal suitable 
for driving the video monitors for display. This function is per- 
formed by the fifth and the final component of the graphics 
system, a digital-to-analog converter (DAC). 

Local Graphics Processor 

The graphics (or image) processor gets all the commands from 
the host processor. The interface between the graphics proces- 
sor and the host processor is through a bus (Figure 2), where 
commands and data bytes are transferred. Most of the popular 
standard bus interfaces are used for this function. Some of the 
commonly used bus interfaces are: 

• Multibus 

• VME 

• PC-bus 

• Nu-bus 

• MicroChannel 

The bus interface performs three major functions: 

• Bus arbitration 

• Bus control 

• Address decoding 



FRAME BUFFER 
MEMORY 



DATA 
SERIALIZERS 



DISPLAY 
CONTROLLER 



DACs 



MONITOR 



GRAPHICS BOARD 



LOCAL GRAPHICS 
PROCESSOR (OPTIONAL) 



HOST 
PROCESSOR 



DISK 
APPLICATION 
SOFTWARE 



Figure 1. Components of Graphics System Hardware 
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There are two major types of buses, synchronous and asynchro- 
nous. For synchronous buses, the bus arbitration and control 
functions can be effectively performed by fast registered PLDs 
such as the PAL16R8/6/4, PAL20R8/6/4 and PAL22V10. For 
asynchronous buses, the arbitration and control functions can be 
provided by combinatorial PLDs, or by an asynchronous PLD (the 
PAL20RA1 or AmPALC29MA1 6). The details for designing bus 
interface functions are given in aseparate section on page 2-325. 
The address decoding function is performed most effectively by 
a combinatorial PAL16L8. PAL20L8 or PAL22\/10. Address de- 
coding is discussed along with other combinatorial functions in 
the section on page 2-35. 



LOCAL GRAPHICS 
PROCESSOR 




HOST 
PROCESSOR 



Figure 2. System Bus Interface 

The local graphics processor is used to interpret the database of 
the graphical image ("display list"), down-loaded from the host 
processor. The graphics processor interprets the display list into 
objects with geometrical entities. It also performs coordinate 
transformation functions for recalculating object size as the object 
moves or viewer perspective changes. The last function per- 
formed by the graphics processor is the actual image rendering 
(drawing) functions of drawing arc/vectors, moving blocks of 
image data (bit-blit), and character manipulation for handling 
large chunks of text. 

In an image processing system the local processor performs the 
role of an image processing engine. The image processing 
software tasks are very different from the graphics tasks. Image 
processing tasks include interpreting the list of commands sent by 
the host and executing required functions such as image en- 
hancement, pattern recognition, or computer vision. The most 
intensive task is image enhancement which requires generating 
image transforms in their spatial or frequency domain. Generat- 
ing transforms requires a large number of multiplication opera- 
tions. Typical image processing systems use either a special 
microprocessor or a floating point multiplier for this purpose. 
PLDs can also be used to interface this multiplier to the local 
microprocessor. Some image processing tasks are very uniform 
and repetitive, and dedicated hardware can be easily designed to 
speed up such tasks in the interest of overall system perform- 
ance. Such dedicated hardware can be readily made with PLDs. 

The Display Controller 

For implementing graphics functions the local processor must be 
able to write to the frame buffer (draw an image). Similarly for 
implementing image processing functions the local processor 
must be able to read from and write to the frame buffer. The 
reading and writing of frame buffer data is possible with the help 



of the display controller, as shown in Figure 3. The display 
controller is essentially a timing generator which performs tliree 
major functions: 

• DRAM timing 

• Monitor control signal timing 

• Screen refresh 

The display controller generates the read and write cycle timings 
as well as the refresh timing of the DRAMs. The read and write 
cycles are initiated by the local graphics processor. These 
functions are identical to a regular DRAM controller discussed in 
the section on memory control (page 2-1 79). The second function 
required for the display controller is the generation of monitor 
control signals. This function is usually integrated with the DRAM 
controller, although it can also be implemented separately. 



FRAME BUFFER 
MEMORY 




' ' 




^ SCREEN REFRESH 








TO SERIALIZER 


DISPLAY 


» MONITOR 


CONTROLLER 


' CONTROL 



Figure 3. The Display Controller 

The controller needs to generate the DOTCLK signal which 
determines the rate at which data is sent to the monitor. The video 
monitor synchronization signals HSYNC for every new line, and 
VSYNC for every new frame (Figure 4) are generated by dividing 
this DOTCLK signal. Anothersignal, BLANK, asserted only when 
the pixels are being displayed, is also generated by this controller. 
The HSYNC, VSYNC and BLANK signals not only drive the 
monitor, but are also used as synchronization signals by the 
subsequent stages of the graphics system. 

in an image processing system, the HSYNC and VSYNC signals 
are generated by the camera which captures the image. The 
internal DOTCLK signal is a multiple of these HSYNC and 
VSYNC signals and is generated by using a phase locked loop 
(PLL). 

The third function of the display controller is to refresh the screen. 
The controller automatically reads data in sequential addresses 
from the frame buffer at fixed time intervals, and sends it to the 
monitor at a rate determined by the DOTCLK signal. The 
sequential addresses are also generated by the display controller 
as shown in the example on page 2-261 . Each chunk of data is 
called a pixel and is displayed on the screen as a dot. 

Registered PLDs can easily implement these display control 
functions. As this would be a fast state machine, the range of 
options includes the PAL32VX10/A, PAL16/20R8/6/4, 
AmPAL23S8, and the PLS1 05/1 67/1 68. An example for a line 
syncgenerator (HSYNC), and anotherforaframesyncgenerator 
(VSYNC), are discussed later on pages 2-263 and 2-265 respec- 
tively. 
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Figure 4. A Raster Scan Display Timing Control 



The Frame Buffer 

The frame buffer (Figure 5) is traditionally made from dynamic 
RAMs (DRAtvIs). Tlie DRAINS provide cost effective means for 
storing image data. Graphics systems require two ports for 



FRAME 
BUFFER 




reading and writing the frame buffer data. The first port is for the 
display update to and from the graphics processor; the second 
port is for the screen refresh to the monitor. 

The performance of agraphics system is measured by the screen 
update bandwidth. This is the rate at which the image data can 
be written to the frame buffer. The DRAMs can be organized in 
various ways to provide access to multiple bits of image data 
simultaneously, which can increase update bandwidth. Another 
related factor is the screen resolution, which determines the rate 
at which pixels have to be sent to the monitor in order to provide 
a flicker free display. The table in Figure 6 shows this pixel clock 
(DOTCLK) rate required for various screen resolutions. Varying 
DRAM organizations can also be used to meet this screen 
resolution bandwidth. 



RESOLUTION 


FREQUENCY 


PIXEL CLOCK (DOTCLK) 


512X512 


25 MHz 


40 ns 


640 X 480 


25 MHz 


40 ns 


720 X 350 


50 MHz 


20 ns 


1000X1000 


100 MHz 


10 ns 


1280 X 1000 


125 MHz 


8 ns 



Figure 5. The Frame Buffer 



Figure 6. Various Screen Resolutions and 
Bandwidth Required 
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DRAM 
MEMORY 



VRAM SERIAL SHIFT REGISTER 



DATA TO SERIALIZER 
OR MONITOR 



Figure 7. A Video RAiVI 

Video DRAMs (VRAMs) are specialized memories wliicii provide 
two ports. Tlie VRAI^s (Figure 7) include a built-in serializer 
capable of supporting high data bandwidths. These serializers 
are used to support high bandwidth requirements of the monitors. 
The VRAMs require extra control signals which are typically 
generated by the display controller. 

The performance of a graphics system is also determined in part 
by the frame buffer organization. There are two main types of 
organizations used in the industry. 

• Bit-plane mapped 

• Pixel-pacl<ed mapped 

Bit-plane mapping allows the graphics processorto access single 
bits (also called planes) from a number of adjacent pixels simul- 
taneously. The pixel-packed organization allows the graphics 
processor to access all the bits of one pixel simultaneously. The 
formerwas used in the monochrometype of displays whereas the 
latter is increasingly being used in the modern color displays. 

For high performance and large screen resolution systems, 
multiple pixel access is possible only with custom DRAM organi- 
zations, which require a custom display controller design. Also, 
different frame buffer organizations (bit-plane or pixel-packed), 
and the special timing requirements of VRAMs, further reduce the 
application of dedicated VLSI display controllers. PLDs provide 
an easy means of designing custom display controllers and are 
often used in such systems. 

Serializers 

The function of the serializer is to accept multiple pixels of parallel 
data, as sent by the display controller at slow speeds, and 
transform them into a serial pixel data stream. The serializer is 
required even for VRAMs, which can only support 25 MHz 
bandwidths. The serializer speed requirements are based on 
monitor resolution and are shown in Figure 5. 

The serializer is synchronized by the monitor control signals 
HSYNC, VSYNC and BLANK, which are generated by the display 
controller. A serializer is essentially a shift register, similar to the 
one discussed earlier on page 2-90. Certain modifications are 
added to accommodate the specific design requirements of a 
graphics system. These include synchronization with the display 
control signals. 

Many standard video serializers (679501, 679502, and Am8177) 
are available, which can be used for standard DRAM organiza- 
tions. For non-standard organizations, custom serializers can be 
made with PLDs. Sometimes graphics-related functions, which 
are not available in standard serializers, can also be added to 
such custom serializers. For example, a ZOOM function can be 



easily implemented in a custom serializer by having it repeat the 
same pixel data two or more times. An example of a seven bit shift 
register used as a serializer is shown on page 2-271. This 
serializer has the capability of providing reverse video, where it 
inverts the outgoing pixel data stream. 

In certain character-based terminals, a dedicated DRAM or 
VRAM based frame buffer is not used. The screen characters are 
stored in a static RAM and are sent to the monitor through a font 
RAM or ROM which stores the displayed shape of each character. 
This font RAM or ROM is also known as the character generator. 
The data from the font RAM constitutes the pixels which are sent 
to the monitor via serializers. One such application of a dual port 
serializer with changeable fonts is illustrated on page 2-275. 

PLDs which allow fast state machines are ideal candidates for 
such serializers. Due to the very high speed requirements, the 
most suitable PLDs are the registered "D" PAL devices at 55 MHz 
or the registered ECL PAL devices running at 125 MHz speed. 
Sometimes buried registers are also very useful for state machine 
designs. The serializer application on page 2-275 uses a 
PAL32VX10A as it provides buried registers for state machine 
functions. 

Look Up Table and Digital to Analog 
Converter 

The last stage of a graphics system is the look-up table (LUT) and 
digital-to-analog converter (DAC). The numbers of simultane- 
ously displayable colors or the gray scales is determined by the 
number of bits in a pixel. The LUT is used to increase the number 
of possible colors or gray scales that can be displayed on the 
monitor. 

The LUT (Figure 8) is afast RAM addressed by the pixels from the 
serializers. The data stored in the RAMs is used as the color or 
gray scale value. Since the data speeds can be very fast (as little 
as 8 ns for a 125 MHz display) and the static RAM access times 
are relatively slow, multiple static RAMs are used to store identical 
copies of look up tables. Different static RAMs are addressed for 
sequential pixels in order to maintain a fast overall data rate. 
Controllers made with such fast PAL devices as the 
PALI 0H20EV8 or PALI 6R8/6/4D can be used to spread sequen- 
tial pixels between various static RAMs for this function. The LUT 
output is then routed to the DAC stage of the circuit. 

The DAC is the only analog portion of the circuit, and is used to 
convert the color or gray scale data into an analog format usable 
by the video monitors. There are many standard monolithic and 
hybrid solutions available for such functions. 



PIXEL . 
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RAM 

256 X 12 




DATA 
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12 BITS 




NUMBER OF COLORS 
POSSIBLE = 4K 



Figure 8. The Lookup Table 
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This small system video controller is designed for an 80 character 
widetermina!displayof24rows(Figure1). The display resolution 
is 640 X 400 pixels, with a character box of 8 X 16 pixels. The 
character font used is a dot matrix of 7 X 9 pixels which is stored 
in a character PROM. Thef irst seven horizontal pixels display the 
character while the eighth pixel provides the horizontal character 
separation. Similarly thef irst nine lines display a visible character 
while the remaining seven are blank to provide line separation. 
The frame buffer is an SRAIVl which stores character ASCII codes 
of 80 X 24 characters and can be addressed by the system 
processor for update or by the display controller. 

The display controller provides all the necessary timing and 
addressing for supporting this display. Synchronized to the 
monitor timing signals, it generates the row and column ad- 
dresses (Figure 2), of the character being scanned. The row and 
column addresses access the character ASCII code from the 
frame buffer at the instant the character is displayed. This ASCII 
code addresses the character PROM which stores the 7 X 9 dot 
matrix font. Suppose the letter "A" is to be displayed (Figure 3). 
The contents of the character PROM pertaining to the current 
character row and column are read and loaded into the video shift 
register to be clocked out as a video signal. The video shift 
register must be seven-bits long, and the entire character is 



CHARACTER SIZE = 7X9 
CHARACTER BOX = 8X16 
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Figure 1. A Character Display 
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Figure 2. Functional Block Diagram 
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displayed over nine lines. To display a character 'A', the ASCII 
code of '41 ' hexadecimal selecting the letter is read from a frame 
buff er location. This character code will point to the base location 
in the PROM. This base location holds the first line of a table of 
nine locations which hold the dot information for each scan line. 
The line address counter first selects the RO line address, then the 
next line address R1 and then R2 to R8, until the entire character 
is displayed. 

The character column, row addresses and the line address can 
be generated by what is essentially one big counter. As can be 
seen from Figure 1 , there are eight pixels in a column, 80 columns 
in a line, and 16 lines in a row. The characters are counted by 
dividing the pixel clock (DOTCLK) by eight, which is the number 
of pixels in a character box. The character column addresses are 
generated by a seven bit counter which counts up to 80 character 
columns. For every eighty columns of character addresses a line 
address (a four-bit counter) is incremented. And finally for every 
1 6 line addresses a character row address is incremented. This 
counter is implemented in two different devices which also gen- 
erate related monitor timing signals. 

The implementation 

The functions of the display controller (Figure 4) are divided into 
three portions. The related tasks of generating the system 
HSYNC and BLANK signals and charactercolumn addresses are 
performed by a line sync generator. The tasks of generating the 
VSYNC signal and the line address and character row address 
are performed by the frame sync generator. Finally, the task of 
generating character period reference from the DOTCLK has 



been conveniently combined with the video shift register, which 
also uses this reference to shift out appropriate font pixels. The 
character period reference is also used by line sync and frame 
sync generators to count number of characters. 

The sequential frame buffer column address MA0-MA6 for dis- 
playing these 80 characters per row, is generated by the line sync 
generator (IC2 in Figure 4). The line sync generator is a counter 
clocked by DOTCLK with a count enable function under the 
control of a character period reference. This enables it to count 
the number of character periods. The line sync generator also 
generates HSYNC and BLANK signals at the end of each line. 

The character display is divided into 24 rows of 1 6 lines each. The 
four-bit line address (R0-R3) is used by the character PROM to 
generate the appropriate bits of font for the different lines in a 
character row. The line address to the character PROM is 
generated by the frame sync generator (1C3), using the BLANK 
signal as a new line reference. For every new line (BLANK signal) 
the line address counter is incremented. The frame sync genera- 
tor also generates the five-bit row address (A4-A8) for the 24 
different rows of characters stored in the frame buffer. This 
address is generated by a counter which is incremented once for 
every 1 6 counts of the line address counter. 

The final component is the video shift register (IC1) which 
serializes the character PROM seven-bit character line data. This 
function is accomplished by using a three-bit counter which 
divides the DOTCLKf requency by eight to generate the character 
period reference. 



LINE LINE 1 LINE 2 

00010 00 0010100 0100010 
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■ 
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LINE 3 LINE 4 LINE 5 

1000001 1111111 1000001 
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LINE 6 LINE 7 LINE 8 

1000001 1000001 1000001 



i 
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Figure 3. Character Generation 
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Line Sync Generator 



Addressing the Frame Buffer 



The PAL32VX10 used for line sync generation (Figure 5) has a 
versatile macrocell structure capable of transforming a basic D- 
type flip-flop into a more complex T-type flip-flop which efficiently 
implements counter designs. The seven-bit character column 
address counter uses seven of these T-type flip-flops. The line 
sync (HSYNC) signal is generated based on the value of the 
counter. The HSYNC signal initiates line flyback which separates 
the many lines of which a visual display is composed. The device 
also generates the blanking signal for monitor control. This 
design has been developed into state machine logic contained in 
thePAL32VX10. 



The frame buffer address lines MA0-MA6 are shared by a host 
processor which updates the memory with new characters. 
These addresses are generated by a binary character period 
counter. The only system clock conveniently available is the 
DOTCLK. Since the counter has to keep track of character 
periods. One easy way is to clock the device with DOTCLK, but 
enable count only when all three bits of the pixel counter S2, SI 
and SO are high. These three bits high indicate a character 
boundary; this will effectively result in clocking the counter every 
character period. 
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Figure 4. Small System Video Controller 
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PAL32VX1 USES J-K AND T FLIP-FLOP 
CONFIGURATIONS. 



The count is extended up to 1 07 to provide a timing reference to 
generate the monitor control signals (HSYNC and BLANK) also. 
Seven registers in the PAL32VX10 provide this count. The 
registers are programmed as T-type flip-flops (Figure 5) for opti- 
mizing the number of product terms used. For each scan line after 
reaching 1 07 the T-type counters are reset to provide the start of 
the next active line. Of the ten registered cells in the PAL32VX1 0, 
nine registers are used for addressing memory (MA0-MA6), line 
sync generation (HSYNC) and blanking control (BLANK). 

Since the processor can not read or write to the f ram e buffer at the 
same time as the display controller, the address lines (MA0-f«1A6) 
ofthe display controller are three-stated during aprocessorcycle. 
When processor signal /WR is asserted LOW, indicating a write 
cycle, the MA0-MA6 address outputs are three-stated to aliowthe 
processor address bus to drive the memory address. 

The line blanking signal (BLANK) is designed to go active low 
after 80 character cycles. This is followed by the line sync signal 
(HSYNC), which goes active eight character cycles later. This 
gives an eight character back porch to the display (see Figures 6 
& 7). The line flyback time (HSYNC width) of 1 6 blank characters 
ensures adequate time for the CRT line circuitry to stabilize 
before starting the next active line scan. The blanking signal is 
then turned off three character periods later to provide a three 
character front porch to the display. The duty cycle for one scan 
line is (80+8+1 6+3)=1 07 character periods. 

Selecting a DOTCLK of 1 6 MHz, and a character of eight bits, the 
resulting 2 MHz character cycle can be divided down to a line duty 
cycle of: 

HSYNC Line frequency = Character freq/Line duty cycle 
= 2000 KHz /1 07 
= 18.7 KHz 

Video data in the frame buffer can be updated by the processor 
only for durations when video is not displayed. A BLANK signal 
is a good indicator to the processor for when to begin and stop 
updates. Most processors, however, need an advance warning, 
preferably a few microseconds earlier. 

The RDY signal is used to provide this advance warning to the 
processor. When HIGH, the RDY signal allows data modification 
in the frame buffer. This flag is turned off before the removal of the 
blanking pulse (BLANK) allowing a margin of four micro-seconds 
to elapse at the end of the blanking period. It is assumed that four 
microseconds is ample time to test the flag and perform the last 
write operation. However, the time taken for a processor to read 
this flag and respond can be programmed into the PAL device 



Figure 5. Schematic Diagram of Line Address 
Generator Circuit 
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depending upon the software requirements of the system. The 
RDY signal is also set during vertical retrace when all processor 
updates are allowed and no display controller accesses are 
required. The design uses the system VSYNC for accomplishing 
this. 

The RDY signal is also disabled unless it is being read by the 
processor. This is done because the RDY signal is usually 
directly connected to the processor data bus which is used to 
carry data from different sources (drivers). The disabling is done 
using the CS chip-select signal asserted by the processor. If CS 
is driven LOW, the RDYoutput buffer is enabled and the RDY flag 
may be read by the processor. 



Frame Sync Generator 

Another PAL32VX10 (Figures 8 & 9) is programmed to perform 
the function of a frame sync generator (VSYNC), with the ability 
to access the character PROM and the frame buffer. The VSYNC 
signal controls frame flyback for the CRT controller. Each frame 
of displayed data is terminated, and the electron beam in the CRT 
performs a retrace operation prior to the display of the next frame. 
Clocked by the system BLANK signal, it uses a counter to 
generate the row address for the character being displayed. It 
also generates the line address to access the dot pattern for each 
scan line, stored in character PROM. 
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Figure 6. Blanking and Sync. Signals. 
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LINE SYNC GENERATOR. 

02. 

02. 

CHRIS JAY. 

MMI SANTA CLARA, CA. 

20TH AUGUST 1986. 



THE PAL32VX10 HAS BEEN DESIGNED AS A LINE SYNC GENERATOR FOR A 
THREE PAL SOLUTION TO A VIDEO DISPLAY CONTROLLER. THE DEVICE 
HAS BEEN PROGRAMMED TO GENERATE ADDRESSING TO ACCESS A CHARACTER 
RAM MAO - MA6 ARE CONFIGURED TO ACCESS ASCCII CHARACTERS STORED IN 
THE RAM FOR AN EIGHTY CHARACTER WIDE DISPLAY. THESE ADDRESS OUTPUTS 
MAY BE PUT INTO A THREE STATE CONDITION IF THE HOST PROCESSOR 
REQUIRES TO CHANGE THE CONTENTS OF THE DISPLAYED MEMORY. WITH 
THE /WR INPUT ACTIVE LOW THE HOST PROCESSOR MAY DRIVE THE 
ADDRESS INPUTS TO ACCESS AND MODIFY THE CONTENTS OF THE VIDEO RAM. 
THE INTERFACE SIGNALS TO THE VIDEO SHIFT REGISTER (PAL DESIGN 01 
OF THE THREE PAL DESIGNS) ARE S0,S1,S2, AND BLANK. WHEN S0*S1*S2 
ARE HIGH THE ADDRESS OUTPUTS MAO - MA6 INCREMENT SYNCHRONOUSLY 
AS A RESULT OF THE DOT CLOCK. SO, SI AND S2 ARE STATE MACHINE 
OUTPUTS FROM THE VIDEO SHIFT REGISTER THAT CONTROL THE LOADING 
AND SHIFTING OF PIXEL DATA. THE BLANK OUTPUT IS ACTIVE HIGH AND 
IS FED TO THE VIDEO SHIFT REGISTER TO PROVIDE A BLANKING SIGNAL 
DURING LINE SYNC FLYBACK WITH A MARGIN FOR FRONT AND BACK PORCH. 
A RDY REGISTER IS AVAILABLE TO INDICATE THAT THE SCREEN IS BLANK 
DURING LINE AND FRAME FLYBACK PERIODS AND VIDEO RAM MAY BE UPDATED. 
THE OUTPUT IS 3-STATED AND IS ENABLED BY AN ACTIVE /CS INPUT. 

CHIP VIDE0_1 PAL32VX10 

LINE SYNC GENERATOR PAL 



PINS 


1 
CLK 


2 

/RST 


3 
SO 


4 
SI 


5 
S2 


6 
/VSYNC 


PINS 


7 
NC 


8 
NC 


9 

NC 


10 
NC 


11 
/WR 


12 
GND 


PINS 


13 
/CS 


14 
BLANK 


15 
MAO 


16 

MAI 


17 
MA2 


18 
MA3 


PINS 


19 

MA4 


20 
MAS 


21 
MA6 


22 

HSYNC 


23 
RDY 


24 
VCC 



GLOBAL QO Ql Q2 Q3 Q4 Q5 Q6 Q7 QB 09 
STRING HEX6B 'Q7*Q6*/Q5*04*/Q3*Q2*Q1 ' 



STRING HEX67 
STRING HEX6 3 
STRING HEX57 
STRING HEX50 
STRING CEI 
EQUATIONS 



•Q7*Q6*/Q5*/Q4*Q3*Q2*Q1 ' 
'Q7*Q6*/Q5*/Q4*/Q3*Q2*Q1 ' 
'Q7*/Q6*Q5*/Q4*Q3*Q2*Q1 ' 
•Q7*/Q6*Q5*/Q4*/Q3*/Q2*/Q1 ' 
•S0*S1*S2' 



GLOBAL. SETF = RST 
/QO : = 



BLANK 
BLANK . CMBF 



MAO 

MAO. CMBF = 

MAO.TRST = 

/Q2 : = 



MAI 

MAI. CMBF 

MAl.TRST 



/QO 
/Q0*HEX50*CEI 

Q0*HEX6B*CEI 

QO 
GND 



; STRING DECLARATIONS. 
; CLEAR ADDRESS COUNTER 
;AND TURN BLANK OFF 
;TURN LINE SYNC OFF 
;TURN RDY OFF 
;TURN LINE SYNC ON 
;TURN BLANK SIGNAL ON 
; COUNT ENABLE INPUT 



/Ql := /Ql ; COUNT EQUATION CEI = T 

:+: CEI ;TOGGLE. HEX6B*CEI = K 

Q1*HEX6B*CEI ;T0 RESET MAO AFTER ACTIVE 

Ql .-LINE. ASSIGN MAO TO Ql 

GND ; REGISTER. 

/WR ; ENABLE THREE-STATE 

DURING MPU/RAM ACCESS. 

/Q2 ; COUNT EQUATION Q2 

Q1*CEI ;Q1*CEI = T FUNCTION 

Q2*HEX6B*CEI ;HEX6B*CEI = K 

Q2 ; ENABLE Q2 REGISTER TO 

GND ;MA1 OUTPUT. CONTROL 

/WR ; THREE-STATE DURING 
MPU/RAM ACCESS. 

Figure 7. Design File of Line Sync Generator 



ASYNCHRONOUS RESET 

J - K EQUATION FOR 

BLANK OUTPUT. 

HEX50 = J. BLANK ON 

HEX6B = K. BLANK OFF 

ASSIGN QO TO BLANK PIN 

ENABLE REGISTERED OUTPUT 
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/Q3 


= 


/Q3 




+ : 


Q2*Q1*CEI*/Q7 




+ 


Q2*Q1*CEI*/Q6 




+ 


Q2*Q1*CEI*Q5 




+ 


Q2*Ql*CEI*/04 




+ 


Q2*Q1*CEI*Q3 


MA2 


= 


03 


HA2.CMBF 


= 


GND 


MA2.TRST 


= 


/WR 


/Q4 


= 


/Q4 




+ : 


03*Q2*Q1*CEI 




+ 


Q4*HEX6B*CEI 


MA3 


= 


Q4 


MA3.CMBF 


= 


GND 


MA3.TRST 


= 


/WR 


/Q5 


= 


/Q5 




+ : 


Q4*Q3*Q2*Q1*CEI 




+ 


Q5*HEX6B*CEI 


MA4 


= 


Q5 


MA4.CMBF 


= 


GND 


MA4.TRST 


= 


/WR 


/Q6 


= 


/Q6 




+ : 


Q5*Q4*Q3*Q2*Q1*CEI 




+ 


Q6*HEX6B*CEI 


MAS 


= 


Q6 


HA5.CMBF 


= 


GND 


MA5.TRST 


= 


/WR 


/Q7 


= 


/Q7 




+ : 


Q6*Q5*Q4*Q3*Q2*Q1*CEI 




+ 


Q7*HEX6B*CEI 


MA 6 


= 


Q7 


MA6.CMBF 


= 


GND 


MA6.TRST 


= 


/WR 


/Q8 


= 


/Q8 




+ : 


/Q8*HEX57*CEI 




+ 


Q8*HEX67*CEI 


/HSYNC 


= 


/Q8 


/HSYNC.CMBF = 


GND 


/Q9 


= 


/Q9 




+ ; 


/Q9*HEX50*CEI*/VSYNC 




+ 


/Q9*CEI*VSYNC 




+ 


Q9*HEX63*CEI*/VSYNC 


RDY 


= 


Q9 


RDY.CMBF 




GND 


RDY.TRST 


= 


CS 



SIMULATION 

TRACE_ON CLK RST SO SI S2 MAO MAI MA2 

MA3 MA4 MAS MA6 HSYNC BLANK 

RDY /VSYNC 
SETF WR /VSYNC 



SETF 


/CLK RST SO SI S2 /WR CS 


CLOCKF 


CLK 


SETF 


/RST 


PRLDF 


/QO /Ql /Q2 /Q3 Q4 




/Q5 /Q6 Q7 /OS /Q9 


FOR I : 


= 1 TO 60 DO 


BEGIN 




CLOCKF 


CLK 


END 




SETF VSYNC 


FOR I : 


= 1 TO 8 DO 


BEGIN CLOCKF CLK 


END 




SETF 


/SO 


CLOCKF 


CLK 


SETF 


/SI 


CLOCKF 


CLK 


SETF 


/S2 


CLOCKF 




TRACE OFF 



COUNT EQUATION Q3 
Q2*Q1*CEI = T FUNCTION 
COUNT ENABLE UNTIL 
END OF LINE BLANKING 
PERIOD. DE MORGAN OF 
EQU Q7*Q6*/Q5*Q4*/Q3=K 
/Q9 = RESET FUNCTION. 
ASSIGN 03 TO MA2 
REGISTERED OUTPUT. 
CONTROL O/P WITH /WR 
COUNT EQUATION 04 
Q3*Q2*01*CEI = T. 
04*HEX6B*CEI = K. 
ASSIGN 04 TO MA3. 
ENABLE REGISTER O/P. 
THREE - STATE CONTROL. 

COUNT EQUATION Q5 . 
Q4*Q3*Q2*Q1*CEI = T. 
Q5*HEX6B*CEI = K. 
ASSIGN Q5 TO MA4. 
ENABLE REGISTER O/P. 
THREE - STATE CONTROL. 

COUNT EQUATION 06. 
Q5*Q4*Q3*Q2*01*CEI = T. 
06*HEX6B*CEI = K. 
ASSIGN 06 TO MAS. 
ENABLE REGISTER O/P 
THREE - STATE CONTROL 

COUNT EQUATION Q7 . 
06*Q5*Q4*Q3*Q2*01*CEI = 
07*HEX6B*CEI = K. 
ASSIGN 07 TO MA6. 
ENABLE REGISTER O/P 
THREE - STATE CONTROL 

J - K EQUATION FOR Q8 
HEX57*CEI = J. 
HEX67*CEI = K. 
ASSIGN 08 TO HSYNC. 
ENABLE REGISTER O/P 
J - K EQUATION FOR Q9 
RDY STATUS 

FLYBACK. HEX50*CEI = Jl 
AND FRAME = J2 
HEX63*CEI = Kl RESET 
AFTER LINE AND FRAME 
ASSIGN RDY TO 09 
ENABLE REGISTER O/P 
WHEN /CS = LOW RDY 
OUTPUT IS ENABLED. 
START OF SIMULATION. 
TRACE ESSENTIAL I/P 
AND O/P SIGNALS. 



TEST THREE - STATE 
CONTROL. INITALISE 
INPUTS. PERFORM 
RESET. AND PRELOAD 
REGISTERS. 

APPLY 60 CLOCK PULSES 
TO TRACE THE MEMORY 
ADDRESS COUNTER, HSYNC, 
BLANK AND CEO. 

TEST THAT AN ACTIVE 

FRAME WILL SET THE 

RDY FLAG. 

DISABLE COUNTER BY 

SETTING SO, SI AND S2 

LOW. 



END OF SIMULATION. 



Figure 7. Design File of Line Sync Generator (Cont'd.) 
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RESET 
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^- 



T Q 
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H^- 



T 
b Q 



^- 
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^ 
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^ 
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> Q 
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J O —\^ VSYNC 



K Q 



The PALdevice is programmed as acounter. The least significant 
four bits count through sixteen line addresses R0-R3 for each 
character row. The higher significant bits of the counter track the 
24 rows of characters. The frame sync pulse is generated on the 
25th character row and frame flyback is turned on for sixteen line 
count periods, which is the duration of the twenty-fifth row. With 
24 character rows, each having sixteen lines, the system gave 
384 visible lines. Since the frame flyback occurs over sixteen 
lines, the total number of lines in the system is (384+16)= 400. 
The frame repetition rate is derived by dividing the line frequency 
by the total number of lines in the system. 



Frame repetition rate 



18.7X10*3 HZ/400 (1) 

46.75 Hz 



The video terminal used for testing the final design functioned 
best at this frame repetition rate. If desired, other frame repetition 
rates can also be selected. For example a thirteen character row 
with nine rows active and the same 7X9 character box would give 
a 57 Hz frame rate. Most video terminals have a synchronizing 
circuit that determines the choice of frame frequency. This is 
similar to a phase-locked loop, providing a window of acceptable 
frame oscillator frequencies to which the display will be locked. 
Outside of this capture range, the display will not be synchro- 
nized, and the result will look like a rolling unstable picture. 

Signal Assignments 

The A/SYNC output is an active LOW signal. When displaying 
data this output is driven HIGH. As with the line sync generator 
design, the /WR input is driven from the host processor system. 
When LOW, the video memory address outputs MA7-MA11 are 
disabled. The host system may enable its output buffers and drive 
the frame buffer's address inputs directly to modify video data. 

For normal counting, the CEJ input should be driven LOW. ATST 
input was provided to minimize the number of test vectors 
required to test the device. This pin is driven LOW for normal 
operation. 



PAL32VX1 USES J-K AND T FLIP-FLOP 
CONFIGURATIONS. 



Figure 8. Schematic of Frame Sync Generator 
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FRAME SYNC GENERATOR. 

02 

02. 

CHRIS JAY. 

MM I SANTA CLARA, CA . 

20TH AUGUST 1986. 



CHIP VIDEO_l PAL32VX10 

FRAME SYNC GENERATOR PAL 

THE PAL 32VX10 IS THE THIRD OF A THREE PAL SYSTEM DESIGNED 
AS A SIMPLE VIDEO CONTROLLER. THE DEVICE GENERATES ADDRESSING 
FOR A CHARACTER MEMORY AND CPU RAM. ALSO A FRAME SYNC PULSE 
/VSYNC IS USED TO PROVIDE FRAME FLYBACK AT THE END OF ONE 
COMPLETE FRAME. THERE ARE FOUR ROW OUTPUTS FROM THE PAL WHICH 
ADDRESS THE CHARACTER MEMORY. THESE OUTPUTS ARE DRIVEN FROM 
AN INTERNAL COUNTER WHICH IS INCREMENTED AFTER EACH LINE 
SCAN. THE R0,R1,R2 AND R3 INCREMENT AS A SIXTEEN BIT COUNTER 
ADDRESSING CHARACTER MEMORY. MA7 - MAll ADDRESS VIDEO MEMORY 
ACCESSING THE ASCII CODES OF THE CHARACTER TO BE DISPLAYED 
ON EACH ROW OF THE VIDEO DISPLAY. THE COUNTER MA7 - MAII 
IS INCREMENTED AFTER ONE COMPLETE CHARACTER SCAN THAT IS 
SIXTEEN LINES. THE /WR INPUT CAN BE DRIVEN BY THE HOST CPU 
TO 3 - STATE MA7 - MAll OUPUTS AND DYNAMICALLY CHANGE THE 
CONTENTS OF THE VIDEO RAM. THIS INPUT CAN BE MAPPED INTO 
HOST PROCESSOR'S MEMORY ADDRESS RANGE. UPDATING RAM SHOULD 
BE DONE BY THE HOST DURING LINE AR FRAME FLYBACK. THE PAL 
DEVICE MAY BE RESET AT POWER ON AND /CEI SHOULD BE WIRED LOW 
TO ENABLE THE ADDRESS COUNTERS. TST IS HARDWIRED LOW, THIS 
PIN WAS USED AS A TEST INPUT PULLED HIGH TO DISABLE SOME OF 
LOWER ORDER COUNTER REGISTERS WHEN GENERATING TEST VECTORS 
OF THE TEST VECTORS. THIS AVOIDED GENERATING UNECESSARY 
REPETATIVE LOWER ORDER COUNTS WHEN TESTING THE COUNT 
EQUATIONS FOR HIGHER ORDER REGISTERS IN THE COUNTER 
CHAIN. 



PINS 
;PINS 
;PINS 
;PINS 



1 
CLK 



7 
NC 



13 

NC 



19 



2 
/RST 



14 
MA7 



20 



3 
/CEI 



9 
NC 



15 
MAS 



21 
Rl 



4 
NC 



10 
/WR 



16 
MA9 



22 
R2 



5 
NC 



11 

TST 



17 
MAIO 



23 
R3 



6 

NC 



12 
GND 



18 

MAll 



24 
VCC 



/VSYNC RO 
GLOBAL QO Ql 02 Q3 Q4 Q5 Q6 Q7 08 Q9 
STRING VARIABLE DECLARATIONS FOR A NON-INTERLACED SYSTEM. 



STRING HEX18NI " Q9*Q8*Q7*Q6*Q4*Q3*/Q2*/Q1 */Q0 ' 
STRING HEX19NI " Q9*Q8*Q7*Q6*Q4*Q3*/Q2*/Q1 *Q0 ' 
STRING ROWNI '06*07*08*09' 



SET VSYNC 
CLEAR VSYNC 
ROW COMPLETE 



EQUATIONS 



GLOBAL. SETF 



/QO 



: + : 



MA7 

MA7.CMBF = 
MA7.TRST = 

/Ql : = 

: + : 
MAS 

MA8.CMBF = 
MA8.TRST = 



/QO 

CEI*ROWNI 

CEI*HEX19NI 

QO 

GND 
/WR 

/Ql 

Q0*CEI*ROWNI*/Q5 

Ql 

GND 
/WR 



RESET AT POWER UP 

COUNT EQUATION. 

TOGGLE WHEN CEI*ROWNI TRUE 

RESET AFTER FRAME SCAN. 

ENABLE MA7 OUTPUT 

SET REGISTER MODE 

THREE - STATE 0/P WHEN /WR 

ACTIVE. 

COUNT EQUATION Ql 

ENABLE TOGGLE 

ENABLE MAS OUTPUT. 

SET REGISTER MODE. 

THREE - STATE O/P WHEN /WR 

ACTIVE. 



Figure 9. Design Fiie of Frame Sync Generator 
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/Q2 


: = 


/Q2 




: + : 


Q1*QO*CEI*ROWNI 




+ 


Q2*HEX19NI*CEI 


MA9 


= 


Q2 


MA9.CMBF 


= 


GND 


MA9.TRST 


= 


/WR 


/Q3 


: = 


/Q3 




: + : 


Q2*Q1*QO*CEI*ROWNI 




+ 


Q3*HEX19NI*CEI 


MAIO 


= 


Q3 


MAIO.CMBF = 


GND 


MAIO.TRST = 


/WR 


/Q4 


. = 


/Q4 




: + : 


Q3*Q2*Q1*QO*CEI*ROWNI 




+ 


Q4*HEX19NI*CEI 


MAll 


= 


Q4 


MAll.CMBF = 


GND 


MAll.TRST = 


/WR 


/Q5 


: = 


/Q5 




: + : 


/Q5*HEX18NI*CEI*ROWNI 




+ 


Q5*HEX19NI*CEI*ROWNI 


/VSYNC 


= 


/Q5 


/VSYNC.CMBF = 


GND 


/Q6 


: = 


/Q6 




; + : 


CEI 


RO 


= 


Q6 


RO.CMBF 


= 


GND 


/Q7 


: = 


/Q7 




: + : 


Q6*CEI*TST 




+ 


/Q7VTST 


Rl 


= 


Q7 


Rl.CMBF 


= 


GND 


/Q8 


: = 


/Q8 




: + : 


Q6*Q7*CEI*TST 




+ 


/Q8*/TST 


R2 


= 


Q8 


R2.CMBF 


= 


GND 


/Q9 


: = 


/Q9 




: + : 


Q6*Q7*Q8*CEI*TST 




+ 


/Q9*/TST 


R3 


= 


Q9 


R3.CMBF 


= 


GND 



SIMULATION 

TRACE_ON CLK RST MA7 MAS MA9 MAIO 

MAll /VSYNC RO Rl R2 R3 
SETF /CLK RST CEI /WR 
CLOCKF CLK 
SETF /RST /TST 
PRLDF QO Ql Q2 /Q3 Q4 

/Q5 /Q6 /Q7 /Q8 /Q9 

I := 1 TO 40 DO 

CLOCKF CLK 



FOR 

BEGIN 
END 



SETF TST 

PRLDF QO Ql Q2 /Q3 Q4 

/Q5 /Q6 /Q7 Q8 Q9 



FOR 

BEGIN 

END 

SETF 

FOR 

BEGIN 

END 

TRACE OFF 



I := 1 
CLOCKF 



/CEI 
I := 1 
CLOCKF 



TO 40 DO 
CLK 



TO 8 DO 
CLK 



COUNT EQUATION Q2 

ENABLE TOGGLE 

RESET AFTER FRAME SCAN 

ENABLE MA9 OUTPUT. 

SET REGISTER MODE, 

THREE - STATE O/P WHEN /WR 

ACTIVE. 

COUNT EQUATION Q3 

ENABLE TOGGLE 

RESET AFTER FRAME SCAN 

ENABLE MAIO OUTPUT. 

SET REGISTER MODE 

THREE - STATE O/P WHEN /WR 

ACTIVE. 

COUNT EQUATION Q4 

ENABLE TOGGLE 

RESET AFTER FRAME SCAN 

ENABLE MAll OUTPUT 

SET REGISTER MODE 

THREE - STATE O/P WHEN /WR 

ACTIVE. 

J - K VSYNC O/P 

J EQUATION TO SET VSYNC 

K EQUATION TO RESET VSYNC 

ENABLE Q5 TO VSYNC. 

SET REGISTER MODE 

COUNT EQUATION FOR Q6 

TOGGLE 

FRAME SYNC. ENABLE Q6 

REGISTER OUTPUT FOR 

ROW 0. 

COUNT EQUATION FOR Q7 

ENABLE TOGGLE 

DISABLE COUNT DURING TEST. 

ENABLE Q7 REGISTER OUTPUT 

FOR ROW 1. 

COUNT EQUATION FOR Q8 
ENABLE TOGGLE 

DISABLE COUNT DURING TEST. 
ENABLE Q8 REGISTER OUTPUT 
FOR ROW 2. 

COUNT EQUATION FOR Q9 
ENABLE TOGGLE 

DISABLE COUNT DURING TEST. 
ENABLE Q9 REGISTER OUTPUT 
FOR ROW 3. 

SIMULATION SECTION 
TRACE ESSENTIAL SIGNALS 

SET INITIAL CONDITIONS 
ENABLE COUNT ON 
DISABLE ROW Rl, R2 , R3 . 
PRELOAD INITIAL STATE 
TWO COUNTS BEFORE FRAME 
SYNC. CLOCK INPUT TO VIEW 
WAVEFORMS OF ROW 0, VSYNC 
AND MEMORY ADDRESSES MA7 TO 
MAll. 

DISABLE TEST MODE, THE ROW 
COUNTERS ARE ENABLED. 
PROVIDE CLOCK INPUTS TO 
GENERATE TEST VECTORS TO 
VERIFY THE NORMAL COUNT 
OPERATION OF THE ROW 
ADDRESS COUNTER. 
DISABLE COUNT NODE AND 
APPLY CLOCK INPUT TO 
VERIFY THE COUNTER 
HOLD OPERATION. 



Figure 9. Design File of Frame Sync Generator (Cont'd.) 
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Video Shift Register 

The third component of this video system (Figures 10 & 11), 
provides a means of converting parallel data into a high speed 
serial data stream. The PAL32VX1 implements one such video 
shift register, where it receives parallel data from a character 
memory and clocks the serial data out at a rate determined by the 
pixel clock (DOTCLK). The conversion of parallel character data 
to a serial stream is synchronized to the character period of the 
system. The character period is generated by dividing the 
DOTCLK by eight and this circuit has been conveniently imple- 
mented in the same device. The design also offers blanking 
controls and a reverse video option. 

The PAL32VX1 contains ten registers, three of which are used 
as a binary counter which synchronously counts through eight 
states at a rate determined by the DOTCLK of the video system. 
This count to eight determines a character period reference since 
there are eight pixels (dots) in every character. The outputs 
S0-S2 are used by the rest of the system as a character period 
reference. 

Thethree-bitoutputS2,S1 and SO of thecharacterperiod counter 
are also used internally as a timing reference for loading new 
character data. When all three S2, SI and SO are high, the 
remaining seven registers of the device 01-07 are loaded with 
the seven bit character data from inputs D0-D6. The registers 
used as a counter are Q8, 09 and 01 0. 01 0, the least significant 
bit, is assigned to the output pin SO, 09 is assigned to pin SI ; and 
08 is assigned to pin 32. 

The "new character" load command is given as a string statement 
in the design specification: STRING LOAD '08*09*010'. 

This parallel load is followed by seven shift states when the data 
is transferred from one register to another. The binary sequence 
of 0-6 at counter S2-S0 enable this synchronous data shift 
through registers Q1-07, 07 being the last register in the chain. 
The VIDEO output of the PAL device is therefore assigned to the 
last register 07. 

The shift command is devel oped from the DeMorgan expansion 
of NOT(LOAD): 08 + 09 + 01 0. 

Additional Control Features 

The video shift register has a number of inputs which control the 
serial output of the video data stream. The BLANK input, when 
asserted, inhibits the video, causing it to go LOW independent of 
activity at the parallel data inputs. It is necessary to use the 
BLANK sign al to ove rride active video during line flyback or 
retrace. The VSYNC input also inhibits the loading of video data 
and suppresses video drive during frame flyback. 



The reverse video input REV can be used to create an inverse 
video display. For normal video this input is LOW. The characters 
displayed on the video screen will be bright on a dark background. 
When the reverse video control is set HIGH, the video data is 
inverted when it is loaded. The display background is then bright, 
and the displayed characters are dark. The BLANK and VSYNC 
controls have the same effect of driving the video output LOW 
during line and framef lyback whetheror notthe video is reversed. 

THREE BIT BINARY COUNTER 





LOAD SHIFT 
DIN 01 

. — ^ POUT RST b - 



LOAD SHIFT 
DIN 02 



. >- t> DOUT RST }f > 



LOAD SHIFT 
DIN 03 

. >- b DOUT RST 



LOAD SHIFT 
DIN 04 

> DOirr RST 



LOAD SHIFT — 
DIN 05 

> DOUT RST b - 



LOAD SHIFT — 
DIN 06 

. >- |> DOUT RST p - 



LOAD SHIFT 
DIN 07 

M> POUT RST t>- 



Figure 10. Schematic of Video Shift Register 
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TITLE 

PATTERN 

REVISION 

AUTHOR 

COMPANY 

DATE 



VIDEO SHIFT REGISTER. 

01 

02. 

CHRIS JAY. 

MMI SANTA CLARA, CA. 

20TH AUGUST 1986. 



THE PAL32VX10 HAS BEEN DESIGNED AS A VIDEO SHIFT REGISTER 
TO SUPPORT A THREE PAL SOLUTION TO A SMALL VIDEO CONTROLLER 
CIRCUIT. SEVEN INPUTS FROM THE CHARACTER GENERATOR ARE LOADED 
INTO THE INTERNAL SHIFT REGISTER THROUGH INPUTS Dl - D7 , 
AND SUBSEQUENTLY SHIFTED THROUGH IT. AN INTERNAL THREE BIT 
BINARY COUNTER CONTROLS THE LOAD AND SHIFT ACTIVITY. WHEN 
THE BINARY OUTPUTS SO, SI AND S2 ARE ALL HIGH THE SHIFT 
REGISTER IS LOADED WITH NEW DATA FROM THE CHARACTER MEMORY, 
SHIFTING TAKES PLACE FOR ALL THE OTHER BINARY STATES. 
THE DATA IS LOADED AND SHIFTED SYNCHRONOUSLY, AT A RATE 
DETERMINED BY THE APPLIED CLOCK INPUT. THE DEVICE MAY BE 
ASYNCHRONOUSLY RESET BY APPLYING A LOGIC LOW TO THE /RST PIN. 
THERE IS A REV INPUT, WHICH CAUSES THE DATA TO BE REVERSED 
FOR REVERSE VIDEO. THIS INPUT IS HELD LOW FOR NORMAL DATA. 
THE BLANK AND /VSYNC CONTROLS ARE FEEDBACK INPUTS FROM THE 
LINE SYNC AND FRAME SYNC PAL DEVICE OUTPUTS. THESE CAUSE THE 
VIDEO DATA OUTPUT TO BE INACTIVE DURING LINE AND FRAME SYNC. 
PERIODS. THE SO, SI AND S2 OUTPUTS DRIVE THE LINE AND FRAME 
SYNC PAL32VX10 AND ACT AS COUNT ENABLE INPUTS TO THOSE 
DEVICES. 

CHIP VIDE0_1 PAL32VX10 

VIDEO SHIFT REGISTER PAL 



PINS 
;PINS 
;PINS 
;PINS 



1 
CLK 



7 
D5 



13 
REV 



19 
NC 



2 

/RST 



D6 



14 
NC 



20 
VIDEO 



3 
Dl 



9 
D7 



15 

NC 



21 
S2 



4 


5 


6 


D2 


D3 


D4 


10 


11 


12 


BLANK 


/VSYNC 


GND 


16 


17 


18 


NC 


NC 


NC 


22 


23 


24 


SI 


SO 


VCC 



GLOBAL 

STRING 

EQUATIONS 

GLOBAL. SETF 

/QIO := 
SO 
SO.CMBF = 

/Q9 : = 

: + : 
SI 
Sl.CMBF = 

/Q8 : = 



S2 
S2.CMBF 



Ql Q2 Q3 04 Q5 Q6 Q7 
LOAD 'Q8*Q9*Q10' 



QIO 
QIO 
GND 

/Q9 

QIO 

Q9 
GND 

/Q8 

Q10*Q9 

QB 

GND 



Q8 Q9 QIO 

ENABLE LOADING OF CHARACTER DATA. 



ASYNCHRONOUS RESET. 

INTERNAL STATE COUNTER 
LEAST SIGNIFICANT BIT QIO. 
ENABLE REGISTER TO PIN 
SO. 

INTERNAL STATE COUNTER 
QIO = HIGH = TOGGLE. 
ENABLE REGISTER OUTPUT 
TO PIN SI. 



INTERNAL STATE COUNTER 
Q10*Q9 = HIGH = TOGGLE. 
ENABLE REGISTER OUTPUT 
TO PIN S2. 



/Ql 



/D1*L0AD*/REV* /BLANK */VSYNC 
D1*L0AD*REV* /BLANK */VSYNC 

/Q8*/REV 

/Q9*/REV 

/QIO* /REV 
BLANK 
VSYNC 



LOAD DATA INPUT Dl . 
LOAD DATA INPUT /Dl . 
SHIFT HIGH INTO Ql . 



BLANK DATA DURING 
LINE AND FRAME SYNC 
INPUTS. 



Figure 11. Design File of Video Shift Register 
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/Q2 



/Q3 



/Q4 



/Q5 



/D2*L0AD*/REV*/BLANK*/VSYNC 
D2*L0AD*REV*/BLANK*/VSyNC 

/Q1VQ8 

/Q1VQ9 

/Ql*/Q10 
BLANK 
VSYNC 

/D3*L0AD*/REV*/BLANK*/VSYNC 
D3*L0AD*REV*/BLANK*/VSYNC 

/Q2VQ8 

/Q2VQ9 

/Q2VQ10 
BLANK 
VSYNC 

/D4*L0AD*/REV*/BLANK*/VSYNC 
D4*L0AD*REV*/BLANK*/VSYNC 

/Q3VQ8 

/Q3VQ9 

/Q3VQ10 
BLANK 
VSYNC 

/D5*L0AD*/REV*/BLANK*/VSYNC 
D5*L0AD*REV*/BLANK*/VSYNC 

/Q4VQ8 

/Q4VQ9 

/Q4*/Q10 
BLANK 
VSYNC 



/Q6 




: = 


/D6*L0AD*/REV*/BLANK*/VSYNC 






+ 


D6*L0AD*REV*/BLANK*/VSYNC 






+ 


/Q5VQ8 






+ 


/Q5VQ9 






+ 


/Q5VQ10 






+ 


BLANK 






+ 


VSYNC 


/Q7 




: = 


/D7*L0AD*/REV*/BLANK*/VSYNC 






+ 


D7*L0AD*REV*/BLANK*/VSYNC 






+ 


/Q6VQ8 






+ 


/Q6VQ9 






+ 


/Q6*/Q10 






+ 


BLANK 






+ 


VSYNC 


/VIDEO 






= /Q7 


/VIDEO 


.CMBF 


= GND 


SIMULATION 




TRACE ON 




CLK SO SI S2 








VIDEO /VSYNC BLANK 
REV Ql Q2 Q3 Q4 Q5 
Q6 Q7 01 D2 D3 D4 D5 
D6 D7 RST 


SETF 






/CLK RST REV 
/Dl /D2 D3 D4 
D5 /D6 /D7 
BLANK /VSYNC 


SETF 






/RST 


FOR J 


= 


TO 


5 DO 


BEGIN 








IF J = 


1 


THEN 




BEGIN SETF VSYNC /BLANK 


END 









LOAD DATA INPUT D2. 
LOAD DATA INPUT /D2 . 
SHIFT DATA FROM Ql 
INTO Q2, FOR SIX 
BINARY COUNT PERIODS. 
BLANK DURING LINE 
AND FRAME PERIODS. 

LOAD DATA INPUT D3 . 
LOAD DATA INPUT /D3 . 
SHIFT DATA FROM Q2 
INTO Q3, FOR SIX 
BINARY COUNT PERIODS. 
BLANK DURING LINE 
AND FRAME PERIODS. 

LOAD DATA INPUT D4 . 
LOAD DATA INPUT /D4 . 
SHIFT DATA FROM Q3 
INTO Q4, FOR SIX 
BINARY COUNT PERIODS. 
BLANK DURING LINE 
AND FRAME PERIODS. 

LOAD DATA INPUT D5 . 
LOAD DATA INPUT /D5 
SHIFT DATA FROM Q4 
INTO Q5, FOR SIX 
BINARY COUNT PERIODS. 
BLANK DURING LINE 
AND FRAME PERIODS. 

LOAD DATA INPUT D6 . 
LOAD DATA INPUT /D6 
SHIFT DATA FROM Q5 
INTO Q6, FOR SIX 
BINARY COUNT PERIODS. 
BLANK DURING LINE 
AND FRAME PERIODS. 

LOAD DATA INPUT D7 
LOAD DATA INPUT /D7 
SHIFT DATA FROM Q6 
INTO Q7, FOR SIX 
BINARY COUNT PERIODS. 
BLANK DURING LINE 
AND FRAME PERIODS. 
ENABLE REGISTER 
OUTPUT Q7 TO VIDEO 
PIN. 



START OF SIMULATION 
TRACE ALL ESSENTIAL 
INPUTS AND OUTPUTS. 



SET INITIAL CONDITIONS. 
APPLY ASYNCHRONOUS 
RESET. SET REVERSE 
VIDEO INACTIVE AND 
CREATE ACTIVE BLANK 
AND REMOVE RESET. 
INPUTS GENERATE FIVE 
LOOPS . 

SET VSYNC TO TEST 
BLANKING. 



Figure 11. Design File of Video Shift Register (Cont'd.) 



.Giliill. 



MonoUihlo UUHU Memories 



2-273 



Small System Video Controller 



IF J = 2 THEN 

BEGIN SETF /REV /VSYNC BLANK 

END 

IF J = 3 THEN 

BEGIN SETF Dl /D2 D3 /D4 D5 

/D6 D7 /BLANK /REV 
END 

IF J = 4 THEN 
BEGIN SETF REV 
END 

FOR I := 1 TO 8 DO 
BEGIN 

CLOCKF CLK 
END 
END 
TRACE OFF 



TEST ACTIVE BLANK 
INPUT. REMOVE ACTIVE 
VSYNC. CLEAR ALL 
BLANKING CONTROLS. 
TEST THE LOADING 
AND SHIFTING OF 
DATA. FOR REVERSE 
VIDEO ACTIVE AND 
INACTIVE. 
GENERATE LOOPS 
TO LOAD AND SHIFT 
DATA. 



END OF SIMULATION 



Figure 11. Design File of Video Shift Register (Cont'd.) 



DOTCLK C 


^ 


D Vcc 


DOTCLK C 


<J 


D Vcc 


DOTCLK C 


>^ 


DVcc 


RST C 




3 RDY 


RST C 




3 R3 


RST C 




3 so 


SO C 




J HSYNC 


cEi n 




D R2 


D1 C 




DS1 


S1 C 




3 MA6 


NC C 




3 R1 


D2 C 




3 S2 


32 C 




:i MAS 


NC C 




D RO 


D3 C 




2 VIDEO 


VSYNC C 




3 MA4 


NC C 




J VSYNC 


D4 C 




3 NC 


NC C 




3 MA3 


NC n 




3 MA11 


DSC 




D NC 


NO C 




3 MA2 


NC C 




D MA10 


D6 C 




3 NC 


NC C 




J KM1 


NC C 




3 MA9 


D7C 




D NC 


NC n 




3 MAO 


WR C 




D MA8 


BLANK C 




3 NC 


WR C 




2 BLANK 


TST C 




3 MA7 


VSYNC C 




U NC 


GND C 




3 cs 


GND C 




3 NC 


GND C 




D REV 




PAL32VX10 






PAL32VX10 






PAL32VX10 




LINES 


5YNCGENEF 


ATOR 


FRAME 


SYNC GENE 


RATOR 


VIDEO SHIFT REGISTER 



Figure 12. Pinouts for Various Components of a Small System Graphics Controller 
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The PAL®32VX10 offers design engineers several features not 
previously found in programmable logic devices (PLDs). Many 
of these features are found in the output macroceli; which al- 
lows over thirty possible configurations. This application note 
will highlight some of these features, and show how they are 
used in a Dual Port Video Shift Register. 

The PAL32VX10 is a 24-pin PLD with twelve dedicated inputs. 
The ten I/O macrocells have dual feedback, providing twenty 
additional array inputs. Each macroceli has the architecture 
shown in Figure 1 . 

CLOCK INPUT 



:^ 



FUSE SYNCHRONOUS 
ARRAY RESET 



OUTPUT ENABLE 




ll> 



VARIED 
PRODUCT TERMS; 
8, 10, 12, 14 OR 16 



:^ 



:^ 



REGISTER 
BYPASS CONTROL 




BURIED REGISTER 
FEEDBACK 



FEEDBACK FROM INPUT PIN 



AP 

ASYNCHRONOUS 
PRESET 

Figure 1. PAL32VX10 Macroceli 

Macroceli Description 

A D-type flip-flop is clocked by a dedicated input which also 
feeds the programmable array. The flip-flop data input is fed by 
a varied number of product terms, from eight to sixteen per out- 
put. The sum of these products feeds an Exciusive-OR (XOR) 
gate controlled by an additional product term. Global reset and 
preset signals are also controlled by product terms. 
A fuse controls flip-flop output polarity. A product term allows 
the flip-flop to be bypassed, and another product term enables 
the output. Feedback signals come from the flip-flop and the 
I /O pin, providing dual feedback. 



The device is manufactured with Monolithic Memories' ad- 
vanced oxide-isolated process, providing high speed and low 
power consumption. The propagation delay is 30 ns maximum 
for the standard version and 25 ns maximum for the A-speed 
version. Power consumption is only 180 mA maximum. 

J-K Programmable Function 

The XOR gate in each of the ten macrocells provides the de- 
signer with the option to extend the functionality of the D-type 
register to perform J-K and subset functions, such as T-type 
toggle flip-flops for binary counter applications. This option can 
aid product term economy considerably. In many state ma- 
chine designs large numbers of product terms are needed as 
holding functions for a D-type flip-flop. The hold function is not 
inherent to the truth table of a D-type register, but in a J-K 
function it is, so a logic designer can invoke the hold condition 
without using up any of the product terms in the macroceli. 
It can be shown that there exist five logic equations for a J-K 
function using a D-type flip-flop, an XOR gate, and sum of 
product inputs; 

1.Q:=Q*J + Q*K 
2.Q:=Q*J+Q*K 

3. Q := :+: (Q*J + Q*K) 

4. Q := Q :+: (Q*J + Q*K) 

5. Q := Q :+: {Q*J+ Q*K) 

See Appendix 1 for the derivation of these equations. 

The key feature to note about these equations is that the first 
two can be realized from the sum of two product terms but ei- 
ther J or K must be inverted. A DeMorgan expansion of a com- 
plex J or K input might result in using too many product terms, 
and may not fit in the PAL32VX10. This limitation can be cir- 
cumvented by using equation 3; here, J and K inputs need no 
inversion but there is a requirement for the XOR operator, as 
shown in equations 3, 4 and 5. Alternatively, the non-inverted J 
and K functions may use more terms than the inverted ver- 
sions, pointing to the use of equations 1 , 2, or 5. The macroceli 
structure is capable of making use of any one of these five 
equations, allowing the designer to choose the one that makes 
optimum use of product terms. 

The fuse array was extended to 9,738 programmable fuses to 
accommodate ten additional feedback paths from the ten ma- 
crocells. These are the ten buried feedback paths (from the 
flip-flop) that make it possible to realize buried register state 
machine designs. Any macroceli whose output pin is pro- 
grammed as an input still has available its internal feedback 
path, and the registered cell's functionality need not be lost. 
The cell could be programmed as a buried register for use in a 
burled state machine. This feature extends the application of 
this device to input-intensive designs such as the Dual Port 
Video Shift Register. 
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RO- 

R1 ■ 

R2' 

R3- 

MDO- 

MD1 • 

MD2 ' 

MD3 ' 

MD4 ' 

MD5 ' 

MD6 ' 



DISPLAY 
MEMORY 
BUFFER 



SYSTEM DATA BUS 
D0D1D2D3D4D5D6 



DATA 

INTERFACE 

BUFFER 



^w^ 



CHARACTER 

GENERATOR 

PROM 



AO 
A1 
A2 
A3 
A4 
AS 
A6 
A7 
A8 
A9 
A10 



DOT 

CLOCK 



SELECT PORT 1 

INPUT FROM 
MPU SYSTEM 



DUAL PORT 

VIDEO 

SHIFT 

REGISTER 

DAI DB1 



DA2 
DA3 
DA4 
DAS 
DA6 
DA7 
SELA 



DB2 
DB3 
DB4 
DBS 
DB6 
DB7 
SO 
SI 
S2 



VSYNC 
BLANK 

VIDEO 
PAL32VX10 



FROM 

■ HORIZONTAL 

SYNC 

GENERATOR 



•RAW 



AO 

A1 

A2 

A3 

A4 

AS 

A6 

A7 

A8 

AG 

A10 

CHARACTER 

GENERATOR 

RAM 



-SERIAL 
VIDEO 
OUTPUT 



FROM VERTICAL 
SYNC GENERATOR 



AO 
•A1 
■A2 
■A3 
■A4 
■AS 

■ A6 

■ A7 
•A8 
-A9 
-AID 



^ 



SYSTEM 
ADDRESS 



Figure 2. Dual Port Video Shift Register in Video System 



Dual Port Video Sliift Register 

This design highlights the the advantages of using PAL devices 
for implementing custom designs. This design is shown in the PAL 
device Design Specification file on page 2-279. Figure 2 shows 
how the device would be configured to a Microprocessor 
Unit (MPU) system as a parallel-to-serial converter in video 
applications. 

The PAL32VX10 in the system is capable of supporting one of 
two parallel input paths from character storage elements. The 
selected port input is converted to a serial video data stream 
through the VIDEO output for display on a CRT screen. Events 
of loading and shifting are synchronized to a Dot Clock input 
which is applied to pin 1 of the device, and is the rate at which 
the character dots are displayed on the CRT. There are two in- 
put ports, DA1-DA7 and DB1-DB7. One of these inputs is select- 



ed by the SELA control. When SELA is High the DA1-DA7 input 
is loaded, and subsequently shifted to the VIDEO output. When 
SELA is Low the DB1-DB7 input is loaded and shifted. 

The DA1-DA7 port could be configured to a character PROM, 
the content of which is a fixed character font. A second character 
memory could be configured to the DB1-DB7 port, which could 
be a RAM circuit. While the standard character font of the PROM 
is being displayed on the CRT, the RAM memory could be pro- 
grammed by the host CPU with a specialized character font. The 
contents of the RAM could be changed at any time, so a wide 
variety of custom fonts could be loaded, while the system is dis- 
playing the regular standard font stored in the PROM. Switching 
of the SELA input is controlled by the host system to select the 
required character memory. 
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Each of the two input ports is seven bits wide and is designed 
to interface with character matrix sizes of 7 by 9. However, if 
the SELA input is controlled by an external register which de- 
codes the state outputs SO, S1 and S2, the two ports may be 
configured as one single port, fourteen bits wide. 
The inputs BLANK and VSYNC are used to force a video blank- 
ing output during line and frame flyback. The BLANK input 
overlaps the line sync signal in a practical application to create 
front and back porch blanking on the CRT screen. 
Of the ten registered macrocells in the PAL32VX10, seven are 
connected as a parallel load serial shift register to the VIDEO 
output. The three remaining registers are configured as a 
three-bit binary counter. The counter outputs SO, S1 and S2 
determine whether the device is in load or shift mode. When 
S0-S2 are all High the shift register loads character informa- 
tion; otherwise, shifting takes place for the seven remaining 
states of the binary counter. The output pins SO, S1 and S2 are 
connected to the buried registers 01 0, 09 and 08 respective- 
ly, and are assigned in the PAL device Design Specification by 
SO = 01 0, etc. Similarly, the VIDEO output pin is assigned reg- 
ister 07. Registers Q1 to 06 remain buried; the output pins 
normally associated with these register cells have been as- 
signed as the dedicated DB port inputs DB2-DB7. 

The three-bit binary counter has been designed using a T-type 
toggle register. Choosing Equation 4 and equating J = K = 1 for 
the Toggle function and J = K = for the Hold function, in gen- 
eral J = K = T. The new input for a Toggle function becomes T 
and the equation becomes: 

6.Q:=Q:+:(Q+-Q)*T 
but since 

7. (Q + 0)=1 

the equation becomes: 



The oscilloscope photograph in Figure 4 shows the SO, SI and 
S2 outputs, running from the top trace down, and the serial 
VIDEO data on the bottom trace. The Load event takes place 
when all state outputs are High. For the first section of the 
trace, port DB7-DB1 is loaded and subsequently shifted 
through the buried register file. The cursor on the right marks 
the end of one Load/Shift cycle. The bit pattern loaded into 
DB7-DB1 is HLLHHHL. In the second half of the the trace, port 
DA7-DA1 is selected. The bit pattern of LLHLHLH is clocked 
into the DA7-DA1 port and shifted through to the VJDEO out- 
put. 
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HLLHHHL LLHLHLH 

Figure 4. Video Shift Register Waveforms 



8. O := O :+: T. 

The basic function in a binary counter is that a flip-flop must re- 
main in a Hold state until the product term made up of all of the 
less significant flip-flops in the count sequence becomes a log- 
ic High. So, the equation for 08 in the design specification be- 
comes: 

9. 08:= 08:+: 09*010. 

When 09*010 = flip-flop 08 holds its contents, and when 
09*010 = 1 08 changes state. 

The PAL device Design Specification shows that eighteen in- 
puts and four outputs are required, but the buried register fea- 
ture has not wasted the macrocells whose associated pins 
have been used as inputs. The design has been programmed 
with two other PAL devices (not shown here), one as a line 
sync generator and one as a frame sync generator, to form the 
basis of a low-cost video system. 

A simulation section has been included with the design to veri- 
fy functional operation before the Design Specification is com- 
mitted to the fuse array of the PAL device. It is recommended 
that the A-speed part is used in this specific application to ac- 
commodate good resolution of the displayed video information. 
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Appendix 1 

Derivation of the J-K Function 

The complete truth table of a J-K function is shown in Table 1 . 
Inputs J and K are tabulated with the current state of the regis- 
ter (QN) and the required next state {QN+1). Equations 1 and 2 
from the text are derived by directly grouping ones (S and s) or 
zeros (R and r) in the truth table. The other equations for the J- 
K function are derived by tabulating active 'SET' and 'RESET' 
functions and passive 'set' and 'reset' functions. 





J 


K 


On 


°N+1 


FUNCTION 


1 














reset 


2 








1 


1 


set 


3 


1 








1 


SET 


4 


1 





1 


1 


set 


5 





1 








reset 


6 





1 


1 





RESET 


7 


1 


1 





1 


SET 


8 


1 


1 


1 





RESET 




Q = /Q*J + QVK 


EQ.1 


/Q = /QVJ + Q*K 


EQ.2 


Q= Q :+: (/Q*J + Q*K) 


EQ.3 


/Q = /Q :+: (/Q*J + Q*K) 


EQ.4 


Q = /Q :+: (/QVJ + QVK) 


EQ.5 



Tablel. Truth Table of J-K Function Using Change of State. 



Identifying Transition and Hold conditions as well as logic High 
and Low states enables the designer to draw a Karnaugh map 
for an XOR-gate register input. The XOR gate has the capabili- 
ty of a Hold function for the remain in 'set' and 'reset' states, 
and a Change-of-State function for active 'SET' and 'RESET'. 
To derive Equation 3, 

Q := Q :+: (Q*J + Q*K), 
the following equation is considered: 
Q := s :+: (R + S), 

or, "Q holds unless an active transition is true." The entries in 
the equation show Hold and Transition states. When both 'S' 
and 'R' are inactive Low the register Q is in a Hold condition by 
the 's' entry in the equation. If 's' and 'R' become High simulta- 
neously the result will be an active Reset after the following 
clock edge; for this reason minimization of the 's' equation can 
include any 'R' entries in the Karnaugh Map. Also, the states of 
'S' and 's' are mutually exclusive, so with 's' inactive and 'S' ac- 
tive the output will transition to a Set condition. 

Minimize all the Hold 'set' states but include the active 'RESET' 
states for the entry to the left of the XOR function.This gives Q 
as shown in group 2 of the Karnaugh map. Then identify the 
change-of-state entries shown by the capital letters 'S' and 'R'. 
Minimization achieved in groups 1 and 3 are shown to the right 
of the equation. 

With regard to the equation for Q a similar procedure applies: 
Q := r :+: (S + R) 

This time all the passive 'reset' conditions are minimized with 
active 'SET' states, and XORed with minimization of active 
'SET' and 'RESET' states (group 4). This leads to Equation 4. 

Equation 5 is derived by examining the equation for "Q will tog- 
gle unless a Hold is true", or 

Q:='Q:-(-:(s + r); 

the derivation is similar to those above. 
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TITLE 

PATTERN 

REVISION 

AUTHOR 

COMPANY 

DATE 



DUAL PORT VIDEO SHIFT REGISTER. 

OIB 

01 

CHRIS JAY. 

MMI SANTA CLARA, CA. 

15TH MARCH 1987. 



THE PAL32VX10 HAS BEEN DESIGNED AS A VIDEO SHIFT REGISTER 
WITH A DUAL SEVEN BIT PORT INPUT. PARALLEL CHARACTER DATA 
MAY BE LOADED FROM A FIXED CHARACTER ROM VIA PORT INPUTS 
DAI - DA7, WHILE THE SELA INPUT IS HIGH. PORT INPUTS DB1-DB7 
MAY BE CONNECTED TO A CHARACTER RAM, THE CONTENTS OF WHICH 
CAN BE CHANGED BY THE HOST MICROPROCESSOR. THIS ENABLES THE 
VIDEO SYSTEM DISPLAY A STANDARD CHARACTER FONT WHILE DYNAMICALLY 
CHANGING CUSTOM CHARACTER FONTS IN THE CHARACTER RAM. WHEN A 
NEW CHARACTER DATA IS LOADED INTO THE RAM THE SELA INPUT CAN BE 
DRIVEN LOW TO SELECT THE DB1-DB7 PORT INPUT. THE ON CHIP 
STATE COUNTER SO, SI AND S2 FORMS A SYNCHRONOUS BINARY COUNT. 
STATES TO 6 REPRESENT SHIFT ACTIVITY IN THE SEVEN BIT VIDEO 
SHIFT REGISTER. WHEN THE COUNTER REACHES A COUNT OF SEVEN THE 
NEXT CHARACTER DATA IS LOADED INTO THE REGISTER VIA PORT DA OR DB. 
THERE ARE TWO BLANKING INPUTS /VSYNC AND BLANK. WHEN /VSYNC IS 
LOW THE CONTENTS OF THE VIDEO SHIFT REGISTER ARE LOADED WITH LOGIC 
ZEROS. THIS IS ALSO THE CASE WHEN BLANK IS ACTIVE HIGH. THE 
VSYNC AND BLANK INPUTS SHOULD BE USED DURING FRAME AND HORIZONTAL 
FLYBACK, RESPECTIVELY. 

CHIP VIDE0_2 PAL32VX10 

DUAL PORT VIDEO SHIFT REGISTER PAL 



PINS 


1 . 
CLK 


2 
DAI 


3 
DA 2 


4 
DA 3 


5 
DA4 


6 
DAS 


PINS 


7 
DA6. 


8 
DA7 


9 
BLANK 


10 
/VSYNC 


11 
SELA 


12 
GND 


PINS 


13 
DBl 


14 
DB2 


15 
DB3 


16 
DB4 


17 
DB5 


18 
DB6 


PINS 


19 
DB7 


20 
VIDEO 


21 
S2 


22 
SI 


23 
SO 


24 
WCC 



GLOBAL Ql Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9 QIO 

STRING LOAD 'Q8*Q9*Q10' ;ENABLE LOADING OF CHARACTER DATA. 



EQUATIONS 

/QIO : = 

SO 

SO.CMBF 



/Q9 

SI 
Sl.CMBF 



+ : 



QIO 
QIO 
GND 

/Q9 

QIO 

Q9 
GND 



INTERNAL STATE COUNTER 
LEAST SIGNIFICANT BIT 
QIO. ENABLE REGISTER 
OUTPUT FOR PIN SO. 
INTERNAL STATE COUNTER 
QIO = HIGH = TOGGLE. 
ENABLE PIN SI 
REGISTERED OUTPUT. 



H 
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/Q8 : = 
: + ; 
S2 
S2.CMBF = 



/Ql 



/Q2 



/Q3 



/Q4 



/Q5 



/Q6 



/Q8 

Q10*Q9 
Q8 
GND 



INTERNAL STATE COUNTER 
Q10*Q9 = HIGH = TOGGLE, 
ENABLE PIN S2 AS A 
REGISTERED OUTPUT. 



/DA1*L0AD*SELA*/BLANK*/VSYNC 

/DBl*LOAD*/SELA*/BLANK*/VSyNC 

/Q8 

/Q9 

/QIO 

BLANK 

VSYNC 

/DA2*L0AD*SELA*/BLANK*/VSYNC 

/DB2*L0AD*/SELA*/BLANK*/VSyNC 

/Q1VQ8 

/Ql*/Q9 

/Ql*/Q10 

BLANK 

VSYNC 

/DA3*L0AD*SELA*/BLANK*/VSYNC 

/DB3*L0AD*/SELA*/BLANK*/VSYNC 

/Q2*/Q8 

/Q2*/Q9 

/Q'2*/Q10 

BLANK 

VSYNC 

/DA4*L0AD*SELA*/BLANK*/VSYNC 

/DB4 *LOAD*/SELA*/BLANK*/VSYNC 

/Q3*/Q8 

/Q3*/Q9 

/Q3*/Q10 

BLANK 

VSYNC 

/DA5*L0AD*SELA*/BLANK*/VSYNC 

/DB5*L0AD*/SELA*/BLANK*/VSYNC 

/Q4VQ8 

/Q4*/Q9 

/Q4*/Q10 

BLANK 

VSYNC 

/DA6*L0AD*SELA*/BLANK*/VSYNC 

/DB6*L0AD*/SELA*/BLANK*/VSYNC 

/Q5*/Q8 

/Q5*/Q9 

/Q5*/Q10 

BLANK 

VSYNC 



LOAD DATA INPUT DAI, 
LOAD DATA INPUT DBl , 
SHIFT HIGH INTO Ql . 
WHEN Q10,Q9 AND Q8 
ARE NOT ALL HIGH 
SIMULTANEOUSLY. 
LINE/FRAME BLANKING 

LOAD DATA INPUT DA2 , 
LOAD DATA INPUT DB2, 
SHIFT DATA FROM Ql 
INTO Q2. 

LINE AND FRAME 
BLANKING. 

LOAD DATA INPUT DAB, 
LOAD DATA INPUT DBS 
SHIFT DATA FROM Q2 
INTO Q3. 

LINE AND FRAME 
BLANKING. 

LOAD DATA INPUT DA4 , 
LOAD DATA INPUT DB4 
SHIFT DATA FROM Q3 
INTO Q4. 

LINE AND FRAME 
BLANKING. 

LOAD DATA INPUT DAB, 
LOAD DATA INPUT DB5 
SHIFT DATA FROM Q4 
INTO Q5. 

LINE AND FRAME 
BLANKING. 

LOAD DATA INPUT DA6 
LOAD DATA INPUT DB6 
SHIFT DATA FROM Q5 
INTO Q6. 

LINE AND FRAME 
BLANKING. 
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/Q7 






/DA7*L0AD*SELA*/BLANK*/VSYNC 


?LOAD DATA INPUT DA6 . 




+ 




/DB7*L0AD*/SELA*/BLANK*, 


/VSYNC 


;LOAD DATA INPUT DB6 . 




+ 




/Q6*/Q8 










■SHIFT DATA FROM Q6 




+ 




/Q6*/Q9 










;INTO Q7. 




+ 




/Q6*/Q10 














+ 




BLANK 










•LINE AND FRAME 




+ 




VSYNC 










•BLANKING. 


/VIDEO 




= /Q7 












/VIDEO. CMBF 




= GND 










•OUTPUT Q7 TO VIDEO. 


SIMULATION 














•START OF SIMULATION 


TRACE_ 


ON 




CLK SO SI S2 








•TRACE ALL ESSENTIAL 








VIDEO /VSYNC 


: BLANK 


SELA 












Ql Q2 Q3 


Q4 


Q5 Q6 






•INPUTS AND OUTPUTS. 








DAI DA 2 DA3 


DA4 DAS DA6 


DA7 










DBl DB2 DB3 


DB4 DBS DB6 


DB7 




PRLDF 






Ql Q2 Q3 


Q4 


QS Q6 


Q7 Q8 Q9 QIO 


PRELOAD REGISTERS 


SETF 






/CLK 










SET INITIAL CONDITIONS. 








DAI /DA 2 


DA 3 


/DA4 


DAS /DA6 DA7 


ON DATA A INPUT PORT 








/DBl DB2 


/DB3 DB4 


/DBS DB6 /DB7 


AND DATA B INPUT PORT. 








SELA BLANK /VSYNC 






ENABLE A PORT AND 


FOR I 


:= TO 


7 


.DO 










BLANK INPUT. 


BEGIN 


CLOCKF CLK 










BEGIN THE DOT CLOCK 


END 
















INPUT FOR EIGHT CLOCK 


SETF 


/SELA 














CYCLES. SELECT B INPUT 


FOR I 


:= TO 


7 


DO 










ENABLE CLOCK FOR EIGHT 


BEGIN 


CLOCKF CLK 










CLOCK CYCLES. 


END 


















SETF 


/BLANK 


SELA 










REMOVE BLANKING INPUT 


FOR I 


:= TO 


7 


DO 










SELECT A PORT CLOCK 


BEGIN 


CLOCKF CLK 










FOR EIGHT CYCLES. 


END 


















SETF 


/SELA 














ENABLE B INPUT PORT 


FOR I 


:= TO 


7 


DO 










CLOCK FOR EIGHT CLOCK 


BEGIN 


CLOCKF CLK 










CYCLES. 


END 


















SETF 


VSYNC 














ENABLE VSYNC INPUT 


FOR I 


:= TO 


7 


DO 










CLOCK FOR EIGHT 


BEGIN 


CLOCKF CLK 










CLOCK CYCLES. 


END 


















TRACE 


OFF 














END OF SIMULATION 



MonoliihiG 



m 



Memories 



2-281 




Notes 



2-282 



a 



M. 



Monollthlo innU Memories 



Digital Signal 
Processing 



The field of Digital Signal Processing (DSP) was originally limited 
to the digital filtering and processing of analog signals. Today, the 
field encompasses a wide range of numeric processing applica- 
tions, primarily due to the versatility of the Fast FourierTransform 
algorithm with its accompanying repetitive addition and multipli- 
cation operations. 

The ability to perform multiplication and addition in a single clock 
cycle is the key to achieving high-speed mathematics in DSP 
hardware. When executing math-intensive algorithms, even the 
slowest DSP components are at least ten times faster than con- 
ventional microprocessors. When a DSP solution is used, it is 
usually for speed considerations. Conventional microprocessors 
are not optimized for arithmetic, and have to simulate math 
through software. This simulation tends to require several clock 
cycles. DSP systems use specialized hardware, including a 
multiplier and adder, that allow them to execute numeric opera- 
tions in a single cycle. A typical DSP system is shown in 
Figure 1. 
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. 





Figure 1. High-Performance DSP System 

In addition, the architecture must try to provide data to the 
multiplierand adder at every clock cycle, forthe highest efficiency. 
This can be done by using an architecture where data and 
instructions are stored separately and have their own buses, one 
for data and one for instructions. If two data buses are used, two 
data elements can be moved to the multiplier or adder in parallel 
for execution, while the CPU is fetching an instruction. Another 
efficient architecture uses pipelining, allowing the processor to 
work on subsections of the operation in parallel. 



Application Areas 

The algorithms implemented in DSP components are being 
applied to a wide variety of systems. Examples include standard 
signal processing for applications from compact disc players to 
artificial intelligence, matrix-manipulation algorithms for graphics 
systems, filtering algorithms used in image-processing systems, 
and floating-point algorithms used in workstations and scientific 
computers. Many of these applications are often categorized 
under "numerical processing" or "number crunching" rather than 
DSP, but any algorithm that makes extensive use of repetitive 
multiplication and addition operations can benefit from DSP 
hardware. The primary application area is communications, with 
military applications also a major part of the market. The 
application areas are listed below. 

• Communications 
-Codecs 
-Modems 

-Multiplexers/Demultiplexers 
-Echo Cancellation 
-Filtering 

-Gain equalization 

• Military 
-Radar/Sonar 
-Guidance and Control 
-Telemetry 

-Electronic Warfare/Countermeasures 

• Computers 

-PC-Board Add-Ons for PCs 
-Minicomputers/Workstations 
-Parallel/Array Processors 

• Graphics and Image Processing 
-CAD/CAE 

-Image Enhancement and Restoration 

-Image Subtraction, Compression, Decompression 

-CAT Scanning 

• Industrial Automation 
-CAM 

-Vision Systems/Robotic Vision and Other Senses 
-Artificial Intelligence 

• Voice Data Entry/Synthesis 

• Test Equipment 

• Process Control 

• Consumer Products 
-Television 
-Compact Disc Players 
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Figure 2. Typical General-Purpose DSP Processor System 



Speed versus Other Requirements 

As mentioned, most DSP designs are chosen because of speed. 
However, speed is not always a critical element once a DSP 
solution is selected: the slowest DSP design will still be f asterthan 
the fastest conventional microprocessor design. 

Speed remains a critical element in most communications appli- 
cations. An exception is modems, which require single-chip 
solutions for size reduction and do not require high speed to 
handle voice frequencies. Speed is also critical in computer 
applications, but price and performance advantages are also 
important. DSP add-in boards require small size, and therefore 
single-chip solutions. In military applications, however, price is 
less critical, and designs are typically customized for 
performance. 

Consumer applications use custom single-chip solutions for both 
size reduction and cost reduction; these are usually produced by 
a captive supplier. Many of the other applications require a 
flexible solution and a short development time, both efficiently 
provided by programmable logic devices. 



Single Chip versus Building Block 

Similar to the beginnings of the conventional microprocessor, 
dedicated DSP processors are being introduced along with 
development software. Single-chip DSP solutions offer easier 
software development, smaller size, and lower cost. Several 
DSP processors are commercially available, and are taking the 
place of conventional microprocessors in DSP systems (Figure 
2). Most are general-purpose processors, but application-spe- 
cific processors are also being developed. 

In addition to these dedicated DSP processors, custom proces- 
sors can be built with an ASIC methodology. Custom processors 
are especially efficient for high-volume applications. 

The alternative to a single-chip DSP processor is a building-block 
solution using multiple components. These components would 
typically involve bit-slice, multiplier/accumulators, ALUs, and 
program memory, as performed in the application note on page 
2-307. These solutions provide higher speed and are efficient in 
speed-critical or low-volume applications. An example is medical 
image processing, where only a few dedicated image filtering 
routines are performed repeatedly on the images — ^the building 
blocks can be designed to perform quickly a small set of functions. 
Many applications use a mix of single-chip solutions with accom- 
panying building blocks for added functionality (see Figure 3). 



HOST COMPUTER 



ALGORITHM #1 

Am29540 FFT 

ADDRESS SEQUENCER 



ALGORITHM #3 



ADDRESS 

PIPELINE 

Am 29520/21 



ADDRESS 
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Am29825 



m 



IMAGINARY DATA 



REAL 

PROCESSOR 

Am29501 



SINE/COSINE 

COEFFICIENTS 

Am 29526/7 



IMAGINARY 

PROCESSOR 

Am 29501 



MULTIPLIERS 
Am29516A/17A 



MICROPROGRAMMED 
CONTROL UNIT 



Utf 



Figure 3. Am29500 Processor Family Architecture. 
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PLD Applications 

PLDs are useful both in single-chip and building block designs. If 
DSP components are used as accelerators in conjunction with 
standard microprocessors, the interface logic can be placed 
into PLDs. 

When supporting DSP processors, PLDs perform functions 
similar to those supporting a general purpose microprocessor. 
These include address decoding, interrupt control, DMA control, 
and bus control. See the section on Microprocessor-Based 
Systems for more detail. 

Programmable logic devices are also useful in the building-block 
applications. PLDs often perform functions similar to their use in 
standard bit-slice designs, as counters or sequencers. 

The need for a continuous supply of data to the numeric compo- 
nents of the DSP system provides opportunities for PLDs in both 
building-block and single-chip designs. Handshaking between 
components, especially between the numeric chips and the 



memory or sequencer, is also required, and with multiple buses 
in the system, complex bus interface logic is needed. 

The application note on page 2-307shows PALdevices providing 
several functions for a building-block DSP system which imple- 
ments an audio spectrum analyzer. The PALdevices create the 
sequencerforprogram execution, multiplex the dataoutputs to an 
oscilloscope, and provide several miscellaneous control func- 
tions. 

PLDs provide an effective solution in any type of DSP architec- 
ture. General-purpose DSP processors need PLDs to perform 
typical processor support functions, while ASIC processors may 
need additional logic to provide the proper interface to other 
chips. In building block systems, PLDs perform both the basic 
control functions and interface functions. With limited availability 
of software development tools for most DSP applications, pro- 
grammable logic devices offer the flexibility to implement last- 
minute design changes to adapt to the needs of the software and 
application. 
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Waveform Generator 



A simple waveform generator can be built with a PAL device ad- 
dressing locations within a PROM. Using a PAL20X1 device to 
address locations in a registered PROM (Monolithic Memories 
63RA1 681 ), digital data may be accessed and fed to a Digital to 
Analog Converter (DAC). The locations in the memory are pro- 
grammed with appropriate data to provide a sinusoidal, triangu- 
lar, sawtooth or step function output. A square wave output may 
be generated directly from the PAL device itself. 

Digital vs. Analog Waveform 
Generation 

There are advantages and disadvantages in generating analog 
functions by this method when considered against a dedicated 
analog circuit design. The main advantages favoring the digital 
method of waveform generation involve controllability. In this 
design example phase shift generation can be controlled within a 
tolerance of one degree. The PALdevice has been programmed 
as a 180 degree up/down counter. An initial count and up/down 
control may be loaded intothe counter so precise phase shiftscan 
be generated. When considering the phase shift of a sinusoidal 
waveform in an analog circuit, complex Resistor-Capacitor (RC) 
networks might need to be added to generate phase shifting. 

Another advantage is the ability of the system to generate a wide 
range of waveform types, and the convenience of generating new 
waveforms by simply re-programming a PROM, or programming 
blank locations in the existing PROM. Also, if it were a require- 
ment to generate a burst of any number of cycles, this could be 
controlled by a digital interface from a microprocessor. 

Thedisadvantages associated with the digital circuit would be the 
generation of quantization noise due to the encoding of a digital 
code into a close approximation of the analog conversion. For 
example, an eight-bit digital code can be converted to 256 differ- 
ent voltage levels. Supposing a 1 volt signal were to be evenly 
encoded into 256 discrete levels, the difference between each 
level would be about 39 mV. The approximation of the digital 
signal to its analog equivalent gives rise to quantization noise, in 
this example 39 mV. 



A way of reducing the quantization noise is to generate a larger 
digital code. For a 1 0-volt dynamic range encoded by a fourteen- 
bit digital code, the quantization noise is reduced to 0.61 mV. Of 
course there are problems in increasing the resolution of the 
digital code in the respect that a much higher sampling frequency 
would be required to generate the analog output of a given period. 

Additional to the problem of quantization noise is the noise 
generated by the super-imposition of elements of the master 
sampling clock on the analog wave output. Fortunately the 
frequency of this clock is much higher than the analog output, for 
a resolution of eight bits, and may be filtered out with a simple RC 
filter. 

In summary, the advantages in a digital waveform generator are 
in the controllability of phase shifting, and the number of cycles 
generated. Also, the ability to guard against drift might be an 
advantage. With an analog system, purity of signal output and the 
ability to generate higher frequencies are the main advantages. 

PAL Device Implementation 

The diagram in figure 1 shows the circuit used to create the 
waveforms. Photographs of the waveforms are provided in figure 
2. The different functions generated were selectable by a unique 
digital code applied to the higher order address inputs A8, A9, and 
A1 of the PROM. The lower order addresses were driven from 
the PAL20X10; outputs Q0-Q7 were fed to address inputs 
A0-A7. The design specification of the 180 Degree Up/Down 
counter (page 2-288) was encoded into the PALdevice. 

Look-Up Table Generation 

To generate the hexadecimal look-up table, which was pro- 
grammed into the PROM, a small TURBO PASCALprogram was 
written to provide a conversion from the angle (degrees) to the 
sine of the angle, in a hexadecimal value (page 2-290). The 
program generated the nearest hexadecimal value to 100 
(sine(x) +1), for a range of 180 degrees, at a resolution of one 
degree. The sawtooth and triangular waveforms were simple 
identity functions; the PROM location address was programmed 
into its own contents. The step waveform was programmed as 
constant values in a number of successive adjacent locations. 
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FREQUENCY 
ADJUST 



LOAD 
PHASE 
INPUT 



VOLTAGE 
CONTROLLED 
OSCILLATOR 



CLOCK 

LD 

DO 

D1 

D2 



D3 PAL20X10 Q4 



D4 
D5 
D6 
D7 
- DOWN 



SELECT FUNCTION 
WITH DIGITAL TTLCODE 



A8 A9 A10 



CLK 
AO 
A1 
A2 



A3 63RA1681 04 



T 



DIGITAL GROUND 



5 VOLT 

Figure 1. 



AD588 

VOUT 

SENSE 

10VOLT 

SELECT 



ANALOG OUTPUT 



ANALOG GROUND 
DIGITAL GROUND 
+VCC = 12V 



DIGITAL GROUND 





Triangular Wavoform 



Sinowavo 





Step Waveform 



Sawtooth Waveform 



Figure 2. 



^ 



.liiliill. 



Monolithio UUmJ Memories 



2-287 



Waveform Generator 



TITLE 180DEGREE COUNT 
PATTERN 01 
REVISION 01 
AUTHOR CHRIS JAY 
COMPANY MMI SANTA CLARA 
DATE 17 OCTOBER 1986 

CHIP DEGREE PAL20X10 



rPIN 



1 


2 


3 


4 


5 


6 


CLK 


/LD 


DO 


Dl 


D2 


D3 


7 


8 


9 


10 


11 


12 


D4 


D5 


D6 


D7 


DOWN 


GND 


13 


14 


15 


16 


17 


18 


/OE 


QO 


Ql 


Q2 


Q3 


Q4 


19 


20 


21 


22 


23 


24 


Q5 


Q6 


Q7 


CUP 


CDN 


VCC 



fTHE PAL20X10 HAS BEEN DESIGNED AS A 180 DEGREE 
; LOADABLE UP/ DOWN COUNTER. THE COUNTER OUTPUTS 
? DRIVE THE ADDRESS INPUTS OF A PROM WHICH IS 
; PROGRAMMED WITH A 'LOOK UP' SINE TABLE. A COUNT 
;UP FROM ZERO TO 180 WILL GENERATE ONE HALF 
; CYCLE FROM THE PROM CONTENTS. THE COUNTER WILL 
;AUTOMATICALLLY SWITCH TO A DOWN COUNT AND GENERATE 
;THE SECOND HALF OF ONE COMPLETE CYCLE. IF THE 
;PROM OUTPUTS ARE REGISTERED AND FED TO A D/A 
.•CONVERTER A CONTINOUS SINUSOIDAL SIGNAL WILL BE 
.•GENERATED. COUNTING CAN BE INTERRUPTED WITH A 
.•LOAD. PHASE SHIFTING OF THE SINUSIODAL WAVEFORM 
;MAY BE ACHIEVED WITH A RESOLUTION OF 1 DEGREE. 



EQUATION 




/QO 


+ 


QOVLD 
/DO*LD 


/Ql 


+ 
: + 

+ 


/Q1*/LD 
/D1*LD 
: QO*/I'D*CUP 
/QO*/LD*/CUP 


/Q2 


+ 
: + 

+ 


/Q2*/LD 
/D2*LD 

: Q0*Q1*/LD*CUP 
/Q0*/Q1*/LD*/CUP 



.•COUNT 
.•LOAD 

;HOLD 
.•LOAD 
;UP COUNT 
.•DOWN COUNT 

;HOLD 
;LOAD 
;UP COUNT 
;DOWN COUNT 



/Q3 :- /Q3*/LD 

+ /D3*LD 

:+: Q0*Q1*Q2*/LD*CUP 
+ /Q0*/Q1*/Q2*/LD*/CUP 

/Q4 :- /Q4*/LD 

+ /D4*LD 

!+: Q0*Q1*Q2*Q3*/LD*CUP 
+ /Q0*/Q1*/Q2*/Q3*/LD*/CUP 



;HOLD 
;LOAD 
.•UP COUNT 
.•DOWN COUNT 

;HOLD 
.•LOAD 
;UP COUNT 
.•DOWN COUNT 

; 
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/Q5 := /Q5*/LD 

+ /D5*LD 

:+: Q0*Q1*Q2*Q3*Q4*/LD*CUP 
+ /Q0*/Q1*/Q2*/Q3*/Q4*/LD*/CUP 

/Q6 := /Q6VLD 

+ /D6*LD 

:+: Q0*Q1*Q2*Q3*Q4*Q5*/LD*CUP 
+ /Q0VQ1*/Q2*/Q3*/Q4*/Q5*/LD*/CUP 

/Q7 := /Q7*/LD 

+ /D7*LD 

: + : Q0*Q1*Q2*Q3*Q4*Q5*Q6VLD*CUP 
+ /Q0*/Q1*/Q2VQ3VQ4*/Q5*/Q6*/LD*/CUP 

/CUP := /CUPVLD 

+ DOWN*LD 

: + : /Q7*/Q6*/Q5*/Q4*/Q3*/Q2*/Q1*Q0*/I'D*/CUP 
+ Q7*/Q6*Q5*Q4*/Q3*/Q2*Q1*Q0*/I'D*CUP 

/CDN := CUP*/LD 
+ /DOWN*LD 

: + : /Q7 */Q6 */Q5*/Q4 */Q3 */Q2 */Ql*QO */LD*/CUP 
+ Q7*/Q6*Q5*Q4*/Q3*/Q2*Q1*Q0*/LD*CUP 

SIMULATION 

TRACE_ON CLK /DOWN LD CUP CDN 

QO Ql Q2 Q3 Q4 Q5 Q6 Q7 
SETF /CLK LD OE DO Dl D2 

/D3 /D4 /D5 /D6 /D7 DOWN 
CLOCKF 
SETF 
CLOCKF 
FOR I 

BEGIN CLOCKF 
END 



/LD 



TO 15 DO 



SETF 



/LD 



CLOCKF 

SETF 

CLOCKF 

FOR I 

BEGIN CLOCKF 

END 

TRACE OFF 



/CLK LD /DO /Dl /D2 
/D3 D4 D5 /D6 D7 /DOWN 



TO 15 DO 



HOLD 

LOAD 

UP COUNT 

DOWN COUNT 

HOLD 

LOAD 

UP COUNT 

DOWN COUNT 

HOLD 

LOAD 

UP COUNT 

DOWN COUNT 

COUNT UP 
REGISTER 



COUNT DOWN 
REGISTER 



LOAD 03H 
AND DOWN 
COUNT. SET 
INTO COUNT 
MODE AND 
GENERATE 
VECTORS TO 
TEST DOWN 
TO UP TRANS- 
-ITION. LOAD 
BOH AND UP 
COUNT. 
GENERATE 
VECTORS TO 
TEST UP TO 
DOWN TRANS - 
-ITION. 
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program sinetable; 

I *********************************** A****************************** J 

(* *) 

(* The program 'sinetable' has been designed to generate *) 

(* the closest possible HEXIDECIMAL value to 100 X [sine(x) +1 *) 

(* for -90 <= X => +90, in steps of delta x = 1 degree. The *) 

(* HEXIDECIMAL PROM address locations have been evaluated as *) 

(* as n = X + 90. The program will generate a SINE. DAT file *) 

(* which will contain a table of PROM address locations from *) 

(* from 00 to B4 HEX, and the contents of each location as the *) 
(* closest possible sine value as computed in the above equation. 
(* Program written by: 
(* Chris Jay and Ser-Hou Kuang. MMI Santa Clara. 



') 
') 

■) 

') 

f ******************************************* ***********************) 

10; 



const 

hexsize 
type 

hex_type = packed array [1.. hexsize] of char; 



var 

SinFile: text; 

i,k:integer ; 

s,x, f : real ; 

hexar,hexcon : hex_type; 

(******************************************************************) 
(* *) 

(* Procedure to convert decimal values into hexadecimal values *) 
(* *) 

/******************************************************************) 

Procedure DecToHex (dec : integer; var tohex: hex_type); 
var 

i, dig, idx, next : integer; 

Function hex (i : integer): char; 
begin 



case 


i of 






: 


hex 


= 


•0' 


1 : 


hex 


= 


'1' 


2 : 


hex 


= 


'2' 


3 : 


hex 


= 


'3' 


4 : 


hex 


= 


'4' 


5 : 


hex 


= 


'5' 


6 : 


hex 


= 


•6' 


7 : 


hex 


= 


'7' 


8 : 


hex 


= 


'8' 


9 : 


hex 


= 


'9* 


10 : 


hex 


= 


'A' 


11 : 


hex 


= 


'B' 


12 : 


hex 


= 


'C 


13 : 


hex 


= 


'D' 


14 : 


hex 


= 


•E' 


15 : 


hex 


= 


,pi 


end; 








end; 
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begin 

for i := 1 to hexsize do 

tohex [ i ] := ' ' ; 
idx := hexsize; 
dig := dec; 
next := dig div 16; 
while (next <> 0) and (idx > 0) do 
begin 

ToHex (idx] := hex (dig - (next * 16)); 
idx := idx -1; 
dig := next; 
next := dig div 16; 
end; 
ToHex [idx] := hex (dig); 
end; 



begin 

Assign(SinFile, 'B:SINE.DAT' ) ; 
Rewrite(SinFile) ; 
X := 0; 
f := 0; 
i := -90; 
s := pi/180; 
while i < 91 do 
begin 

k := i + 90; 
X := (i*s); 
f := 1 + sin(x) ; 
f := 100*f; 

DecToHex (round (f), hexar); 
DecToHex (k,hexcon); 
if i = -90 then 
begin 

writeln ('DEGREES LOG 100*SIN X + 1 HEXLOC HEX'); 
end; 

write (i:3, ' ' ,k:3, ' ' ); 
writeln (f:10:5,' ',hexcon,' ' , hexar); 
if i = -90 then 
begin 

writeln(SinFile, 'DEGREES 100*SIN(X)+1 HEXLOC 100*SIN( X)+l [ HEX ] ' ) 
end; 

writeln(SinFile,k:5, ' ',f:10:5, ' ',hexcon,' ', hexar); 
i := i + 1; 
end; 
Close(SinFile) ; 

end . 
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AN-160B 



The PAL®32VX10 offers design engineers several features not 
previously found in programmable logic devices (PLDs). Many 
of these features are found in the output macrocell, which al- 
lows over thirty possible configurations. This application note 
will highlight some of these features, and show how they are 
used in Dual Binary Rate Multipliers. 

The PAL32VX10 is a 24-pin PLD with twelve dedicated inputs. 
The ten I/O macrocells have dual feedback, providing twenty 
additional array inputs. Each macrocell has the architecture 
shown in Figure 1 . 



A fuse controls flip-flop output polarity. A product term allows 
the flip-flop to be bypassed, and another product term enables 
the output. Feedback signals come from the flip-flop and the 
I/O pin, providing dual feedback. 

The device is manufactured with Monolithic Memories' ad- 
vanced oxide-isolated process, providing high speed and low 
power consumption. The propagation delay is 30 ns maximum 
for the standard version and 25 ns maximum for the A-speed 
version. Power consumption is only 180 mA maximum. 



CLOCK INPUT 




AP 

ASYNCHRONOUS 

PRESET 



Figure 1. PAL32VX10 Macrocell 

Macrocell Description 

A D-type flip-flop is clocked by a dedicated input which also 
feeds the programmable array. The flip-flop data input is fed by 
a varied number of product terms, from eight to sixteen per out- 
put. The sum of these products feeds an Exclusive-OR (XOR) 
gate controlled by an additional product term. Global reset and 
preset signals are also controlled by product terms. 



Dual Binary Rate Multipliers 

The availability of the clock input in the fuse array avoids the 
addition of external logic in applications where it is necessary 
to use the system clock as a logic input. In many synchronous 
PLDs the system clock input is connected only to the register 
file. The rising edge of the clock input synchronizes data flow 
through the D-type registers, but the clock is not available as a 
Boolean signal in the fuse array. 

The application of the Dual Binary Rate Multipliers uses the 
buried register capability, the XOR function to build counters 
with a T-type register, and the presence of the clock input in 
the fuse array. The circuit requires sixteen data inputs. There 
are two data byte inputs DA0-DA7 and DB0-DB7. The two serial 
outputs are QA associated with the DA port and QB with the DB 
port. The clock input is divided by sixteen and is available at 
the QA or QB outputs multiplied by the binary data inputs DAO- 
DA3 or DA4-DA7, and DB0-DB3 or DB4-DB7. 

The output SELOP (Select Output) selects one half of each A 
or B byte as the multiplying vector. When SELOP is High the 
lower half of each input byte is selected to multiply the clock in- 
put. When Low, the upper half of each input is used to multiply 
the clock input. 

For example, if data inputs DA4-DA7 and DB4-DB7 were binary 
encoded as six and three, respectively, then six and three 
clock pulses would be gated through to the outputs QA and QB 
respectively while SELOP is Low. When SELOP goes High the 
data inputs DA0-DA3 and DB0-DB3 are selected as the multipli- 
ers. If DA0-DA3 contained the binary encoded input of eight, 
then eight clock pulses would be gated to QA. At the same 
time, if DB0-DB3 contained fifteen, fifteen clock pulses would 
appear at the QB output. These waveforms are shown in the 
oscilloscope photograph in Figure 2. 
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PAL32VX10 



Figure 2. Dual Binary Rate Multipliers Waveforms 



The two additional outputs, LSB and MSB, are byte-select out- 
puts. This internal state machine has four unique states, per- 
forming a binary count zero to three. Just as the SELOP output 
can be used to select the multiplying half of each byte, LSB 
and MSB may be decoded externally to select a one-of-four 
byte data input to be applied to parallel inputs A or B. A pro- 
grammable controlled pulse train from four byte inputs can then 
be available at the QA and QB outputs. 




Dual Binary Rate Multipliers 
Figure 3. Pin Layout 
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TITLE 

PATTERN 

REVISION 

AUTHOR 

COMPANY 

DATE 



DUAL BINARY RATE MULTIPLIERS 

05 

01 

CHRIS JAY 

MMI SANTA CLARA, CA 

AUGUST 6, 1986 



CHIP BINRATE PAL32VX10 

THE PAL32VX10 HAS BEEN PROGRAMMED TO PERFORM BINARY 
RATE MULTIPLICATION. THE OUTPUTS AT 
QA AND QB ARE ONE-SIXTEENTH OF THE CLOCK INPUT 
MULTIPLIED BY THE BINARY VALUE OF THE INPUTS 
DA0-DA3 AND DB0-DB3, RESPECTIVELY, WHEN SELOP 
IS HIGH; OR, ONE-SIXTEENTH OF THE CLOCK INPUT 
MULTIPLIED BY THE BINARY VALUE OP THE INPUTS DA4-DA7 
AND DB4-DB7, WHEN THE SELOP SIGNAL IS LOW. 



PINS 1 

CLK 



2 
DAO 



3 
DAI 



4 
DA2 



5 
DA 3 



6 

DA4 



;PINS 



7 
DAS 



8 
DA6 



9 
DA7 



10 
DBG 



11 
DBl 



12 
GND 



;PINS 



13 
DB2 



14 
DB3 



15 
DB4 



16 
DB5 



17 
DB6 



18 
QA 



;PINS 



19 
QB 



20 
DB7 



21 
SELOP 



22 

LSB 



23 
MSB 



24 
VCC 



GLOBAL QO Ql Q2 Q3 NC NC NC SEL LSBI MSBI ; BURIED NODES 



STRING Kl 'QO* Ql* Q2*/Q3' 
STRING K2 'QO* Ql*/Q2' 
STRING K3 •Q0*/Q1' 
STRING K4 'Q0*/Q1* Q2* Q3 ' 
STRING K5 'Q0*/Q1*/Q2*/Q3' 
STRING CKNL '/CLK* SEL' 
STRING CKNH •/CLK*/SEL' 

EQUATIONS 

/QO := QO 



STRING STATEMENTS 
TO DECODE USABLE 
STATES OF INTERNAL 
DIVIDE BY SIXTEEN 
BINARY COUNTER. 
CLOCK SELECTION 
STRING STATEMENTS 



; CLOCK DIVIDE BY TWO. 
; BURIED REGISTER. 



/Ql 



:= /Ql :+: QO 



; CLOCK DIVIDE BY FOUR. 
; BURIED REGISTER. 



/Q2 



;= /Q2 :+: Q0*Q1 



; CLOCK DIVIDE BY EIGHT. 
; BURIED REGISTER. 



/Q3 



= /Q3 :+: Q0*Q1*Q2 



; CLOCK DIVIDE BY SIXTEEN, 
; BURIED REGISTER. 
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/SEL 
SELOP 



:= /SEL :+: Q0*Q1*Q2*Q3 
:= SEL 



SELECT 0/P HIGH TO SELECT 
DATA 0-3, LOW TO 
SELECT DATA 4-7. 



QA 


= 


DAO 


* K1*CKNL 




+ 


DAS 


*/QO*CKNL 




+ 


DAI 


* K2*CKNL 




+ 


DA2* DAB 


* K2*CKNL 




+ 


DA2*/DA3 


* K3*CKNL 




+ 


DAI* DA 2 


* K3*CKNL 




+ 


/DAI* DA2* 


DA3* K4*CKNL 




+ 


/DAI* DA2* 


DA3* KS*CKNL 




+ 


DA4 


* K1*CKNH 




+ 


DA7 


*/QO*CKNH 




+ 


DAS 


* K2*CKNH 




+ 


DA6* DA7 


* K2*CKNH 




+ 


DA6*/DA7 


* K3*CKNH 




+ 


DAS* DA6 


* K3*CKNH 




+ 


/DAS* DA6* 


DA7* K4*CKNH 




+ 


/DAS* DA6* 


DA7* KS*CKNH 


QA.CMBF 


= 


VCC 




QB 


_ 


DBO 


* K1*CKNL 




+ 


DB3 


*/QO*CKNL 




+ 


DBl 


* K2*CKNL 




+ 


DB2* DB3 


* K2*CKNL 




+ 


DB2*/DB3 


* K3*CKNL 




+ 


DBl* DB2 


* K3*CKNL 




+ 


/DBl* DB2* 


DB3* K4*CKNL 




+ 


/DBl* DB2* 


DB3* K5*CKNL 




+ 


DB4 


* K1*CKNH 




+ 


DB7 


*/QO*CKNH 




+ 


DBS 


* K2*CKNH 




+ 


DB6* DB7 


* K2*CKNH 




+ 


DB6*/DB7 


* K3*CKNH 




+ 


DBS* DB6 


* K3*CKNH 




+ 


/DBS* DB6* 


DB7* K4*CKNH 




+ 


/DBS* DB6* 


DB7* KS*CKNH 


QB . CMBF 


= 


VCC 




/LSBI 


: = 


/LSBI :+: 


SEL*Q3*Q2*Q1* 


LSB 


: = 


LSBI 




/MSB I 


; = 


/MSB I 






: + 


: LSBI*SEL* 


23*Q2*Q1*Q0 


MSB 


: = 


MSBI 





COMBINATORIAL OUTPUT OF 
CLOCK MULTIPLIED BY THE 
DA0-DA3 INPUT WHEN SEL 
IS HIGH, AND DA4-DA7 
WHEN SEL IS LOW. 



ENABLE COMBINATORIAL 
OUTPUT FUNCTION. 

COMBINATORIAL OUTPUT OF 
CLOCK MULTIPLIED BY THE 
DB0-DB3 INPUT WHEN SEL 
IS HIGH, AND DB4-DB7 
WHEN SEL IS LOW. 



ENABLE COMBINATORIAL 
OUTPUT FUNCTION. 
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SIMULATION 
TRACE ON 



PRLDF 
SETF 



FOR J 
BEGIN 
IF J 



CLK QA QB QO Ql Q2 Q3 
SELOP LSB MSB DAO DAI 
DA2 DA3 DA4 DAS DA6 DA7 
DBO DBl DB2, DB3 DB4 DBS 
DB6 DB7 
/QO /Ql /Q2 /Q3 SEL /LSBI /MSBI 
/CLK /DAO /DAI /DA2 /DA3 
DBO /DBl /DB2 /DB3 
/DA4 DAS /DA6 /DA7 
DB4 DBS /DB6 /DB7 
:= TO 4 DO 



= 1 THEN 
BEGIN SETF DA2 DB2 DA6 DB6 
END 
IF J = 2 THEN 
BEGIN SETF /DA2 DA3 /DB2 DB3 
/DA6 DA7 /DB6 DB7 
END 
IF J = 3 THEN 



BEGIN SETF 

END 
FOR I := 1 

BEGIN 

SETF CLK 

SETF /CLK 

END 
END 
TRACE OFF 



DA2 DB2 DA6 DA6 



TO 32 DO 



SIMULATION SECTION. 
TRACE ESSENTIAL 
WAVEFORMS . 



PRELOAD INITIAL VALUES. 
SET INPUT DA0-DA3 = 0. 
SET INPUT DB0-DB3 = 1. 
SET INPUT DA4-DA7 = 2. 
SET INPUT DB4-DB7 = 3. 

GENERATE FOUR LOOPS. 
ON SECOND LOOP SET 
DA0-DA3 = 4, DB0-DB3 = S, 
DA4-DA7 = 6, DB4-DA7 = 7. 
ON THIRD LOOP SET 
DA0-DA3 = 8, DB0-DB3 = 9, 
DA4-DA7 = 10, DB4-DB7 = 11. 
ON FOURTH LOOP SET 
DA0-DA3 = 12, DB0-DB3 = 13, 
DA4-DA7 = 14, DB4-DB7 = 15. 



ON EACH LOOP APPLY 

32 CLOCK PULSES. 

OUTPUTS QA AND QB 

WILL GENERATE THE 

CLK DIVIDED BY 16 MULTIPLIED 

BY THE BINARY CODE AT THE 

SELECTED INPUT. 
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It is often necessary to interface analog inputs into a digital sys- 
tem. In digital instruments the signal being measured is usually 
an analog level. Transducers are used to provide a linear relation- 
ship between a measured input signal and a voltage output. An 
example of a transducer would be a strain gauge. In this case the 
amplitude of the voltage output would be a measure of mechani- 
cal stress. If the measuring device were microprocessor-based 
or minicomputer-based, the voltage level would have to be 
converted to a digital binary code before it could be read and proc- 
essed. An analog to digital converter would be required to accom- 
plish this task. 

Conversion Techniques 

There are a number of techniques used to provide analog to 
digital (A to D) conversion. The slowest methods of A to D 
conversion are based on counters, and would probably be used 
in instruments such as multimeters. An example would be the 
self-calibrating Dual Slope Integrator. The fastest AtoD convert- 
ers are based on look-up tables and might be used in video 
applications. In the case of a Flash Converter high conversion 
speed is obtained at the expense of a very large number of input 
comparator circuits. An eight-bitdigitaloutput code would require 
2'-1 comparator circuits at the input. The comparator outputs 
would drive a look-up table from which the closest binary weight- 
ing to the analog input would be referenced. Techniques to 
simplify the input circuitry would reduce the number of compara- 
tors required. A Half Flash converter would accomplish this but 
with some degradation in conversion speed. 



Successive Approximation 

A medium to high conversion speed can be realized with the 
Successive Approximation technique of conversion, and many 
circuits exist to perform this type of conversion. Figure 1 shows 
a diagram of an A to D converter that uses a PAL20RS10 as a 
Successive Approximation Register. The advantage of using the 
successive approximation technique of conversion is that the 
circuit is relatively simple when using a PALdevice, and the cycle 
of conversion is constant. Conversion using counters is no more 
simple, and does not give aconstant conversion time because the 
time required to produce the closest possible binary output is 
dependent on the amplitude of the input signal. 

The PAL20RS10 has been programmed as a state machine to 
perform the approximation cycle. The Design Specification title 
SAR8 (page 2-300) refers to the eight-bit Successive Approxima- 
tion Register. To provide more bits of resolution, a fourteen-bit 
SAR14 has been programmed into a PAL32R16 (page 2-303). 

Quantization Error 

Why would a system require additional bits of data in the SAR 
circuit? Suppose a two-digit code is used to represent a dynamic 
voltage range of V to 9 V. A linear mapping would be: 



BINARY CODE 


00 


01 


10 


11 


Input Voltage (Volts) 


0.0 


3.0 


6.0 


9.0 



EIGHT BIT DIGITAL OUTPUT 



SYSTEM CLOCK INPUT - 



SAR8 



GND 
^CC 

T 



CLK 
CMP 
POL 



PAL20RS10 



CC 

Qe 
Qs 
Q4 
Q3 
02 

Qi 
Qo 

EOC 



CLK 

OCTAL REGISTER 
LD 



- EOC (END OF COUNT) 



COMPARATOR 
CIRCUIT 



ANALOG INPUT 




Figure 1. Eight-Bit Successive Approximation Analog to Digitai Converter 
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For a two-digit code there exist 2^ codes which represent four 
discrete analog voltage levels. The difference between each 
voltage level is 3.0 V. In this simple example there are not enough 
binary bits to represent any voltages between each separate 
level. The difference between the actual value and the approxi- 
mation is the quantization error. The resolution can be increased 
and quantization error reduced by adding more bits to encode the 
data. With a three-digit code resolution improves: 



BINARY CODE 


VOLTAGE LEVELS (VOLTS) 


000 


0.0000 


001 


1.2857 


010 


2.5714 


011 


3.8571 


100 


5.1429 


101 


6.4286 


110 


7.7143 


111 


9.0000 



However, there remain voltage levels that cannot be accurately 
represented. In general, if we assume an input dynamic range 
(Vdr) is to be encoded into equal discrete levels (Vq) by a number 
(n) of bits of binary data, the relationship becomes: 

Vdr/(2"-1) = Vq. 

If a dynamic range of 1 V were represented by eight bits of digital 
data the input voltage level could be resolved to an accuracy of 
approximately 

10/(2=- 1) = approximately 40 mV. 

For some systems this might be accurate enough. To decrease 
quantization error even further the PAL32R16 has been pro- 
grammed to perform successive approximation with 14 bits of 
binary information. With the given figures the resolution is 

10/(2'*- 1) = approximately 0.6 mV. 

PAL20RS10 Implementation 

The design shown in Figure 1 is an eight-bit Ato D system. The 
PAL20RS10 has a clock input which synchronously clocks the 
state machine programmed into the PAL device. The Qo-Q^ 
outputs will contain the closest digital binary weighting to the 
analog input (to a resolution of 40 mV for a 10-volt range) after 
each approximation cycle. Q^ is the most significant bit. 

The CC signal is an active-HIGH Conversion Complete signal. 
CC can be converted to provide an additional bit of code to make 
a nine-bit SAR if increased system resolution were required. The 
EOC output is an End Of Count or Conversion signal. 

The input CMP is driven by the output of a very high tolerance 
comparator circuit. The TTL level output is fed to the CMP input 
and clocked into the selected register during each compare 
operation. The POL input is hard-wired HIGH or LOW; this 
influences the polarity of the CMP input. The CMP input can be 
either active-HIGH or LOW as selected by the POL control. 

The EN input is the ENabie control. If HIGH the SAR is configured 



for eight bits, and if LOW CC becomes an additional register to 
create a nine-bit SAR. The registers increase in significance by 
one and CC becomes O;,. 

An active-LOW RESET input has been added to the design so a 
synchronous reset may be applied to the circuit. When LOW, the 
clock input will set 0^ HIGH, CC, EOC, and Qf-Q^ LOW, shown 
in Figure 2 at t,. 

The SAR output (Q^ HIGH and Oe-Qo LOW) provides a binary 
input to the D to A converter, generating a corresponding analog 
signal. The comparator will compare this output with the current 
value of the analog input. If voltage level of the converted signal 
is greater than the current level of the analog input the binary 
weighting of the SAR contents is too large. The feedback from the 
comparator output to the CMP input will cause the contents of Q^ 
to be RESET on the next clock rising edge, t^ in Figure 2. If the 
level of the converted signal is less than the input, the binary 
weighting of the contents of the SAR is not large enough. The 
comparator feedback will cause the Q^ register contents to 
remain SET after the arrival of the clock rising edge t^. Simulta- 
neously Og will be SET HIGH to perform the next lesser significant 
approximation. 

At tj the result of the next CMP input is registered at Q^ while Q^ 
is SET for the next lesser significant compare. The entire cycle 
is completed with the contents of Q^-Qj containing the closest 
approximation of an eight-bit digital code to the current analog 
input. The end of one complete cycle finishes at t,„ and starts 
again att„. 

After the CMP input has been loaded into a register, the contents 
of that register must be held forthe duration of the approximation 
cycle. When the digital code is set up in the register file of the 
PAL20RS10 it may be synchronously loaded into an octal re- 
gister, as shown in Figure 1. CC will provide an enabling HIGH 
(Figure 2) and the digital data is registered at time t,^ in the cycle. 
The input to the comparator should be held constant during the 
conversion cycle in a SAMPLE and HOLD circuit. The EOC signal 
may be used as an interrupt input to the host processor, which can 
read the data over an eight-bit data bus by enabling the output of 
the octal register. 

The PAL20RS10 was selected because some of the outputs 
require more than the eight product terms available in standard 
PAL devices. The PAL20RS10 features product term steering, 
whereby sixteen product terms are shared between two outputs. 
To fit in the device, the outputs are arranged such that an output 
using more than eight product terms is adjacent to an output using 
less than eight product terms. 

The oscilloscope trace (Figure 3) shows one complete cycle of 
successive approximation. A PAL20RS10 device was pro- 
grammed as an SAR and configured with an AD558 D to A 
converter and a comparator circuit. The top trace shows the 
output of the comparator and the bottom trace shows the output 
of the D to A converter. Initially Q^ is set HIGH, and the CMP 
feedback holds the Q^ contents HIGH as 0^ is set HIGH for the 
next compare. O^ is SET HIGH as a result of the next compare. 
When Qj is set HIGH the D to A output is greater than the input 
voltage level so the content of Qj is reset LOW. When the cycle 
is complete the result HHLH HLLH is the closest binary weighting 
to the level of the analog input. 
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; EIGHT BIT SUCCESSIVE APPROXIMATION REGISTER 
;WITH CONVERSION COMPLETE AND END OF CONVERSION. 

TITLE SAR 8 
PATTERN 01 
REVISION 01 
AUTHOR CHRIS JAY 
COMPANY MMI SANTA CLARA 
DATE 29 JANUARY 1985 

CHIP SAR 8 PAL20RS10 



;PIN 


1 


2 


3 


4 


5 


6 




CLK 


/RST 


CMP 


POL 


NC 


NC 


;PIN 


7 


8 


9 


10 


11 


12 




NC 


NC 


NC 


NC 


EN 


GND 


;PIN 


13 


14 


15 


16 


17 


18 




/OE 


EOC 


CC 


Q3 


Q2 


Q4 


;PIN 


19 


20 


21 


22 


23 


24 




Ql 


Q5 


QO 


Q6 


07 


VCC 


EQUATIONS 













/Q7 := /RST*Q7*CMP*/Q6*/Q5*/Q4*/Q3 
*/Q2*/Ql*/Q0*/CC*/EOC*POL 

+ /RST*Q7*/CMP*/Q6*/Q5*/Q4 

*/Q3*/Q2*/Ql*/Q0*/CC*/EOC*/POL 

+ /RST*/Q7*/E0C 

Q6 := /RST*Q7*/Q6*/Q5*/Q4 

*/Q3*/Q2*/Ql*/Q0*/CC*/EOC 

+ /RST*Q6*/CMP*/Q5*/Q4 

*/Q3*/Q2*/Ql*/Q0*/CC*/EOC*POL 

+ /RST*Q6*CMP*/Q5*/Q4*/Q3 
*/Q2*/Ql*/Q0*/CC*/EOC*/POL 

+ /RST*Q6*Q5*/EOC 

+ /RST*Q6*Q4*/EOC 

+ /RST*Q6*Q3*/EOC 

+ /RST*Q6*Q2*/EOC 

+ /RST*Q6*Q1*/E0C 

+ /RST*Q6*Q0*/EOC 

+ /RST*Q6*CC*/E0C 

QO := /RST*Q1*/QO*/CC*/EOC 

+ /RST*Q0*/CMP*/CC*/EOC*POL 
+ /RST*Q0*CMP*/CC*/EOC*/POL 
+ /RST*Q0*CC*/EOC 

Q5 := /RST*Q6*/Q5*/Q4*/Q3 

*/Q2*/Ql*/Q0*/CC*/EOC 

+ /RST*Q5*/CMP*/Q4*/Q3 

*/Q2*/Ql*/Q0*/CC*/EOC*POL 

+ /RST*Q5*CMP*/Q4*/Q3 

*/Q2*/Ql*/Q0*/CC*/EOC*/POL 

+ /RST*Q5*Q4*/EOC 

+ /RST*Q5*Q3*/EOC 

+ /RST*Q5*Q2*/EOC 

+ /RST*Q5*Q1*/E0C 

+ /RST*Q5*Q0*/EOC 

+ /RST*Q5*CC*/E0C 

Ql := /RST*Q2*/Q1*/QO*/CC*/EOC 

+ /RST*Q1*/CMP*/QO*/CC*/EOC*POL 
+ /RST*Q1*CMP*/QO*/CC*/EOC*/POL 
+ /RST*Q1*QO*/EOC 
+ /RST*Q1*CC*/E0C 



ACTIVE LOW RESET WILL SET 
Q7 HIGH. THE CMP OR /CMP 
INPUT IS REGISTERED 
DEPENDING ON POL INPUT. 
Q7 IS HELD UNTIL THE END 
OF CONVERSION. 
ACTIVE LOW RESET WILL 
RESET Q6 LOW. Q6 GOES 
HIGH AFTER Q7 AND THE 
CMP OR /CMP INPUT IS 
REGISTERED. POL SELECTS 
CMP POLARITY. THE DATA 
REGISTERED IN Q6 IS 
HELD THROUGH THE WHOLE 
APPROXIMATION CYCLE Q5 
DOWN TO CONVERSION 
COMPLETE. THE EOC LOW 
SIGNAL, END OF CONVER- 
-SION CLEARS THE 
REGISTERS. 
SET REGISTER QO 
REGISTER CMP 
OR /CMP INPUT 
HOLD QO DATA 

SET REGISTER Q5 
AFTER Q6. DURING 
COMPARE CYCLE. 
REGISTER CMP 
OR /CMP INPUT 

HOLD Q5 DATA 
DURING THE 
APPROXIMATION 
CYCLE FROM Q4 
TO CONVERSION 
COMPLETE. 

SET REGISTER Ql HIGH 
AFTER Q2 REG CMP OR /CMP 
INPUT SELECTED BY POL 
HOLD Ql DATA FOR 
REST OF CONVERSION 
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Q4 := /RST*Q5*/Q4*/Q3*/Q2 

*/Ql*/Q0*/CC*/EOC 
+ /RST*Q4*/CMP*/Q3*/Q2 

*/Ql*/Q0*/CC*/EOC*POL 
+ /RST*Q4*CMP*/Q3*/Q2 

*/Ql*/Q0*/CC*/EOC*/POL 
+ /RST*Q4*Q3*/EOC 
+ /RST*Q4*Q2*/EOC 
+ /RST*Q4*Q1*/E0C 
+ /RST*Q4*Q0*/EOC 
+ /RST*Q4*CC*/E0C 

Q2 := /RST*Q3*/Q2*/Q1*/QO*/CC*/EOC 

+ /RST*Q2*/CMP*/Q1*/QO*/CC*/EOC*POL 
+ /RST*Q2*CMP*/Q1*/QO*/CC*/EOC8/POL 
+ /RST*Q2*Q1*/E0C 
+ /RST*Q2*Q0*/EOC 
+ /RST*Q2*CC*/E0C 

Q3 := /RST*Q4*/Q3*/Q2*/Q1*/Q0 

*/CC*/EOC 
+ /RST*Q3*/CMP*/Q2*/Q1*/Q0 

*/CC*/EOC*POL 
+ /RST*Q3*CMP*/Q2*/Q1*/Q0 

*/CC*/EOC*/POL 
+ /RST*Q3*Q2*/EOC 
+ /RST*Q3*Q1*/E0C 
+ /RST*Q3*Q0*/EOC 
+ /RST*Q3*CC*/E0C 

CC := /RST*Q0*/CC*/EOC 

+ /RST*CC*/CMP*/EOC*/EN*POL 
+ /RST*CC*CMP*/EOC*/EN*/POL 



EOC := /RST*CC*/EOC 



SET REGISTER Q4 
AFTER Q5 THEN 
COMPARE CYCLE 
REGISTER CMP OR 
/CMP INPUT. 

HOLD Q4 DATA 

DURING THE 

APPROXIMATION 

CYCLE OF 03 

DOWN TO CC 

RESET REGISTER Q3 

SET REGISTER Q2 HIGH 

AFTER Q3 REG CMP OR 

/CMP INPUT SELECT W/POL 

HOLD Q2 DATA 

FOR APPROXIMATION 

CYCLE TO CC. 

RESET REGISTER Q3 

THEN SET Q3. 

REGISTER THE CMP 

OR /CMP INPUTS 

AND HOLD CONTENTS 

OF Q3 DURING THE 

REST OF THE APP- 

-ROXIMATION CYCLE 

DOWN TO CC 



RESET CC. CC GOES HIGH 
AFTER QO TO INDICATE 
THAT CONVERSION IS NOW 
COMPLETE. IF EN IS LOW 
THEN CC CAN PROVIDE ONE 
ADDITIONAL BIT OF 
RESOLUTION. 

END OF COUNT 
GOES HIGH TO 
RESET THE SAR . 



SIMULATION 



TRACE_ON CLK /RST EN CMP POL 

07 06 05 04 Q3 Q2 01 00 CC EOC 

SETF OE /CLK RST EN CMP /POL 

CLOCKF 

SETF /RST 

CLOCKF 

FOR I := 1 TO 9 DO 

BEGIN CLOCKF 

END 

SETF /CMP 

FOR J := 1 TO 9 DO 

BEGIN CLOCKF CLK 

END 

SETF POL 

FOR K := 1 TO 9 DO 

BEGIN CLOCKF CLK 

SETF CMP 

FOR L := 1 TO 9 DO 

BEGIN CLOCKF CLK 

END 

CLOCKF 

END 

TRACE OFF 



SET INITIAL CONDITIONS 
AND RESET 06-00, CC 
AND EOC, SET Q7 
FOR NINE CLOCK CYCLES 
PERFORM ONE PASS OF 
SUCCESSIVE APPROXIMATION 
WITH CMP HIGH AND POL 
LOW. PERFORM SECOND 
PASS WITH CMP LOW 

SET POLARITY PIN ACTIVE 
TEST SUCCESSIVE CYCLE OF 
APPROXIMATION WITH THE 
INVERSION INPUT ACTIVE 
FOR CMP HIGH AND CMP 
LOW 
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PALASM SIMULATION, V2.21A MARKET VERSION (07-24-86) 
(C) - COPYRIGHT MONOLITHIC MEMORIES INC, 1986 
PALASM SIMULATION SELECTIVE TRACE LISTING 



Title : SAR 
Pattern : 01 
Revision : 01 



Author : CHRIS JAY 
Company : MMI SANTA CLARA 
Date : 29 JANUARY 1985 



SAR 

Page : 1 
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; FOURTEEN BIT SUCCESSIVE APPROXIMATION REGISTER 

TITLE SAR14 
PATTERN 01 
REVISION 01 
AUTHOR CHRIS JAY 
COMPANY MMI SANTA CLARA 
DATE 29 JANUARY 1985 

THE PAL32R16 HAS BEEN DESIGNED AS A FOURTEEN BIT SUCCESSIVE 
APPROXIMATION REGISTER FOR MEDIUM TO HIGH SPEED ANALOG TO 
DIGITAL CONVERSION. THE FOURTEEN OUTPUTS ARE CONNECTED TO 
THE INPUTS OF A FOURTEEN BIT DAC, WITH Q13 AS THE MOST SIG- 
NIFICANT BIT DOWN TO QO, WHICH IS THE LEAST SIGNIFICANT 
BIT. THE CC OUTPUT IS THE CONVERSION COMPLETE SIGNAL, WHEN 
HIGH IT INDICATES THAT THE CONTENTS OF THE FOURTEEN BIT REGISTER 
HOLDS THE CLOSEST BINARY WEIGHTING TO THE CURRENT VERSION OF 
THE ANALOG SIGNAL UNDERGOING CONVERSION. THE DAC OUTPUT AND 
THE ANALOG INPUTS ARE FED TO A HIGH QUALITY COMPARATOR CIRCUIT, 
THE OUTPUT OF WHICH IS A TTL LEVEL THAT DRIVES THE CMP COMPARE 
INPUT OF THE PAL32R16. 

CHIP SAR14 PAL32R16 



;PIN 


1 


2 


3 4 




5 


6 




7 


8 




Q13 


NC 


Q12 CC 


/OEl 


/RST 


CMP 


NC 


;PIN 


9 


10 


11 12 


13 


14 




15 


16 




NC 


VCC 


NC NC 


NC 


NC 




/PLD2 


/CLK2 


;PIN 


17 


18 


19 20 


21 


22 




23 


24 




Qll 


QO 


QIO Ql 


Q9 


Q2 




Q8 


Q3 


;PIN 


25 


26 


27 28 


29 


30 




31 


32 




/0E2 NC 


NC NC 


NC 


GND 


NC 


NC 


;PIN 


33 


34 


35 36 


37 


38 




39 


40 




NC 


NC 


/PLDl /CLKl 


Q7 


Q4 




Q6 


Q5 


STRING 


HI 3 


'Q13*/CC' 




HOLD 


EQUATION 


FOR 


Q13 






STRING 


H12 


•Q12*/CC*/RST' 


HOLD 


EQUATION 


FOR 


Q12 






STRING 


Hll 


•Q11*/CC*/RST' 


HOLD 


EQUATION 


FOR 


Qll 






STRING 


HIO 


•Q10*/CC*/RST' 


HOLD 


EQUATION 


FOR 


QIO 






STRING 


H9 


'Q9*/CC*/RST 




HOLD 


EQUATION 


FOR 


Q9 






STRING 


H8 


'Q8*/CC*/RST 




HOLD 


EQUATION 


FOR 


Q8 






STRING 


H7 


•Q7*/CC*/RST 




HOLD 


EQUATION 


FOR 


Q7 






STRING 


H6 


•Q6*/CC*/RST 




HOLD 


EQUATION 


FOR 


Q6 






STRING 


H5 


'Q5*/CC*/RST 




HOLD 


EQUATION 


FOR 


Q5 






STRING 


H4 


'Q4*/CC*/RST 




HOLD 


EQUATION 


FOR 


Q4 






STRING 


H3 


■Q3*/CC*/RST 




HOLD 


EQUATION 


FOR 


Q3 






STRING 


H2 


'Q2*/CC*/RST 




HOLD 


EQUATION 


FOR 


Q2 
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EQUATIONS 



Q13 



Q13*CMP*/Q12*/Q11*/Q10*/Q9*/Q8*/Q7 
*/Q6*/Q5*/Q4*/Q3*/Q2*/Ql*/QO*/CC 



H13*Q12 

H13*Q9 

H13*Q6 

H13*Q3 

H13*Q0 



H13*Q11 

H13*Q8 

H13*Q5 

H13*Q2 

RST 



H13*Q10 

H13*Q7 

H13*Q4 

H13*Q1 

CC 



Q12 := Q13*/Q12*/Q11*/Q10*/Q9*/Q8*/Q7*/Q6 
*/Q5*/Q4*/Q3*/Q2*/Ql*/Q0*/CC*/RST 
+ Q12*CMP*/Q11*/Q10*/Q9*/Q8*/Q7*/Q6 

*/Q5*/Q4*/Q3*/Q2*/Ql*/Q0*/CC*/RST 
+ H12*Q11 + H12*Q10 + H12*Q9 
+ H12*Q8 + H12*Q7 + H12*Q6 
+ H12*Q5 + H12*Q4 + H12*Q3 
+ H12*Q2 + H12*Q1 + H12*Q0 

Qll := Q12*/Q11*/Q10*/Q9*/Q8*/Q7*/Q6*/Q5 
*/Q4*/Q3*/Q2*/Ql*/Q0*/CC*/RST 
+ Q11*CMP*/Q10*/Q9*/Q8*/Q7*/Q6*/Q5 

*/Q4*/Q3*/Q2*/Ql*/Q0*/CC*/RST 
+ H11*Q10 + H11*Q9 + H11*Q8 
+ H11*Q7 + H11*Q6 + H11*Q5 
+ H11*Q4 + H11*Q3 + H11*Q2 
+ H11*Q1 + H11*Q0 

QIO := Qll*/Q10*/Q9*/Q8*/Q7*/Q6*/Q5*/Q4 
*/Q3*/Q2*/Ql*/Q0*/CC*/RST 
+ Q10*CMP*/Q9*/Q8*/Q7*/Q6*/Q5*/Q4 

*/Q3*/Q2*/Ql*/Q0*/CC*/RST 
+ H10*Q9 + H10*Q8 + H10*Q7 
+ H10*Q6 + H10*Q5 + H10*Q4 
+ H10*Q3 + H10*Q2 + H10*01 
+ H10*Q0 

Q9 := Q10*/Q9*/Q8*/Q7*/Q6*/Q5*/Q4*/Q3 

*/Q2*/Ql*/Q0*/CC*/RST 
+ Q9*CMP*/QB*/Q7*/Q6*/Q5*/Q4*/Q3 

*/Q2*/Ql*/Q0*/CC*/RST 
+ H9*Q8 + H9*Q7 + H9*Q6 
+ H9*Q5 + H9*Q4 + H9*Q3 
+ H9*Q2 + H9*Q1 
+ H9*Q0 

Q8 := Q9*/Q8*/Q7*/Q6*/Q5*/Q4*/Q3*/Q2 

*/Ql*/QO*/CC*/RST 
+ Q8*CMP*/Q7*/Q6*/Q5*/Q4*/Q3*/Q2 

*/Ql*/QO*/CC*/RST 
+ H8*Q7 + H8*Q6 + H8*Q5 
+ H8*Q4 + H8*Q3 + H8*Q2 
+ H8*Q1 + H8*Q0 
Q7 := Q8*/Q7*/Q6*/Q5*/Q4*/Q3*/Q2*/Q1 

*/QO*/CC*/RST 
+ Q7*CMP*/Q6*/Q5*/Q4*/Q3*/Q2*/Q1 

*/CC*/RST 
+ H7*Q6 + H7*Q5 + H7*Q4 
+ H7*Q3 + H7*Q2 + H7*Q1 
+ H7*Q0 



REGISTER CMP I/P 

HOLD RESULT OF COMPARE 
THE APPROXIMATION CYCLE 
Q12 TO QO CLEAR ON RESE 
CYCLE Q12 TO QO 
AND CONVERSION COMPLETE 

CLEAR Q12. DURING 
FIRST APPROXIMATION 
REGISTER COMPARE 
INPUT. 

HOLD CONVERSION RESULT 
DURING APPROXIMATION CY 
Qll TO QO CLEAR ON RESE 
AND CONVERSION COMPLETE 

HOLD Qll LOW 
BEFORE COMPARE 
CYCLE. REGISTER 
COMPARE INPUT. 
HOLD RESULT OF COMPARE 
DURING APPROXIMATION 
CYCLE QIO TO QO. RESET 
AND CC CLEARS REGISTER. 

HOLD QIO LOW 
BEFORE COMPARE 
CYCLE. REGISTER 
RESULT OF COMPARE 
HOLD RESULT ON 
APPROXIMATION 
CYCLE Q9 TO QO. 
RESET AND CC 
CLEARS REGISTER. 
HOLD Q9 LOW 
BEFORE COMPARE. 
REGISTER RESULT 
OF COMPARE. 
HOLD RESULT ON 
APPROXIMATION 
CYCLE Q8 TO QO . 
RESET AND CC 
CLEARS REGISTER. 
HOLD Q8 LOW 
BEFORE COMPARE 
REGISTER RESULT 
OF COMPARE. 
HOLD RESULT ON 
APPROXIMATION 
CYCLE Q7 TO QO 
HOLD Q7 LOW 
BEFORE COMPARE 
REGISTER RESULT 
OF COMPARE. 
HOLD CONTENTS 
OF RESULT ON 
APPROXIMATION 
CYCLE Q6 TO QO 
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Q6 := Q7*/Q6*/Q5*/Q4*/Q3*/Q2*/Q1*/Q0*/CC*/RST 

+ Q6*CMP*/Q5*/Q4*/03*/Q2*/Q1*/QO*/CC*/RST 

+ H6*Q5 + H6*Q4 + H6*Q3 

+ H6*Q2 + H6*Q1 + H6*Q0 

Q5 := Q6*/Q5*/Q4*/Q3*/Q2*/Q1*/Q0*/CC*/RST 

+ Q5*CMP*/Q4*/Q3*/Q2*/Q1*/Q0*/CC*/RST 

+ H5*Q4 + H5*Q3 + H5*Q2 

+ H5*Q1 + H5*Q0 

Q4 := Q5*/Q4*/Q3*/Q2*/Q1*/Q0*/CC*/RST 

+ Q4*CMP*/Q3*/Q2*/Q1*/Q0*/CC*/RST 

+ H4*Q3 + H4*Q2 + H4*Q1 

+ H4*Q0 

Q3 := Q4*/Q3*/Q2*/Q1*/Q0*/CC*/RST 

+ Q3*CMP*/Q2*/Q1*/Q0*/CC*/RST 

+ H3*Q2 + H3*Q1 + H3*Q0 

Q2 := Q3*/Q2*/Q1*/Q0*/CC*/RST 

+ Q2*CMP*/Q1*/Q0*/CC*/RST 

+ H2*Q1 + H2*Q0 

Ql := Q2*/Q1*/Q0*/CC*/RST 

+ Q1*CMP*/Q0*/CC*/RST 

+ Q1*Q0*/CC*/RST 

QO := Q1*/Q0*/CC*/RST 

+ QO*CMP*/CC*/RST 

CC := QO*/CC*/RST 



HOLD Q6 LOW BEFORE COMP 

REGISTER RESULT OF 

COMAPRE. HOLD RESULT 

ON APPROXIMATION 

Q5 - QO. 

HOLD Q5 LOW BEFORE COMP 

REGISTER RESULT OF COMP 

HOLD RESULT ON 

APPROXIMATION 

CYCLE Q4 - QO. 

HOLD Q4 LOW 

REGISTER COMPARE 

HOLD RESULT ON 

Q3 - QO. 

HOLD Q3 LOW 
REGISTER COMPARE 
HOLD RESULT ON 
CYCLE 02 - QO. 
HOLD Q2 LOW. 
REGISTER COMPARE 
HOLD RESULT 
ON Q2 - QO. 
HOLD Ql LOW. 
REGISTER COMPARE. 
HOLD RESULT. 

HOLD QO LOW. 
REGISTER COMPARE. 

HIGH CONVERSION 
COMPLETE . 



SIMULATION 



TRACE_ON CLKl CMP RST Q13 Q12 Qll QIC Q9 
Q8 Q7 Q6 Q5 Q4 Q3 Q2 Ql QO CC 

SETF /CLKl /CLK2 OEl 0E2 /PLDl /PLD2 RST /CMP 

CLOCKF CLKl CLK2 

CHECK Q13 /Q12 /Qll /QIO /Q9 /Q8 /Q7 /Q6 /Q5 

/Q4 /Q3 /02 /Ql /QO 
/RST 

CLKl CLK2 

I := 1 TO 14 DO 

CLOCKF CLKl CLK2 



SETF 

CLOCKF 

FOR 

BEGIN 

END 
SETF CMP 
FOR 

BEGIN 

END 
TRACE OFF 



I := 1 TO 14 DO 
CLOCKF CLKl CLK2 



rSET INITIAL 

CONDITIONS 
SYNCHRONOUS 
RESET CHECK 
INITIAL REGISTER 
STATE . 

APPLY 14 CLOCK 
PULSES WITH CMP 
INPUT LOW. 

SET CMP INPUT 
HIGH AND APPLY 
14 CLOCK PULSES. 
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Title : SAR14 
Pattern : 01 
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SAR14 
Page : 1 
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PAL Devices, PROMs, FIFOs, and Multipliers 
Team up to Implement Single-Board 
High-Performance Audio Spectrum Analyzer 



AN-100 



Introduction 

This application note illustrates a high-performance audiospec- 
trum analyzer This circuit can analyze high-fidelity audio sig- 
nals with a resolution of 20 Hz and an input bandwidth of 20 kHz. 
It is useful in production test, performance evaluation, or 
adjustment of high-fidelity audio equipment. The analyzer pro- 
vides a sweep generator output for rapid analysis of audio filter 
frequency response. 

The design techniques used to implement the analyzer are quite 
general, and can be applied to a wide variety of DSP tasks. An 
understanding of the approach used will suggest solutions to a 
number of DSP problems. The architecture chosen for the 
spectrum analyzer is controlled by a microprogram stored in 
PROM. Many other applications can be accommodated by 
changing the microprogram. The high performance of this 
architecture provides an attractive price/performance alterna- 
tive to other DSP approaches. 

Spectrum Analyzer Functions 

The spectrum analyzer requires many of the functions com- 
monly used in DSP. Figure 1 shows the analyzer functions. An 
input signal is mixed with a swept audio sinewave oscillator 
(below). 




Frequency Sweep 

The frequency sweep acts as a sampler, starting from DC and 
increasing to its maximum frequency. 

Mixing is accomplished by multiplying the input signal by the 
sinewave. From basic trigonometry: 

cos w-|t X cos W2t = 1/2 cos {wi + W2)t + 1/2 cos (wi -W2)t (1) 

The mixing process generates two new sinewaves whose fre- 
quencies are the sum and difference of the input sinewave 
frequencies. When the sinewave oscillator matches the fre- 
quency of an input signal component, a DC term is generated in 
proportion to the amplitude of that component: 

cos wit X cos wit = cos2 wit = 1+2 cos wit 

The DC term is extracted by a narrow lowpass filter. Due to the 
finite bandwidth of this lowpass filter, mixer output signals 
whose frequencies fall within the filter passband also appear at 
the filter output. As a result, the analyzer output will represent 
the energy contained in a range of frequencies, from the sine- 
wave frequency minus the filter cutoff frequency, to the sine- 
wave frequency plus the filter cutoff frequency. The effective 
bandwidth of the analyzer is twice the lowpass filter bandwidth. 



INPUT 
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LOWPASS 
FILTER 
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-^ OVERTICAL 
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SWEEP 
GENERATOR 



Figure 1. Spectrum Analyzer Functions 

A detector converts the lowpass filter output to a DC voltage 
representing the total energy in the filter passband. If this DC 
voltage is plotted on a vertical axis with the sinewave oscillator 
frequency (represented by the sweep voltage) controlling the 
horizontal axis, a spectrum of the input signal results. 

Other mixing schemes can be used to extract the spectrum. 
However, this "direct conversion" approach has two significant 
advantages. As shown in Figure 2, the swept oscillator output can 
be used to plot the frequency response of an audio filter. Other 
schemes require additional mixing to achieve the same results. 
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Figure 2. Filter Test Mode Setup 

The direct conversion scheme confines the frequencies of all 
signals following the mixer to the lowpass filter bandwidth. 
Limiting the signal bandwidth has great benefit when the 
analyzer is implemented digitally. This benefit can be better 
understood with a brief review of DSP theory. 
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Digital Signal Processing 
Theory Review 

Digital signal processing is accomplished by first converting the 
continuous analog input signal to a series of digital numbers. 
The digital numbers are then manipulated to perform the 
required signal processing. The processed digital numbers are 
then converted back to a continuous analog signal, completing 
the processing. The functions required for DSP are shown in 
Figure 3. 

ANALOG ►' DIGITAL ~ ANALOG ► 
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FILTER 
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SMOOTHING 
FILTER 
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Figure 3. DSP Functions 



Sampling 

Representing a continuous input signal w/ould require an infinite 
array of digital numbers. A finite collection of digital numbers 
can be obtained by considering the signal amplitude at discrete, 
periodic points in time. This process is called sampling, and is 
equivalent to multiplying the input signal by a periodic train of 
impulses of unit amplitude. The sampling theorem states that 
the input signal can be reconstructed without distortion if the 
input is bandlimited to contain no frequency components 
greater than half the sampling frequency. The sampling theorem 
means that the discrete samples completely represent the input 
signal, as long as the bandwidth constraint is met. 

Aliasing 

What is really happening during the sampling process? Consider 
the Fourier series representation of a periodic unit impulse train. 
It can be shown that: 



f(t) = ^ cos (27rk fgt), k = 0, 1, 

k = - 00 

1 
where f^ 



2, 3,. 



(2) 



sample period 

The periodic impulse train is equivalent to a series of sinusoids 
consisting of all harmonics of the sampling frequency, including 




FREQUENCY— DC fs/2 (s 2 Is 3fs 

Figure 4. Aliasing Spectra of Figure 3 DSP Functions 



a DC term. Recalling Eq. (1) all possible sum and difference 
frequencies will be generated when the impluse train and the 
input signal are multiplied. This process is shown graphically in 
Figure 4. Observe that if the input contains frequencies greater 
than half the sampling frequency, the spectra in Figure 4 will 
overlap. This overlap phenomenon is known as aliasing distor- 
tion, and introduces noise in the signal. 

Another consequence of the sampling process is that high- 
frequency signal components near a harmonic of the sampling 
frequency will be mixed to produce new signal components near 
DC. These new components have frequencies within the desired 
signal passband, but are really "alias" high-frequency compo- 
nents. The phenomenon is called aliasing. 

To eliminate the undesirable effects of aliasing, a continuous 
analog lowpass filter is placed before the sampler. Thisa/Zas/ng 
filter removes frequency components beyond the fg/S limit. 

Quantizing 

The input samples are converted to a series of digital numbers by 
an analog-to-dig ital (A/D) converter. The A/D converter operates 
by quantizing the continuous sample amplitude into a finite 
number of amplitude ranges, and then assigning a digital 
numbertorepresentthequantizedamplitudevalue. Asmightbe 
expected, this process introduces noise in the signal, known as 
quantization distortion. The quantization distortion is in the form 
of a "white" or broadband random noise, whose RMS ampli- 
tude is: 



1 o-2b 



= -1-2 
12 



(3) 



where b is the number of bits in the output 
digital word, excluding the sign bit 

The effect of aliasing on quantization noise is to alias high 
frequency noise components to the DC to fs/2 range. The result- 
ing noise spectral density is equivalent to a white noise of ampli- 
tude a2, bandlimited to fs/2. 

Dynamic Range 

The A/D output contains a finite number of bits. Dynamic range 
is defined as the ratio of the maximum-to-minimum signal 
amplitude that can be represented by the digital numbers. 
Dynamic range is determined by the number of bits in the digital 
numbers, and by the noise "floor." 

For a digital number containing b bits plus a sign bit, the 
dynamic range would be: 

Dynamic range (dB) = 10 log.|o 2~'^^ (4) 

The noise floor is the sum of all noise components that can 
appear at the DSP output. The primary noise factors are 
quantization noise and limit cycle noise (to be discussed 
shortly). Digital filtering will affect the noise floor by eliminating 
components of the noise signal. For example, the quantization 
noise at the DSP output is: 



NQ(dB) = 10logiopM'] 



(5) 



where BW is the net bandwidth of the digital filters 

The noise components are uncorrelated, and are therefore 
combined by adding the power of each noise component. 
Remember that 



Power (absolute) = log 



^o-'[ 



Power (dB)/10 



(6) 
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The resulting dynamic range is: 

Dynamic range (dB) = 10 log^Q 



X'N 



0.5 



S Noise Power 



(7) 



where 0.5 = the maximum mean-squared amplitude 

The overall dynamic range Is the lesser of the result given by Eq. 
(4) or Eq. (7). In a practical system, the width of the digital 
numbers can vary. The dynamic range is usually calculated for 
all critical points in a digital system, with the overall dynamic 
range being the worst case value. 

Digital Processing 

The digital numbers from the A/D converter are manipulated to 
process the signal. Carrier generation, filtering, and nonlinear 
operations are performed by appropriate "number crunching". 

Generation of sinusoidal carriers is easily accomplished using a 
linear ramp function (digital up/down counter) and converting 
the results to sinusoidal samples using ROM lookup tables. 
Alternately, recursive equations can produce the desired carriers. 

Nonlinear operations on the digital numbers must be handled 
with care. Since aliasing is always present in the sampled 
domain, harmonics generated by nonlinear operations can alias 
to lower frequencies. The aliasing occurs "immediately," since it 
can be shown that performing a nonlinear operation in the 
sampled domain is equivalent to first performing the nonlinear 
operation on a continuous signal and then sampling the result 
without bandlimiting the sampler input. 

The sampling rate can be changed to improve the efficiency of 
the digital processing. For example, discarding every other 
digital number would reduce the effective sampling rate by a 
factor of two. If the processing at the higher sample rate 
includes digital aliasing filters to remove components greater 
than half the lower sample rate, the requirements of the 
sampling theory are still met. The sampling rate can be 
increased by repeating digital sample values. This repetition is 
equivalent to a "sample and hold" operation, and modifies the 
signal spectrum by 



F(jw)=F(jw)X§i!li^) 
wT/2 



(8) 



where w = 27r x f req 

T= input (longer) sample period 
The effects of changing the sampling rate are best determined 
by plotting the resulting aliasing spectra. 

Digital Filtering 

Digital filtering is accomplished using multiplication, addition, 
and delay. For example, consider the biquadratic filter section 
in Figure 5. If z~^ is defined to be a unit sample period delay 
operator, then the input-to-output transfer function of the 
biquadratic section is: 



H(z) = 



1 +a-|Z 



-1 



-2 



(9) 



1 +biz ' +boz' 



j.,^"'' +b2' ^ 

The biquadric sections can be cascaded to implement higher- 
order filters. 

The Laplace transform of a unit delay is e"^"", where T is the 
delay period. Remember that z~Vepresents an inverse opera- 
tor, so that z X z""" = I.Thus, 
,-„ sT 




X'n-2 

X' N = Xn -bi X' N-1 - b2 X' n_2 
Yfj = X' N +ai X' N_1 + 32 X' n_2 

Figure 5. Digital Biquadratic Filter Section 

Digital filter poles and zeroes (in the z-plane) can be mapped 
into the s-plane to determine the equivalent analog filter 
function, and vice-versa. The digital filter section of Eq. (9) 
corresponds to an analog biquadratic filter section with. 



H(s) 



2 

3 +a-| w.| ; 



(11) 



'1 



where s = a + jw 



(10) 



However, the periodic nature of the e^"'^ function causes the 
digital filter passband to repeat periodically. The effect is the 
same as aliasing. The analog filter response is mixed with the 
sampling frequency harmonics to generate the true digital filter 
response. 

Designing Digital Filters 

How does one go about designing a digital filter? One approach 
is to perform a least mean squared error optimization using a 
computer. The desired function is specified, and the computer 
adjusts the a^ and b^ values until the desired response is 
achieved. 

A second approach is to design an equivalent analog filter and 
then convert that design to a digital filter. This approach has 
great merit, since analog filter design theory is well developed. 
However, the digital passband will be distorted if the analog 
equivalent filter has significant response to frequencies greater 
than fs/2. The aliased passbands overlap at that point. 

To circumvent this problem, the analog filter function can be 
modified to compensate for the aliasing effects. The analog 
transfer response is modified using several transforms to 
compensate for aliasing. Unfortunately, the nature of the 
s-plane to z-plane mapping is such that no transform can 
compensate for all aliasing effects without introducing other 
forms of distortion. 

The standard (or impulse invariant) z-trans(orm represents a 
direct mapping to the z-plane. No frequency, amplitude, or 
phase distortion is introduced, but aliasing effects are not 
compensated. This transform should be used when the analog 
filter has negligible response to frequencies greater than f2/2. 

The bilinear z-transform preserves the filter amplitude response 
in the presence of aliasing. However, the bilinear transform 
Introduces a distortion or warping of the frequency axis. As a 
result, only the filter cutoff frequency can be accurately trans- 
formed, using apre-warping technique. Frequencies within the 
filter passband remain warped, introducing phase distortion in 
the digital filter response. The bilinear transform is used when 
the filter amplitude response is more critical than the phase 
response. 
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The matched z-transform preserves the filter phase response at 
the expense of amplitude response distortion. However, this 
amplitude distortion, unlike the aliasing distortion, can be 
corrected by placing additional zeroes in the transfer function. 
The matched transform is used when the filter phase response is 
critical, and either the amplitude response is not critical or the 
additional compensation zeroes can be accommodated. 
Performing the transforms by hand is quite tedious. Fortunately, 
computer programs are widely available which handle the 
complete filter synthesis procedure, including z-transforms and 
pre-warping. 



Limit Cycle Noise 

An effect of using digital numbers with a finite number of bits is 
the generation of quantization noise. When implementing digital 
filters, the quantization noise introduces oscilaltions that are 
analogous to ringing In analog filters. These oscillations are 
cal led limit cycles. The limit cycle generates a noise which peaks 
at frequencies corresponding to the filter pole frequencies. The 
noise power is roughly proportional to pole Q. Limit cycle noise 
for a second order filter section of Equation (1 1) is given by: 



2 -2b 1 + r 
NL(dB) = 10logio(— 2 



1 



2,4 



l-r'^r +1-2r'^cos2w 



(12) 



where b = number of digital number bits (excluding sign bit) 
pole freq. = w.| 
pole Q = 1/«-| 



W = 277 



pol e freq. 
f„ 



expl 



2 • pole Q 



The limit cycle noise must be calculated for each complex pole 
pair, and adjusted to reflect the response of subsequent filter 
stages to the limit cycle frequency. Computer programs can 
calculate limit cycle noise power, including all of these considerations. 

Output Signal Reconstruction 

Once manipulation of the digital sample numbers is complete, 
the resulting digital numbers must be converted back to a 



continuous analog signal. Referring back to Fig. 3, a digital-to- 
analog (D/A) converter transforms the digital numbers to a 
series of analog output pulses. 

A sample-and-hold (S&H) circuit eliminates transients that are 
introduced during the D/A conversion process. The spectrum of 
the S&H output is modifed as follows: 



S&H F' (jw) 



t sin 
F(jw)- 



(wt/2) 



wt/2 



where t = hold time T = sample period 

An output smoothing filter completes the reconstruction by 
removing all components with frequencies greater than fs/2. 
The smoothing filter is often optional, depending on the impor- 
tance of removing the high-frequency output components. 

The spectral effects of reconstruction are shown in Figure 4. 

Implementing the Spectrum Analyzer 

The architecture used for the spectrum analyzer is shown in 
Fig. 6. Input signals are digitized and buffered with FIFOs before 
interface with a common 16-bit data bus. The 16-bit arithmetic 
unit (AU) provides multiply and accumulate operations. A 16-bit 
wide RAM stores intermediate results. A 16-bit temporary 
register facilitates z~^ delays and data movement. Outputs 
are provided using a D/A converter and S&H circuits. 

The VGO output is buffered using FIFOs to provide a uniform 
high-speed sample rate. The VGO output is 12 bits wide, 
providing a signal-to-quantization noise ratio of 91 dB, using 
Equations (5) and (7). The calculation assumes a 500-Hz band- 
width. A smoothing filter at the VGO output is not necessary. The 
filter test configuration of Figure 2 allows the input aliasing filter 
to remove the effects of VGO high-frequency components, as 
long as the filter under test is a linear analog circuit. 

The vertical and horizontal outputs are intended to interface an 
oscilloscope or X-Y plotter. The sampling of these outputs can 
be non-uniform, as long as the outputs track each other. The 
elastic storage at the input and VGO interfaces permits arbitrary 
non-uniform processing of the analyzer functions. 

The 16-bit resolution of the internal data word provides 90-dB 
dynamic range according to Equation (4), or 115-dB dynamic 
range according to Equations (5) and (7), assuming 500-Hz 
bandwidth and no limit cycle noise and aliasing. 
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Figure 6. Analyzer Architecture 



Microprogram control was selected for the analyzer. PAL devi- 
ces can efficiently implement sequential state machines. It is 
possible to encode all control information in PAL devices, but 
only three packages w^ould be saved (one PROM and two 
buffers). Distributing the control among several PAL devices 
would reduce flexibility and make corrections very difficult. The 
few extra packages required for microprogram control provide 
an extremely flexible architecture and greatly simplify the PAL 
device functions. 



With the theoretical background and architecture in mind, the 
spectrum analyzer functions can be defined in detail. The objec- 
tive Is to realize a circuit capable of high-resolution analysis of 
audio signals in the DC to 20-kHz range. Selectable bandwidth 
and linear/logarithmic output display are highly desirable. 
Detailed functions are shown in Figure 7. 
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Figure 7. Detailed Functional Diagram 
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Input Aliasing Filter 

An analog lowpass filter removes high-frequency components 
from the input signals. With a sample frequency of 50 kHz and a 
maximum input frequency of 20 kHz, the lowest aliasing fre- 
quency is (50-20) = 30 kHz. 

An eighth-order Chebychev filter with 0.1-dB passband ripple 
will provide 44 dB of attenuation at 30 kHz, and 86 dB attenuation 
at 50 kHz. It is desirable to provide at least 60-dB overall dynamic 
range for high-performace analysis. To eliminate spurious 
responses above the -60-dB "floor," the input signal should have 
all components above 30 kHz suppressed by at least (60-44) = 
16 dB. Most input signals will meet this requirement. If not, 
additional filtering must be provided. 

S&H and A/D Converter 

The input S&H maintains a constant sampled signal level while 
the A/D conversion is in progress. No sin X/X correction is made 
for ttiis S&H since the net effect of the S&H plus A/D action is an 
Impulse sample at the start of the "hold" period. 

The A/D conversion time should be less than 16 ns. The /VD 
converter output digital number should "saturate" when the 
input signal exceeds the maximum level. The digital numbers 
should be in inverted two's complement form. The S&H 
acquisition time should be less than 4 ns. 
For a full 60-dB overall dynamic range, a 1 2-bit /VD is required. 



The calculation of sin y and cos y can take place at a reduced 
sample rate to save processing time. Only the last execution of 
equations (13) and (14) must be performed at the full 50-kHz 
sample rate. 

A linear ramp is generated to provide horizontal drive for an 
oscilloscope or X-Y plotter. The ramp is incremented each time 
the sin y and cos y values are updated, tracking the VCO sweep. 
When the ramp value overflows, the analyzer sweep cycle is re- 
initialized. 




t>— Yn 



IJ'n-I "'^' = 1.biz-Ub2Z-2 



Figure 8. All-Pole Digital Filter Section 



Mixer 

The mixer multiplies the A/D output by the swept sinewave 
oscillator value. The multiplication produces sum and difference 
frequencies, according to equation (1). 
Two's complement fractional arithmetic is used throughout the 
analyzer. Multiplication cannot overflow, since all numbers are 
less than 1 in magnitude. 

Swept Sinusoidal Oscillator (VCO) 

A precision swept sinusoid from DC to 20 kHz must be 
generated to mix with the input signal. A technique particularly 
well suited to this application is solving the two equations: 

sin (x+y) = sin x cosy + cos X sin y (13) 

cos (x+y) = cos x cosy -sin X sin y (14) 

These two trigonometric identities generate a new sin and cos 
value with y representing the phase shift per sample period. The 
technique is a "CORDIC" algorithm, based on coordinate 
rotation. Exact results are produced, but truncation and round- 
off errors due to the finite digital word length can cause a slow 
change or "drift" of carrier amplitude. Fortunately, the swept 
sinusoid is periodically reset in the spectrum analyzer, arresting 
this amplitude drift. 

The VCO frequency is swept by varying the value of y. However, 
since equations (13) and (14) require sin y and cos y, an identical 
CORDIC algorithm is used to obtain these values. To sweep the 
VCO, then, sinA and cosA are placed in RAM, selected by the 
bandwidth setting. These are two fixed numbers originally stored 
in PROM, and represent the frequency shift per sample time. 
Equations (13) and (14) are then applied to calculate sin y and 
cos y, which represent the desired phase shift per sample time. 
Equations (13) and (14) are executed again to generate the 
actual sinusoidal output. 



Digital Filters 

Fig. 8 shows the implementation of the all-pole digital filter 
sections. Because of the low pole values in all filters, the 
second order sections can be simply cascaded to implement 
high-order filters. Fig. 8 shows a technique for handling 
coefficients greater than 1 with fractional number representa- 
tion. 

Scaling must be performed to ensure maximum dynamic range. 
Filter sections with high-Q poles will show peaking of signals 
near the pole frequencies. The input to such sections must be 
scaled down to prevent overflow of the arithmetic. For a second- 
order all-pole section, this peaking factor is exactly the Q of the 
poles. Thus, when a given second-order section has a pole Q of 
2, the input signal to that section must be multiplied by 0.5 to 
prevent overflow. When the Q is less than or equal to 1, no 
scaling is performed. 

Saturation arithmetic is not provided in this architecture. Careful 
scaling eliminates the need for saturation arithmetic, since the 
/VD will saturate at a precisely known value. 



Aliasing Filters 

Two 4th-order Chebyschev filters permit reduction of the 
sample rate following the mixer. Each filter provides 0.3 dB 
passband ripple and at least 68 dB attenuation of aliasing 
components. The slightly high passband ripple is acceptable, 
since subsequent filters will dominate the composite passband 
shape. 

The first filter permits a sample rate reduction factor of 16. It is 
designed with a passband cutoff frequency of 479 Hz and a 
sample rate of 50 kHz. Eq. (12) predicts the limit cycle noise for 
this filter to be -58 dB. 



2-312 



Monolithic 



ioM 



Memories 



Audio Spectrum Analyzer 



The second filter permits a second sample rate reduction factor 
of two. Its cutoff frequency is 219 Hz, with a sample rate of 
3.125 kHz. Equation (12) predits the limit cycle noise forthis filter 
to be -83 dB. This filter also provides an additional 14-dB 
attenuation of the limit cycle noise generated in the first aliasing 
filter, reducing the limit cycle noise from the first filter to -72 dB. 

These two filters permit an overall fg reduction factor of 32 
before processing the Bessell filter, detector, and linear-to-log2 
functions. This results in a very substantial throughput improve- 
ment. Net execution time is determined by the time to execute a 
given function multiplied by the sample rate for that function. 
Thus 32 instructions at the reduced rate will increase the net 
execution time by an amount equivalent to only 1 instruction at 
the full sample rate. 

Fig. 9 shows the aliasing spectra of the sample rate reduction 
process. 



at the 500-Hz bandwidth, increasing to -67 dB at the 20-Hz 
bandwidth. 

Detector 

A square-law detector provides a DC signal corresponding to 
the energy at the lowpass filter output. From trigonometry: 
. a2 



(A cos wt)*^ 



(1 + cos 2wt) 



(15) 



The detector output contains the desired DC term and a single 
undesired term at frequency 2w. If the square law is ideal (easy 
in the digital domain), no additional terms are produced. The 
elimination of harmonics ensures the accuracy of the detector 
with f3/32 = 1.5625 kHz. The highest component is always less 
than fs/64 with a 250 Hz maximum lowpass filter cutoff 
frequency. However, 2w can be anywhere from DC to 500 Hz as 
the VCO sweeps past the spectral component. 



INPUT FILTER 




i I I I I I I I I 



1ST ALIASING 

FILTER 

OUTPUT 

(fs = 3.125 kHZ) 

2ND ALIASING 

FILTER 

OUTPUT 

(fs = 1.56 kHZ) 



20 30 

FREQUENCY (kHz) 



Figure 9. Aliasing Spectra for (3 Reduction 

Lowpass Filter 

A 4th-order Bessel lowpass filter determines the overall band- 
width of the analyzer. The overall bandwidth is twice the 
bandwidth of this filter. Overall bandwidths of 20, 50, 100 or 500 
Hz are provided by loading the proper set of filter coefficients 
into RAM when the bandwidth is selected. 

The Bessel filter provides an optimal transient response for the 
analyzer. Good transient response is important, especially at 
narrow bandwidths, since the spectral peaks are swept with 
respect to the titer passband. The net effect is similar to pulsing 
the filter input. Because the phase response is critical, the 
matched-z transform is used to convert the analog Bessel design 
to the z-domain. 

The second aliasing filter provides 3 dB of attenuation at 250 Hz. 
When cascaded with the Bessel filter, which also provides 3 dB 
attenuation at 250 Hz in the 500 Hz bandwidth mode, the 
response at this bandwidth is modified. However, since the 
overall bandwidth is relatively broad, good transient response is 
still achieved. Cascading these two filters provides a "transitional" 
filter with a Bessel response at low attenuation and a 
Chebyschev response at high attenuation. At bandwidths less 
than 500 Hz, the combination produces an optimal tradeoff 
between transient response and resolution. 

Analysis of Equation (12) reveals that limit cycle noise increases 
exponentially as the pole frequency is reduced. Operating the 
lowpass filter at the lowest possible sampling frequency (1 .5625 
kHz) minimizes limit cycle noise, in addition to improving 
throughput. Limit cycle noise for the lowpass filter will be -95 dB 
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Figure 10. Detector Sweep Filtering 

Fig. 10 illustrates a technique to render the effects of the 2w 
terms negligible. The analyzer passband is divided into n equal 
intervals. The VCO sweep rate is controlled so that the VCO 
sweeps BW/n per fg/32 interval. The detector is followed by a 
single-pole lowpass filter with a 3 dB frequency of BW/n. As the 
VCO sweeps a component through the passband, the DC term is 
present in all n intervals, but the 2w term can affect only one 
interval. The worst-case DC error is 1/n for an ideal cutoff, and 
is multiplied by (20+2-'' +2-3 +2-4+2-5 +2-5-5 +2-6 + 
2-6.5 + 2-7 + 2-7-25 + ....) = 1 .85 due to the finite 6 dB/octave 
rolloff of the single-pole filter. Further analysis reveals that: 



1.85 



10 



e/10 



-1 



(16) 



0.1 dB, 



where e represents the resulting error in dB. For e 

n = 80. 

In the filter test mode, the signal frequency and VCO frequency 

are the same, forcing w = 0. The detector has no error in this 

mode, but has a 3 dB gain due to the second DC term. 

The detector output represents signal energy. Each bit in the 
detector output word thus represents only 3 dB, and 21 bits are 
required to reflect a 60 dB dynamic range. Double precision 
arithmetic is required for the detector ouput and the single-pole 
filter. The 67C7560 multiplier will handle double precision calcu- 
lations with a time penalty. Fortunately, the calculations to be 
performed are simple and the operations take place at the 
minimum sample rate, reducing the impact on throughput. 

Linear-to-Log Conversion 

The architecture of Figure 6 is customized to provide an efficient 
algorithm for linear-to-logarithmic output conversion. The RAM 
address generator monitors the 8 MSBs of the data bus, and can 
provide a number indicating the MSB position of a positive 
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number. This output is used to retrieve a lookup table value. 
This value is used to scale the data word to quickly left-justify 
the MSB. A second 4-point lookup table is then used to improve 
the accuracy of the resulting log2 conversion. A 1092 con- 
version is adequate, since: 

1092 ^ 

1° 109210 

Equation (17) demonstrates that the output can be displayed in 
decibels by setting the oscilloscope or X-Y plotter Y-axis gain to 
the proper value. 

Two lookup tables provide .027 dB accuracy for output values 
from to -45 dB, and 3 dB accuracy from -45 to -84 dB. The 
logarithmic accuracy is limited by the 10-bit output word length 
to the D/A. This output can represent to -84 dB in .082 dB 
increments. The accuracy of the 4-point lookup is therefore 
sufficient. 

The logarithmic conversion procedure is as follows: 
1 ) If the MSB of the data word is not among the 8 MSBs into the 
address generator, multiply the word by 2^ = 128, and 
increment the output number by 7. Repeat until the data 
word is greater than 2~^ , but no more than three times. 
Set a flag if this step is executed more than once. 



2) Look up the appropriate scale factor, from 2° to 2^. Add 
log 2 of the scale factor to the output word. The conversion 
is now accurate to 3 dB. 

3) If the flag was not set during step 1, multiply the data word 
by the scale factor to left-justify the MSB position. 

4) If the flag was not set during step 1, retrieve an intercept 
and slope value from the 8-word lookup table (four pairs 
available). Perform a linear Interpolation using: 

x' = a X + b 

where a is the slope value (18) 

bis the intercept value 

The conversion is now accurate to .027 dB. 

5) Scale the result to provide 84-dB output range with a 
10-bit word. 

6) Subtract 2"1 from the output to convert it to two's com- 
plement form for the D/A. 

The calculations are double precision for steps 1, 2, and 3, and 
single precision thereafter. The conversion sequence can be by- 
passed using a strap option to provide a linear amplitude output 
from to -30 dB. 
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Control Logic 

Figure 11 shows a simplified schematic of the analyzer. All 
critical components are shown. Bypass capacitors and some 
component input connections are omitted for clarity. 

The microprogram is stored in three 63S281 PROIVIs. The 
microcode word formats are shown in Figure 1 2. A wide, highly- 
parallel instruction word ensures maximum flexibility and pro- 
gram efficiency. 

Eight PAL devices interpret the instruction word and control the 
analyzer. Two additional PAL devices generate a 50-kHz strobe 
from the 8-MHz master clock, and implement the output D/A 
multiplexer The control PAL devices function as follows: 

Sequencer: A PAL20X8 implements an 8-bit instruction sequen- 
cer. The sequencer performs the following operations: 



CI 


CO 











1 


1 





1 


1 



ex Operation 

X Increment by 1 (execute next instruction) 
Increment by 2 (skip next instruction) 
Jump to address 
No increment (repeat current instruction) 



The OX input conditions the sequencer. Conditional 
branches or skip operations can be implemented. The 
sequencer will increment if the conditional requirement is 
not met. 

Condition detector: A PAL16C1 monitors up to twelve status 
flags, and generates OX. The microcode word includes a 4-bit 
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condition word, CXO through CX3. CX will be zero under the 
following conditions: 

CX3 CX2 CXI CXO Condition for CX = 















Always (unconditional operation) 













Sample strobe (SS) = 1 













AU overflow (OVF) = 1 












AU busy (AUB) = 1 





1 








Input sample ready (INR) = 





1 







Loop counter timeout (LLC) = 





1 







LLC = 1 





1 






Address control (AC3-AC0) = 











AGO = 











AGO = 1 












AG1 = 











AC1 = 1 




1 







AG2 = 




1 






AC2 = 1 




1 






AG3 = 




1 






AGS = 1 



The assignment is made using the flexible PAL device coding, 
and is optimized for the analyzer The user can select a different 
set of conditions by reprogramming the PAL device. 
When the microcode repre sents a constant (Type 1 microinstruc- 
tion — see Figure 12) the CON input forces CX = 1 to suppress 
conditional operations. CX is also used to suppress certain 
strobes in the analyzer, providing conditional arithmetic 
operations. 

Loop counter: A PAL16R6 implements a 6-bit programmable 
down counter. This counter controls iteration loops and provides 
a timeout signal to the condition detector. The counter is preset 
via aType 2 microinstruction, and can be decremented by other 
Type2 microinstructions. The counter will halt when zero count 
is reached. Up to 64 iterations can be accommodated with 
minimal overhead. 

Address control: A PAL22V10 provides indexed addressing for 
the 32x1 6 RAM, and analyzes the eight MSBs of the data bus for 
conditional operations. If D15 represents the data bus sign bit, 
then 0P1-0P3 will provide the following functions: 

0P3 0P2 0P1 AC3-AC0 Output Function 

Clear (0000) 

1 Increment 

1 Decrement 

1 1 Preset to D15-D12 (Sign + 3 MSB) 

1 Preset to D14-D11 (4 MSB) 

1 1 Preset to D11-D8 (Address load) 

I 1 MSB position 

II 1 No change 

The ADE input enables a chang e in the address word. The 
address word will not change if ADE = 1. 

The MSB position function indicates the position of the MSB for 
positive numbers. AG3 represents sign bit D15. This output 
should be zero. AC2-AG1 represent the position of the first 1 
following the sign bit. Code 0000 indicates that D15-D8 are all 0. 

Input/RAM control: Miscellaneous FIFO input and RAM control 
is provided by a PAL 10L8. The 67401 FIFO includes input ready 
(FIR) and output ready (FOR) signals, which are latched using 
the input/output shift clocks to generate two flags. The first flag 
(FR) resets the FIFO when input ready (latched) goes low, 
indicating the FIFO capacity is exhausted. The latched output 
ready signal flag represents input sample ready (INR). The INR 



flag is used as a sequencer condition to synchronize wait loops. 
Use of the FR and INR flags maintains proper fill of the FIFO. 

The RAM address LSB (AG) and read-write line (R/W) are 
decoded and latched. These signals are provided directly by 
Type IV microinstructions. 

Notice that a clocked register function requires two PAL 
combinatorial outputs per bit, while a transparent latch function 
requires only one PAL output per bit. 

Arithmetic unit control: The variety of functions listed in Table 3 
indicate the utility of the arithmetic unit (AU). A PAL16R6 pro- 
vides simplified control of the AU. 

The PAL devices and AU load signal provide conditional arith- 
metic operations. Gating the load i nput w ill suppress the start of 
a new arithmetic operation. When GXE is high, the operation is 
performed unconditionally. When GXE is low, the operation is 
performed only if GX is low. Combining conditional jumps and 
conditional AU operations provides a high degree of program 
flexibility. 

The PAL dev ice m onitors the AU instructions and generates a 
busy signal (AUB). A counter in the PAL device keeps track of 
variable-length operations to pr ovide the correct output for any 
instruction sequence. The AUB signal conditions the sequencer 
to synchronize the microprogram to the AU operation. Microp- 
rogramming is simplified as a result. 

The PAL device also gates the input Fl FO shift out clock (INS) to 
eliminate transients while providing a full 125-ns pulse for proper 
FIFO operation. 

Data strobe generator: A PAL10L8 provides a number of 
transient-free, gated strobes. These strobes provide control of the 
analyzer data flow. The PAL device interprets the microinstruc- 
tion to determine the proper microinstruction type, and gener- 
ates the strobes accordingly. 

The PAL device also generates an 8-MHz buffered clock, as 
shown in Figure 1 1 . The crystal oscillator circuit provides inde- 
pendent AC and DC feedback, permitting reliable operation with 
the PAL device. 

Strap/output sample control: A PAL16L8 generates additional 
control strobes for the output sample-and-hold circuits. 

The PAL device also provides a tristate buffer function, connect- 
ing control straps to the data bus for certain conditional jump 
operations. Two straps select the desired analyzer bandwidth/ 
sweep rate, and the third strap selects linear or logarithmic 
output. 
Signal output 

The VCO output must be sampled at precise intervals to avoid 
phase modulation effects. Three 67401 FIFOs buffer the VCO 
samples, which are generated during the 50-kHz input process- 
ing. A 12-bit D/A converter provides better than 91 dB signal-to- 
distortion ratio. The S&H circuit provides VCO outputs at pre- 
cise 50-kHz intervals, and removes spikes that are generated in 
the D/A converter All necessary control signals are generated 
by the strap/output data control PAL device. 

The horizontal and vertical outputs normally drive an X-Y plotter or 
oscilloscope. There is no need to buffer these signals as long as 
the two outputs track each other. The D/A used for the VCO 
output is shared by adding two PAL12H6 chips programmed as 
multiplexers. Use of PAL devices requires fewer packages than a 
TTL multiplexer. Additional S&H circuits decode the multiplexed 
D/A output to separate the output signals. 
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Microprogram 

The architecture can implement a variety of DSP functions. A 
microprogram, stored in 63S281 PROIVIs, customizes the archi- 
tecture to perform the spectrum analyzer tasl<s. The microin- 
struction formats were summarized in Figure 12. The algorithms 
to be implemented were discussed in the previous section. The 
step-by-step implementation of these algorithms is converted to 
a sequence of microinstructions to form the microprogram. The 
procedure is analogous to programming a microprocessor 

Operation of the microprogram is better understood by consid- 
ering the allocation of the 74S218 RAM locations, shown in 
Figure 13. The microprogram consists of two parts. High-speed 
input processing provides the carrier generation, mixing, alias- 
ing filterand lowpassfilterfunctions. Figure 13a shows the RAIVI 
allocation during input processing. The input segment includes 
an efficient iteration loop, using the PAL device loop counter, to 
process the 50-kHz functions. The carrier frequency shift and 
lowpass filter functions are processed at the fs/32 reduced sam- 
ple rate for maximum throughput efficiency. 

The values of sin A, cos A, and the Bessell filter coefficients 
depend on the analyzer bandwidth strap selection. These values 
are stored in a "table" area in Fig. 13, andean be easily changed. 
The fixed aliasing filter coefficients are stored as constants in the 
microprogram itself. 



Once the input processing is complete, coefficients located in 
the table area can be changed. This area is re-used by the output 
program segment to hold the scale factors for the linear-to-log 
conversion routine. The detector functions are processed, and 
the logarithmic conversion is started with the RAIVI allocation of 
Figure 13(b). The table area is then reloaded with the interpola- 
tion coefficients (Figure 13(c)) to complete the logarithmic con- 
version. Shaded areas in Figure 13 provide temporary data and 
flag storage for the routines. 

The microprogram samples the strap settings and loads the table 
area with the appropriate coefficients for input processing. The 
detector filter coefficient (bi ) is also determined and loaded. 
The input processing is then repeated. This sequence repeats 
indefinitely. The coefficient loading technique makes efficient 
use of RAM capacity while eliminating elaborate jump 
sequences. All coefficient table updates are processed at the 
minimum sample rate for best efficiency. 

The PAL device controllers simplify the microprogram. A PAL 
device provides hardware iteration loops. The AU controller 
eliminates wasteful "NO-OP" instructions otherwise required to 
allow completion of AU operations. The input control PAL 
device simplifies handshaking with the input logic. With the 
benefit of the PAL device controllers, the analyzer microprogram 
easily fits into the 256-instruction capacity of the PROMs. 
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TITLE 

PATTERN 

REVISION 

AUTHOR 

COMPANY 

DATE 



AN-100 DSP Counter 

DSPCOUNT 

1 

Marc Baker 

Monolithic Memories 

August 20, 1987 



CHIP DSPCOUNT PAL20X8 

CLK NC 0P3 0P4 OPS 0P6 0P7 CO CI /CX NC GND 
/OE NC A7 A6 A5 A4 A3 A2 Al AO /LOAD VCC 



EQUATIONS 



LOAD = 
/AO 



/Al 



/A6 



/A2 : = 

+ 
: + : 

+ 

/AS : = 

+ 
: + ; 

+ 

/A4 : = 

+ 
: + : 

+ 

/A5 : = 

+ 
: + ; 

+ 



+ 
: + ; 



/A7 : = 



/CO* CI* CX 

/LOAD*/A0 
/CO* CI* CX 
/C0*/C1 

/L0AD*/A1 
/CO* CI* CX 
/C0*/C1* AO 
C0*/C1* CX*/AO 

/L0AD*/A2 
/CO* CI* CX 
/C0*/C1* Al* AO 
C0*/C1* CX* A1*/A0 

/L0AD*/A3 
/CO* CI* CX*/0P3 
/C0*/C1* A2* Al* AO 
C0*/C1* CX* A2* A1*/A0 

/L0AD*/A4 
/CO* CI* CX*/0P4 
/C0*/C1* A3* A2* Al* AO 
C0*/C1* CX* A3* A2* A1*/A0 



;FED BACK TO EQUATIONS 

HOLD/ INCREMENT BY 2 
LOAD 
INCREMENT BY 1 

HOLD 
LOAD 

INCREMENT BY 1 
INCREMENT BY 2 

HOLD 
LOAD 

INCREMENT BY 1 
INCREMENT BY 2 

HOLD 
LOAD 

INCREMENT BY 1 
INCREMENT BY 2 

HOLD 
LOAD 

INCREMENT BY 1 
INCREMENT BY 2 



/L0AD*/A5 ;HOLD 

/CO* CI* CX*/0P5 ;LOAD 

/C0*/C1* A4* A3* A2* Al* AO ; INCREMENT BY 1 

C0*/C1* CX* A4* A3* A2* A1*/A0 ; INCREMENT BY 2 



/L0AD*/A6 
/CO* CI* CX*/0P6 

/C0*/C1* A5* A4* A3* A2* Al* AO 
C0*/C1* CX* A5* A4* A3* A2* A1*/A0 



HOLD 
LOAD 
INC BY 1 
INC BY 2 



/L0AD*/A7 ;HOLD 

/CO* CI* CX*/0P7 ;LOAD 

/C0*/C1* A6* A5* A4* A3* A2* Al* AO ; INC BY 1 

C0*/C1* CX* A6* A5* A4* A3* A2* A1*/A0 ; INC BY 2 



2-318 



^ 



MonolUhiG 



m 



Memories 



^ 



Audio Spectrum Analyzer 



SIMULATION 

TRACE ON CLK A7 A6 A5 A4 A3 A2 Al AO 



; Trace CLK and outputs 



SETF OE /CO CI CX /0P7 /0P6 /0P5 /0P4 /0P3 

CLOCKF CLK ;Load all Os 

CHECK /A7 /A6 /A5 /A4 /A3 /A2 /Al /AO 

SETF CO /CI 



CLOCKF CLK 
CHECK /A7 /A6 


/A5 


/A4 


/A3 


/A2 


Al 


/AO 


; Increment 
;to 2 


by 


2 


CLOCKF CLK 
CHECK /A7 /A6 


/A5 


/A4 


/A3 


A2 


/Al 


/AO 


; Increment 
;to 4 


by 


2 


CLOCKF CLK 
CHECK /A7 /A6 


/A5 


/A 4 


/A3 


A2 


Al 


/AO 


; Increment 
;to 6 


by 


2 


SETF /CO 
CLOCKF CLK 
CHECK /A7 /A6 


/A5 


/A 4 


/A3 


A2 


Al 


AO 


; Increment 
;to 7 


by 


1 


CLOCKF CLK 
CHECK /A7 /A6 


/A5 


/A4 


A3 


/A2 


/Al 


/AO 


; Increment 
;to 8 


by 


1 


SETF CO CI 
CLOCKF CLK 
CLOCKF CLK 
CHECK /A7 /A6 


/A5 


/A4 


A3 


/A2 


/Al 


/AO 


;Hold 
;Hold 
;to 8 







TRACE OFF 
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TITLE AN-100 DSP Condition 

PATTERN DSPCOND 

REVISION 1 

AUTHOR Marc Baker 

COMPANY Monolithic Memories 

DATE August 20, 1987 

CHIP DSPCOND PAL16C1 

CX3 CX2 CXI CXO SS OVF AUB INR /LLC GND 
/CON ACO ACl AC 2 /CX COMP_CX AC 3 NC NC VCC 

EQUATIONS 

CX = /CX3*/CX2*/CX1*/CX0* CON 
+ /CX3*/CX2*/CX1* CXO* SS* CON 
+ /CX3*/CX2* CX1*/CX0* OVF* CON 
+ /CX3*/CX2* CXI* CXO* AUB* CON 
+ /CX3* CX2*/CX1*/CX0*/INR* CON 
+ /CX3* CX2*/CX1* CXO* LLC* CON 
+ /CX3* CX2* CX1*/CX0*/LLC* CON 

+ /CX3* CX2* CXI* CX0*/AC3*/AC2*/AC1*/AC0* CON 
+ CX3*/CX2*/CX1*/CX0*/AC0* CON 
+ CX3*/CX2*/CX1* CXO* ACO* CON 
+ CX3*/CX2* CX1*/CX0*/AC1* CON 
+ CX3*/CX2* CXI* CXO* ACl* CON 
+ CX3* CX2*/CX1*/CX0*/AC2* CON 
+ CX3* CX2*/CX1* CXO* AC2* CON 
+ CX3* CX2* CX1*/CX0*/AC3* CON 
+ CX3* CX2* CXI* CXO* AC3* CON 

SIMULATION 

SETF /CON ;SET /CX HIGH 

CHECK /CX 

SETF CON /CX3 /CX2 /CXI /CXO ;CX3-CX0=0 - SET /CX LOW 
CHECK CX 

SETF CXO SS ;CX3-CX0=1 - SET /CX LOW 

CHECK CX 

SETF CXI /CXO OVF ;CX3-CX0=2 - SET /CX LOW 

CHECK CX 

SETF CXO AUB ;CX3-CX0=3 - SET /CX LOW 

CHECK CX 

SETF CX2 /CXI /CXO /INR ;CX3-CX0=4 - SET /CX LOW 
CHECK CX 

SETF CXO LLC ;CX3-CX0=5 - SET /CX LOW 

CHECK CX 
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SETF CXI /CXO /LLC 
CHECK CX 



;CX3-CX0=6 - SET /CX LOW 



SETF CXO /AC3 /AC2 /ACl /ACO 
CHECK CX 



;CX3-CX0=7 - SET /CX LOW 



SETF CX3 /CX2 /CXI /CXO 
CHECK CX 



;CX3-CX0=8 - SET /CX LOW 



SETF CXO ACO 
CHECK CX 



;CX3-CX0=9 - SET /CX LOW 



SETF CXI /CXO 
CHECK CX 



;CX3-CX0=10 - SET /CX LOW 



SETF CXO ACl 
CHECK CX 



;CX3-CX0=11 - SET /CX LOW 



SETF CX2 /CXI /CXO 
CHECK CX 

SETF CXO AC2 
CHECK CX 



;CX3-CX0=12 - SET /CX LOW 



;CX3-CX0=13 - SET /CX LOW 



SETF CXI /CXO 
CHECK CX 



;CX3-CX0=14 - SET /CX LOW 



SETF CXO AC3 
CHECK CX 



;CX3-CX0=15 - SET /CX LOW 
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TITLE 

PATTERN 

REVISION 

AUTHOR 

COMPANY 

DATE 



AN-100 DSP Address Control 

ADDCONT 

1 

Marc Baker 

Monolithic Memories 

August 21, 1987 



CHIP ADDCONT PAL22V10 

CLK 0P3 0P2 OPl /ADE D15 D14 D13 D12 NC NC GND 
NC NC Dll DIO ACO ACl AC2 AC3 D9 D8 NC VCC 
GLOBAL 



EQUATIONS 



ACO := /OP3*/OP2* OP1*/ACO* ADE 



/OP3* OP2*/OP1*/ACO* ADE 

/0P3* 0P2* OPl* D12* ADE 
OP3*/OP2*/OPl* Dll* ADE 
OP3*/OP2* OPl* D8 * ADE 
0P3* 0P2*/0P1* D14* ADE 
0P3* OP2*/OPl*/D14*/D13* D12* ADE 
0P3* 0P2*/0P1*/D14*/D13*/D12*/D11* DIO* ADE 
0P3* OP2*/OP1*/D14*/D13*/D12*/D11*/D10*/D9*D8* 
0P3* 0P2* OPl* ACO ;HOLD 

ACO* /ADE ;HOLD 



;INC 

;DEC 

;D12 

;D11 

;D8 

;MSB EQUATIONS 



ADE 



ACl := /OP3*/OP2* OPl* AC1*/AC0* ADE 
+ /OP3*/OP2* 0P1*/AC1* ACO* ADE 

/0P3* 0P2*/0P1* ACl* ACO* ADE 

/0P3* OP2*/OP1*/AC1*/ACO* ADE 

/0P3* 0P2* OPl* D13* ADE 
OP3*/OP2*/OPl* D12* ADE 
OP3*/OP2* OPl* D9 * ADE 
0P3* 0P2*/0P1* D14* ADE 
0P3* OP2*/OPl*/D14* D13* ADE 

0P3* 0P2*/0P1*/D14*/D13*/D12*/D11* DIO* ADE 
0P3* OP2*/OP1*/D14*/D13*/D12*/D11*/D10* D9* ADE 
0P3* 0P2* OPl* ACl ;HOLD 

AC1*/ADE ;HOLD 



;INC 

;INC 

;DEC 

;DEC 

;D13 

;D12 

;D9 

;MSB EQUATIONS 



2-322 



^ 



MonolHMo 



m 



Memories 



Audio Spectrum Analyzer 



AC2 : = 


/OP3*/OP2* 0P1*/AC2* ACl* 


ACO* 


ADE 




INC 


+ 


/OP3*/OP2* OPl* AC2*/AC1* 




ADE 




INC 


+ 


/OP3*/OP2* OPl* AC2* 


/ACO* 


ADE 




INC 


+ 


/0P3* 0P2*/0P1*/AC2*/AC1* 


/ACO* 


ADE 




DEC 


+ 


/0P3* 0P2*/0P1* AC2* ACl* 




ADE 




DEC 


+ 


/0P3* 0P2*/0P1* AC2* 


ACO* 


ADE 




DEC 


+ 


/0P3* 0P2* OPl*. D14* ADE 








D14 


+ 


OP3*/OP2*/OPl* D13* ADE 








D13 


+ 


OP3*/OP2* OPl* DIO* ADE 








DIO 


+ 


0P3* 0P2*/0P1* D14* ADE 








MSB EQUATIONS 


+ 


0P3* 0P2*/0P1* 013* ADE 










+ 


0P3* 0P2*/0P1* D12* ADE 










+ 


0P3* 0P2*/0P1* Dll* ADE 










+ 


0P3* 0P2* OPl* AC2 








HOLD 


+ 


AC 2* /ADE 








HOLD 


AC3 : = 


/OP3*/OP2* 0P1*/AC3* AC2* 


ACl* 


ACO* 


ADE 


INC 


+ 


/OP3*/OP2* OPl* AC3*/AC2* 






ADE 


INC 


+ 


/OP3*/OP2* OPl* AC3* 


/ACl* 




ADE 


INC 


+ 


/OP3*/OP2* OPl* AC3* 




/ACO* 


ADE 


INC 


+ 


/0P3* OP2*/OPl*/AC3*/AC2* 


/ACl* 


/ACO* 


ADE 


DEC 


+ 


/0P3* OP2*/OPl*/AC3* AC2* 






ADE 


DEC 


+ 


/0P3* OP2*/OPl*/AC3* 


ACl* 




ADE 


DEC 


+ 


/0P3* OP2*/OPl*/AC3* 




ACO* 


ADE 


•DEC 


+ 


/0P3* 0P2* OPl* D15* ADE 








D15 


+ 


OP3*/OP2*/OPl* D14* ADE 








•D14 


+ 


OP3*/OP2* OPl* Dll* ADE 








•Dll 


+ 


0P3* 0P2* OPl* AC3 








•HOLD 


+ 


AC 3* /ADE 








•HOLD 


SIMULATION 











TRACE ON CLK AC3 AC2 ACl ACO 



SETF ADE /0P3 /0P2 /OPl 

CLOCKF CLK 

CHECK /AC3 /AC2 /ACl /ACO 



; CLEAR TO 



SETF OPl 

CLOCKF CLK 

CHECK /AC 3 /AC 2 /ACl ACO 

CLOCKF CLK 

CHECK /AC3 /AC2 ACl /ACO 



; INCREMENT TO 1 



; INCREMENT TO 2 



CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CHECK AC3 /AC2 /ACl /ACO 



; INCREMENT TO 8 
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SETF 0P2 /OPl 

CLOCKF CLK 

CHECK /AC 3 AC 2 ACl ACO 

CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CHECK /AC 3 /AC 2 ACl ACO 

SETF OPl /D15 D14 /D13 D12 /Dll 

CLOCKF CLK 

CHECK /AC3 AC2 /ACl ACO 

SETF 0P3 /0P2 /OPl 

CLOCKF CLK 

CHECK AC3 /AC2 ACl /ACO 

SETF OPl DIO /D9 D8 

CLOCKF CLK 

CHECK /AC 3 AC 2 /ACl ACO 

SETF OP 2 /OPl 

CLOCKF CLK 

CHECK /AC3 AC2 ACl ACO 

SETF /D14 

CLOCKF CLK 

CHECK /AC3 AC2 /ACl ACO 

SETF OPl 

CLOCKF CLK 

CHECK /AC 3 AC 2 /ACl ACO 

SETF /0P3 /0P2 /OPl /ADE 

CLOCKF CLK 

CHECK /AC3 AC2 /ACl ACO 

TRACE OFF 



DECREMENT TO 7 



DECREMENT TO 3 



SET TO D15-D12 



SET TO D14-D11 



;SET TO D11-D8 



CHECK MSB 



CHECK MSB 



HOLD 



;HOLD 
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Bus interface plays an important role in computer architecture 
design. As systems become more complicated, the bus require- 
ment becomes more critical in terms of speed, power, accessibil- 
ity and feasibility. There are many different kinds of buses in the 
electronics industry. Each of the buses is specially designed to 
fit into a special architecture for various purposes. There are 
many stringent rules and regulations to follow depending on the 
application to design a bus architecture or bus interface. The 
application depends on timing (synchronous or asynchronous), 
type (military or commercial), architecture (open or closed), data 
structure (serial or parallel), computer type (micro, mini or others) 
or a mix of the above. 

There have been nearly one hundred buses since the invention 
of the computer. A great deal of them have faded away because 
the application no longer exists, or have been replaced by new 
technology. 

The following are some current major buses: 

DEC Unibus 
Future Bus 
IEEE 488 Bus 
Multibus I! 
Nu Bus 
PC Bus 
VME Bus 
QBus 

Disregarding the complexity and different varieties of buses, 
there are four basic sections (Figure 1) in the bus structure: 

• Address Section 

• Data Section 

• Control Section 

• Utility Section 



BUS INTERFACE 

— rm — 



-*- ADDRESS SECTION 
-♦■ DATA SECTION 
-*■ CONTROL SECTION 



-*■ UTILITY SECTION 



Figure 1. Four Sections of the Bus interface 

Address Section 

The Address Section consists of the address lines and control 
portion. The address lines are used by the processor to indicate 
to memory and other peripherals the location with which it wants 
to communicate. The address width varies from bus to bus. For 



example, VME, Nu and Multibus II are 32 bits wide. DEC Unibus, 
PC Bus and Q Bus are 1 8, 20 and 22 bits wide, respectively. The 
width of the address line has gradually expanded with each new 
bus announcement. As the width expands, the address decoding 
becomes more complex. One solution to this complex address 
decoding problem is to use PAL devices such as the PAL1 6L8/ 
20L8/20L10. The advantage of using these PAL devices is to 
utilize their wide input range (up to 1 6/20) to address up to 64/1 M 
addresses by a single PAL device. 

Data Section 

The Data Section consists of the data lines and control portion. 
The data lines carry instruction and data between the processor 
and the peripherals, including the memory. The control portion 
provides support to the data lines. In most cases, data can be 
addressed as byte, word or double-word. For instance, PC Bus, 
DEC Unibus, and Nu Bus can be addressed as 8, 1 6 and 32-bits 
wide, respectively. VME and Multibus li can both be addressed 
as8, 16, 24 or 32-bits wide. Theexternalbusdatacommunication 
requires handshaking. As long as the protocol is well defined, it 
can be implemented easily by the PAL16R8/20R8/20X10. If a 
change is needed after the circuit is designed, we can simply 
modify the PAL device file, in other words, by using PAL devices 
for data communication, flexibility can be achieved with minimum 
hardware change. 

One of the major problems when dealing with data transfer is 
synchronization. Synchronization is when events occur concur- 
rently with a regular or predictable timing relationship, typically 
based on a system clock. The data transfer control functions for 
this type of synchronous bus can be implemented by PALdevices 
such as the 20R8. in asynchronous buses, the data lines com- 
municate through handshaking instead of the data clock. This 
can be tedious when using discrete logic. The asynchronous 
devices PAL1 6RA8/20RA1 can be very useful for implementing 
such designs. 

Another problem is when both ends of the data lines have different 
data rates, inthiscase we need FIFO memories. FIFOmemories 
match the data rates and serve as temporary buffers between 
source and destination. 

Control Section 

The Control Section coordinates the operation of all bus interface 
circuitry. This is the heart of the bus interface design. The major 
architecture differences can be observed here. Generally, any 
device that is capable of controlling bus operation is called a "bus 
master". Devices that operate on the bus but cannot control it, i.e. 
the memory system, i/O ports, etc. are called "slaves". Most of 
the buses are designed to allow more than one bus master or a 
variety of intelligent devices, i.e. a second CPU, disk controller or 
network controller, on the bus. 
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Functionally, the Control Section can be split into three 
subsections: 

• Interrupt Subsection 

• DMA Subsection 

• Arbiter Subsection 

Interrupt Subsection 

Interrupts are important to all bus systems. These signals are 
designed to allow infrequent or important peripherals to get the 
attention of the bus master. A maskable interrupt can be ignored 
by the bus master, while a non-maskable interrupt can not. 
Vectored interrupt inputs provide priority to the interrupt controller 
in case more than one interrupt request should occur simultane- 
ously. The number and type of interrupt inputs vary widely among 
buses. 

To determine which device caused the interrupt in order to 
execute the correct interrupt-handling routine, two methods can 
be used: 

1 . Software Method — In this method, a polling program checks, 
one by one, each of the devices connected to the interrupt line, 
to determine which one is interrupting. When an interrupting 
device is found, the routine then branches to the appropriate 
interrupt handler. This scheme is considerably slow due to the 
software overhead involved. 

2. Hardware Method — This method is more efficient than the 
software scheme. The interrupted peripheral will provide an 
interrupt vector which forms the branching address to the 
interrupt-handling routine at the time it occurs. This is called 
automated interrupt-vectoring (Figure 2). After the Interrupt 
Request (IR) and the vector are issued by a peripheral, the 
Interrupt Handler arbitrates, grants and serves that interrupt. 
The VN/1E and DEC Unibus use this interrupt technique. The 
required branching address is loaded in the address register. 
Ordinary registered PALdevices such as the 20R8/20X10 to- 
gether with the 20L8/20L1 are suitable for this technique. A 
design example of the interrupt controller for the DEC Unibus 
is shown on page 2-328. This example demonstrates a near 
4 to 1 chip count saving by using a PAL device design. Obvi- 
ously, the real estate and debugging time shrink and the 
reliability increases. 



In the eventthat several interrupts occur simultaneously, apriority 
scheme is used to decide which I/O device should be granted 
control of the bus. The most popular one is the Fixed Priority 
Scheme. This assigns a fixed priority number to each device. 
This number will be there until the next configuration change. The 
other scheme is more complex and is rarely used, called the 
Programmable Priority Scheme. In this scheme, the priority can 
be changed dynamically by the software program. This scheme 
may be useful in applications such as strategic military or dynamic 
financial marketing where the strategy changes with various 
battles or investment environments; but, the software design 
may be very complex. However, both schemes can be accom- 
plished by using sequencer devices such as the PMS14R21. 
This powerful device can serve as a local distributed processor 
within the Bus Interface Unit. 

Direct IViemory Access (DIVIA) Subsection 

The Direct Memory Access concept exists because the ordinary 
memory transferring speed often isn't fast enough. The transfer 
speed of a microprocessor is limited by the software overhead 
involved in transferring successive words: several instructions 
must be executed in order to transfer a word or a chain of words. 
In such cases where the ordinary memory transferring speeds 
may not be sufficient, as in a floppy disk or a frame buffer, DMA 
is required. The DMA Controller implements the transfer algo- 
rithm in hardware and automates the word or block transfers 
directly between the memory and the I/O device. 

Two techniques are used: 

1 . The processor is halted or suspended by the DMA Controller 
(DMAC). The DMA dominates the bus until the end of the 
transfer. 

2. The DMAC steals a memory cycle from the processor and 
inserts a data transfer between the memory and the I/O 
device. This is called cycle-stealing DMA. 

These methods can also be used in a combined fashion. 



INTERRUPT 
HANDLER 



BUS 
REQUESTER 



INTERRUPTER 1 

r 



INTERRUPTER 2 

T- 



-*- ARBITRATION BUS 



Figure 2. Bus Interrupt Example 
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DMAC design can involve complex handshake sequences be- 
tween the processor and an I/O device. It contributes significantly 
to the system's cost and complexity and thus it is not normally 
used in small systems. However, with the availability of PAL 
devices, DMA applications are becoming more popular. The 
handshake sequence and complex discrete logic design can be 
replaced by asynchronous PAL devices such as the PAL16RA8/ 
20RA10 and programmable sequencers such as the 
PMS14R21/A. This can significantly decrease the cost, real 
estate and debugging time. 

Arbiter Subsection 

Bus Arbiter designs are becoming more critical and complex as 
microprocessor costs decrease. This is because it is becoming 
more cost-effective to design systems with multiple processors 
sharing global resources. In the bus interface, the arbiter acts as 
a judge to decide who should own the proprietorship of the bus 
and for how long. The arbiter decides which Requester should be 
granted control of the Bus when several requests happen simul- 
taneously. 

The function of the arbiter consists of: 

1 . Granting the use of the bus to one of the bus masters. 

2. Scheduling requests from multiple bus masters for optimum 
bus use. 

3. Preventing simultaneous use of the bus by two bus masters. 

In other words, a successful arbiter design can prevent bus 
hangup and increase bus bandwidth. 

Four major types of arbitration are: 

1. Prioritized Arbiter— This priority scheme provide a pre-as- 
signed number to each bus master to arbitrate the execution 
order. 

2. Round-robin Arbiter — ^This scheme assigns the bus master 
on a rotating priority basis. 

3. Single-Level Arbiter — This scheme provides only single level 
arbitration. However, it relies on a bus grant daisy-chain to 
arbitrate the request. 

4. Fairness Arbiter — This scheme prohibits the winning module 
from being serviced again until all pending requests have 
been serviced. 

An example of using the PAL22V10 to perform the Prioritized 
Arbiter is described on the application note on page 2-347. The 
arbitration process and PAL device equations are also provided. 



An example of an arbiter used for resolving a bus contention 
between two bus masters fighting for control is shown in 
Figure 3. 

When both master 1 (M,) and master 2 (M^) issue a Bus Request 
(BR), the Arbiter will grant the Bus Grant (BG) to the one with 
higher priority. Assuming it is Mj, M^will then drive Bus Busy (B 
Busy) active and release the BR signal. After Mj finishes its op- 
eration, the Arbiter detects that the BR signal is still low and 
transfers the bus control to M,. 
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REQUESTER M2 
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Figure 3. Arbitration Example: Bus Contention 

Utility Section 

The Utility section provides support to each of the previous 
sections. It links the entire bus interface between modules. The 
Utility Section consists of the System Clock Distribution, Power 
Monitor and System Diagnostics. The Power Monitor detects 
power failures and signals the system to start an orderly shut- 
down. Whenpoweristhenreappliedtothesystem.itensuresthat 
each module is initialized accordingly. For example, the power 
failure and powerupsequencesforthe VME bus are 4 ms and 200 
ms, respectively. This can help the processor to react to these 
external interrupts according to the system specification. A 
system without diagnostics is an incomplete system. System 
Diagnostics can be implemented by a Programmable Sequencer 
such as the PMS14R21/A (page 3-123). 

Future System Expansion 

With the rapid rate of improvement of technology, the useful life 
of a new system is increasingly getting shorter. Therefore, the 
future system expansion consideration can be a major design 
feature to system designers as well as a selection guideline to a 
buyer. The key to system expansion in a general computer 
system is the bus through which the CPU communicates with the 
present and future system components. Design flexibility, high 
throughput, multiprocessor support, and low power consumption 
soon become critical design considerations for bus interface 
techniques. PLDs can be the solution for bus interface, not only 
because of their flexibility in terms of possible hardware modifica- 
tion, but also because of their real estate saving, which in turn 
lowers the manufacturing cost and power consumption. 
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Functional Description 

One of the more established computer families is the Digital 
Equipment Corporation's PDP-1 1 series. This family of compu- 
ters uses the DEC unibus to communicate between cards. A 
specific protocol is required to interface a card to the unibus. 
This protocol is described in the available DEC literature. 

Since the unibus is an asynchronous bus, much of the interface 
circuitry consists of combinational logic to generate specific 
signals and flip-flops which are set and reset as flags. This tends 
to use a lot of SSI and MSI logic packages. Using Monolithic 
Memories' PAL devices, much of this logic can be condensed 
into a few packages. Figure 2 is the schematic diagram for an 
interrupt controller to be used on the unibus. 

Many cards communicate over the bus by taking control of the 
unibus with an interrupt request, and then do whatever they 
require before releasing control. As can be seen, this interrupt 
controller takes six special interface ICs, (380 and 8881 bus 



drivers and receivers) eight MSI, SSI ICs, (7400, 7402 and 7474s) 
along with some transistors and discrete parts. This parts count 
can be considerably reduced by using a PAL20RA10 and a 
PAL20L10. 

Figure 1 shows how the circuit with the PAL devices would look. 
The two PAL devices allow almost all of the 7400, 7402 and 7474 
packages to be removed. (Almost a four-to-one saving in chip 
count.) In addition, the preload pin (PRLD) on the 20RA10 
allows the flip-flops to be easily set to a known state on power 
up, or when reinitializing. So the PAL devices reduce the logic 
package count from eight chips to three. 

In the schematic shown, there are three VLSI devices, three 
MSIs and two SSIs. Using a PAL20RA10, it is possible to replace 
three MSIs and one SSI device, thereby reducing the chip count 
by a factor of two. The ICs inside the enclosed loop were 
replaced. 
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INTRAH- 

INTRAHEN- 

MCLEARAH- 

INTRBH- 

INTRBHEN- 

MCCLEARBH- 




BUSSSYNL- 
BUSBBSYL 







V 



START INTRAL- 



^ 



PAL 
20L10 



VCC 

NBGINBH 



START INTRBL- 



8881 
8881 
8881 



.^D- 



^O 



8881 



INTRDONEAH 
■INTRDONE BH 
ENINTRB 



PRELOAD 

AINTR 



NC 

FF3RESET 



P 



PAL 
20RA10 
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Figure 1. 
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TITLE INTERRUPT LOGIC 

PATTERN INTRPOl 

REVISION 01 

AUTHOR DAN KINSELLA 

COMPANY MONOLITHIC MEMORIES, INC. 

DATE 11/06/87 

CHIP INT_LOGIC PAL20L10 

INTRAH INTRAHEN MCLEARAH INTRBH INTRBHEN MCLEARBH BGINBH 
BGINAH BUS^SYNL BUSBBSYL STARTINTRAL GND 

STARTINTRBL FF3RESET ENINTRB INTRDONEBH INTRDONEAH EN8881 
SSYN NBGINBH FFIRESET ENINTRA NBGINAH VCC 



EQUATIONS 

/NBGINAH = BGINAH 

/FFIRESET = MCLEARAH+ENINTRA 

/ENINTRA = INTRAHEN* INTRAH 

/NBGINBH = BGINBH 

/SSYN = BUSSSYNL*BUSBBSYL 

/EN8881 = STARTINTRAL*STARTINTRBL 

/INTRDONEAH = BUSSSYNL+STARTINTRAL 

/ENINTRB = INTRBH* INTRBHEN 

/FF3RESET = MCLEARBH+ENINTRB 

/INTRDONEBH = BUSSSYNL+STARTINTRBL 



FFl CLK CONTROL 

BLOCK A 
SET FFl CONTROL 

BLOCK A 
ENABLE INTERRUPT A 

FF3 CLOCK CONTROL 

BLOCK B 
SYNCHRONIZE FF2 & 

FF4 
INTERRUPT BUS 

SIGNAL INTERRUPT 

DONE 
ENABLE INTERRUPT B 

SET FF3 CONTROL 

BLOCK B 
SIGNAL INTERRUPT 

DONE 



; DESCRIPTION 

; COMBINATORIAL LOGIC FOR PAL20RA10 INTERRUPT CONTROLLER 
;(1ST PART OF THE TWO PAL SOLUTION: PAL20L10 & PAL20RA10) 

; SIMULATION NOT INCLUDED 
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INSIDE PAL20L10 



INSIDE PAL20RA10 
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Figure 2. 
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Simulation Results 



DEC PDP-11 unibus interrupt controller 
Control. pds 



Santa Clara, CA 



Title 

Pattern 

Revision 

Author Dan Kinsella 

Company Monolithic Memories Inc 

Date 3/1/85 

CHIP INTR_CONTROL PAL2 0RA10 

PL AINTR NC ABGIN FFIRESET SSYN BINTR NC FF3RESET BEGIN 

NC GND 

NC 0UT4 0UT3 0UT2 OUTl FF3 NFF4 FF4 NFF2 FF2 FFl VCC 



EQUATIONS 










/FFl 


: = 


/FF1*FF2 




.•Master control 


FFl.SETF 


= 


/FFIRESET 




; block A 


FFl.CLKF 


= 


/ABGIN 






FF2 


: = 


FFl 




;Bus Busy Signal 


FF2.SETF 


= 


/AINTR 






FF2.CLKF 


= 


ABGIN*FF2* 


/SSYN 




/NFF2. 


! = 


FFl 




;Bus sack signal 


NFF2.SETF 


= 


/AINTR 






NFF2 . CLKF 


= 


ABGIN*NFF2 


*/SSVN 




/FF3 


: = 


/FF3*FF4 




.•Master control 


FF3.SETF 


= 


/FF3RESET 




.•block E 


FF3.CLKF 


= 


/BEGIN 






FF4 


.= 


FF4 




.•Bus busy signal 


FF4.EETF 


= 


/BINTR 






FF4.CLKF 


= 


BBGIN*FF4* 


'/SSYN 




/NFF4 


! = 


FF3 




.•Bus sack signal 


NFF4.SETF 


= 


/BINTR 






NFF4 . CLKF 


= 


BBGIN*NFF4 


,*/SSYN 




/OUTl 


= 


FF1+FF2 




;Bus request signal 
.•block A 


/OUT 2 


= 


FF4+FF3 




;Bus request signal 
.•block B 


/OUT 3 


= 


AINTR 




.•Intr. signal for 
.-bus req. block A 


/0UT4 


~ 


BINTR 




;Intr. signal for 
;bus req. block B 


SIMULATION 













gg g 


FFIRESET 


LHHH 


FF3RESET 


LHHH 


AINTR 


HLLL 


BINTR 


HLLL 


SSYN 


XXXL 


ABGIN 


XHHH 


BEGIN 


XHHH 


FFl 


LLLL 


FF3 


LLLL 


NFF2 


XLLL 


NFF4 


XLLL 


OUTl 


XHHH 


OUT 2 


XHHH 


OUT 3 


LHHH 


0UT4 


LHHH 



TRACE_0N FFIRESET FF3RESET AINTR BINTR SSYN ABGIN BEGIN 
FFl FF3 NFF2 NFF4 OUTl 0UT2 0UT3 0UT4 



SETF /FFIRESET /FF3RESET AINTR BINTR 



SETF FFIRESET FF3RESET /AINTR /BINTR 
ABGIN BEGIN 



.•Reset all regs 

.•Clock FFl and FF3 
; regs 

.•Clock NFF and NFF3 
.•regs 
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This application note describes ttie implementa- 
tion of a bus arbiter using a PAL16R4 to interface 
the 1 MHz 801 86 to the MULTIBUS multi-master 
environment. The PAL equations for bus ex- 
change were developed based on functional and 
timing requirements specified in the Intel 
MULTIBUS Specification. The interface shown 
addresses the bus master case in regards to bus 
exchange only. No attempt has been made to 
include the requirements for bus slave operation, 
interrupt handling or byte operations. This note 
assumes a knowledge of MULTIBUS operations 
and signal names. 

Overview 

The system environment is assumed to be a 
collection of loosely coupled independent 
processors which communicate through a com- 
mon memory on the system bus. Each processor 
in this system possesses its own local resources. 

A block diagram showing the interface of one such 
processor, an 80186, to the system bus, 
MULTIBUS, is shown in Figure 1 . A block of the 
80186's address space is allocated to system 
memory. Access into this portion of the address 
map generates a request for the system bus via a 
programmable memory chip-select pin. Once a 
request is generated by the CPU, the PAL16R4 
arbiter and the 8288 bus controller combine to 
perform a bus exchange and transfer cycle. 

In this design the arbiter maintains control of the 
system bus and therefore must be forced off either 
by a higher priority mas ter (BP RN goes HIGH) or a 
Common Bus Request (CBRQ). 

Functional Requirements 

On the MULTIBUS, the bus exchan ge process 
begins when the Bus Request (BREQ) goes LOW 
(|)arallel priority resolution scheme), or when 
BPRO goes HIGH to succeeding masters (serial 



priority resolution scheme)^ If the requesting 

master is granted priority (BPRN enabled) a nd the 
bus is not under control of another master (BUSY 
disabled), the n the master may take control of the 
bus by forcing BUSY LOW. 

An optional request mechanism. Common Bus 
Request, used in conjunction with either parallel or 
serial priority has been implemented in this design. 

It reduces bus acquisition overhead by allowing 
bus retention acro ss tra nsfer cycles, unless a 
request is pending (CBRQ is LOW). For example, 
in pa rallel p riority_UTe requesting master asserts 
both BREQ and CBRQ to force the curre nt m aster 
off the bus. Once acquisition is complete CBRQ is 
released for use by another master. 

Timing Considerations 

Bus exc hange on MULTIBUS is a synchr onous 
process. BREQ, BPRO, BUSY and CBRQ are all 
synchronized with the trailing edge (HIGH-to- 
LOW transition) of BCLK. BCLK is a MULTIBUS 
signal having a duty cycle of approximately 50% 
and a maximum frequency of 1 MHz. There is no 
requirement for synchronization between BCLK 
and any other clock in a MULTIBUS system. 
Synchronizer circuits must be used on signals 
that cross clock boundaries, because it is possi- 
ble (as in this design) for the processor clock to be 
asynchronous to BCLK. 

Bus priority on MULTIBUS resolution takes place 
in one BCLK cycle. If the serial priority scheme is 
implemented, this requirement will place an upper 
limit on the number of masters allowed for a given 
BCLK rate. The parallel priority scheme allows 
the relationship between BCLK and the number of 
masters to be independent. Whichever is chosen, 
the MULTIBUS specified minimum setup time of 
22 ns (reso lu tion to c lock), plus the desire to keep 
the BPRN to BPRO propagation delay to a mini- 
mum, points to the use of an "A" speed PAL 
device (Figure 2). 
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The MULTIBUS bus exchange timing requires a 
BUSY setup time before clock of 25 ns (see Fig- 
ure 2). Release by one master and acquisi- 
tion by another takes place on two successive 
HIGH to LOW transitions of BCLK. 

PAL Design Description 

The PAL design specification with supporting 
timing diagrams are shown in Figures 3 and 4. The 



following remarks are in addition to the comments 
found on the PAL design specification. 

The te rm "BRQP*AEN" found in the equation for 
"BREQ" is required to keep the artDiter from 
recapturing the bus when it is t he rele asing master. 
This can occur when a CBRQ forced the 
surrender, and the processor cycle which follows 
the just completed cycle accesses the system 
memory. 




Figure 1. Single Multimaster Bus Interface 
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Using BREQ to disable BPRO results in a 40 ns 
delay worst case from clock (tCO + tPD). Taking 
into account the 22 ns setup (see Fig. 2) and 
clock skew (3 ns max) leaves 35 ns of cycle (10 
MHz BCLK), if the serial priority resolution 
scheme is being employed. Lower priority mas- 
ters using t his des ig n would exhibit a 25 ns worst 
case delay BPRN to BPRO. Use an e xternal g ate 
to impleme nt the equation "BPRO = 
BPRN*BREQ" if the serial priority is required when 
using more than two masters. 

Note the inversion of BLCK for use by the PAL 
device. PAL device registers load data on the 
LOW to HIGH clock transition of pin 1. 

MULTIBUS address setup before Command (50 
ns min.) Is determined by the 8288 bus controller. 
The 8288 specificat ion guarantees 115 ns 
minimum between AEN going LOW and 
Commands going active. 



MULTIBUS address hold after Command (50 ns 
min.) is satisfied by using the ALE signal from the 
8288 instead of the 80186. The 8288 Command 
lines go inactive on the HIGH to LOW transition of 
CLKOUT at the beginning of T4 (30 ns max delay). 
The 8288 ALE signal goes active on the HIGH to 
LOW transition of CLKOUT at the beginning of T1 
(1/2 clock cycle later than ALE from the 80186). 
The minimum hold time is therefore 100 ns - 35 ns 
= 65ns. 



BREQ 



BUSY 



tBCY 

100ns Min. 



|#-tBW-*{- 0.5 tBCY Nom. 

J — V-T^ 



BPRO 



Address 
Unes 




tBUSY 



tBPRNO ^ 1^ i^j tBPRO 
30ns Max.~*j — f*~ |**p40ns Max, 



I I (=tPD + 2; 

L J l^^sed by 
r n"Master in 



tPD + tBPRNs 

22ns) 

next 



Priority Chain. 



Figure 2, Bus Exchange AC Timing 
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TITLE 


A MULTIBUS ARBITER 


PATTERN 


AMP3-318 


REVISION 


005 


AUTHOR 


JOE ENGINEER 


COMPANY 


ADVANCED MICRO DEVICES 


DATE 


11/15/87 



CHIP MULTIB PAL16R4 

BCLK /RD /WR /SREQ /RESET /BPRN NC NC NC GND 

/E /CBRQ /BUSY /SYNC /HREQ /AEN /OEN /BREQ /BPRO VCC 

EQUATIONS 

BPRO = /HREQ*BPRN 

BREQ = HREQ+AEN 



OEN := /RESET*SREQ*AEN 

AEN := /RESET*AEN*HREQ*RD 
+ /RESET*AEN*HREQ*WR 
+ /RESET*HREQ*BPRN*/BUSY 
+ /RESET*AEN*BPRN*/CBRQ 



;HOLD BUS UNTIL CURRENT 

; CYCLE IS COMPLETE 

; INITIAL BUS ACQUISITION 

;HOLD BUS IF NO OTHER REQUEST 



HREQ := /RESET*SYNC 

SYNC := /RESET*SREQ*RD 
+ /RESET*SREQ*WR 

BUSY = AEN 
BUSY.TRST = AEN 

CBRQ = /AEN*HREQ 
CBRQ.TRST = /AEN*HREQ 

;THIS MODIFICATION ALLOWS THE ARBITER TO RESIDE ANYWHERE IN 
;THE ARBITRATION DAISY CHAIN 



Figure 3. PAL16R4 Arbiter Design Specification 
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* Delayed by 8288 
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Figure 4. Bus Transfer Timing Diagram 
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Multibus to Am951 6 Interface 



This interface shows the Am9516 connected to the MULTIBUS 
(Figure 1). This is accomplished by two PAL device designs. 
The equations for the PAL devices are shown in Figures 2 and 3. 
The first designated Am9516MBC does the MULTIBUS arbitra- 
tion as d efined by the MULTIBUS specification. Common bus 
request (CBRQ) was not implemented in this design. Addition- 
ally this design holds the bus as long as BREQ is active. If the 
user wishes to release the bus after each transaction the 
Am9516 should be programmed for CPU interleave. 

The second PAL device in this interface designated 9516MBC, 
converts the Am951 6 signals into MULTIBUS control signals. It 
also generates RD and WR for the 8530 so that flyby transfers 



can be done. When not doing flyby, this part of the PAL device 
should be changed. There are several considerations when 
using the SCC with DMA not addressed here. These are cov- 
ered in a separate application. This example will work for 
moderate serial data rates. To operate the SCC at maximum 
speed, a local RAM should be used as bus arbitration overhead 
could cause problems. The main purpose here was to illustrate 
the versatility of PAL devices and how easy it Is to interface 
apparently incompatible devices to the MULTIBUS. 

The two PAL devices shown are similar in function to the 8289 
and 8288 shown with the 8086 CPU. A similar design could be 
done for processors such as the 68000 or other bus masters 
such as the 8052 CRT controller which has its own DMA. 
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Figure 1 . Am9516 to 8086 in a IVIULTIBUS" Environment 



D 



Multibus to Am951 6 Interface 



TITLE MULTIBUS CONTROLLER FOR AM9516 

PATTERN AMP3-189 

REVISION 01 

AUTHOR JOE BRCICH 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/15/87 



CHIP MULTIl PAL16L8 

BACK MIO NC NC /DACK NC NC NC /CEN GND 
NC /RD /lORC /DS /MWTC /MRDC /lOWC /RW /WR VCC 

EQUATIONS 

lORC = /MIO*DS*/RW*CEN 

lORC.TRST = BACK 

lOWC = /MIO*DS*RW*CEN 

lOWC.TRST = BACK 

MRDC = MIO*DS*/RW*CEN 

MRDC.TRST = BACK 

MWTC = MIO*DS*RW*CEN 

MWTC.TRST = BACK 

RD = DACK*RW*BACK 
+ IORC*/BACK 

WR = DACK*/RW*BACK 
+ IOWC*/BACK 

DS = lORC 
+ lOWC 

DS.TRST ^ /BACK 

RW = I owe 

RW.TRST = /BACK 

; DESCRIPTION 

;This PAL device converts multibus signals into 9516 compatible 

; signals and vice-versa. It also supports the 8530 in flyby mode 

Figure 2. Source Listing for MULTIBUS to Am9516 Interface 
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TITLE MULTIBUS ARBITER FOR AM9516 

PATTERN AMP3-190 

REVISION 01 

AUTHOR JOE BRCICH 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/15/87 

CHIP MULTI2 PAL16R4 

/BCLK /XACK BRQ /BSY /BPRN /DS NC /lORC /CS GND 
/OE /RBEN /TBEN BACK /CEN /BREQ /BUSY /BPRO /WAIT VCC 

EQUATIONS 

TBEN = IORC*CS 

TBEN.TRST = /BACK 

RBEN = /IORC*CS 

RBEN.TRST = /BACK 

WAIT =/XACK*BACK 

BREQ := BRQ 

BPRO = /BRQ*BPRN 

/BACK := /BUSY 

BUSY := BREQ*BPRN*/BSY*/BUSY 
+ BREQ*BUSY*BPRN 
+ DS*BUSY 

CEN := BACK 

DESCRIPTION 

/CEN delays the commands to meet the multibus requirements that 
address and data be valid at least 50 ns prior to control active. 
If we do not allow preemption; remove BPRN from the second 
expression in the BUSY equation and eliminate the third expression 

Figure 3. Source Listing for MULTIBUS to AmSSIS interface 
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This application note describes how replacing two 8086 support 
chips with a Z8000 support chip and a PAL16R8A allows the 
8086 CPU to interface directly to the Z-BUS. Since the timing of 
the signals used is the same for the 8088 CPU, this circuit will 
work equally well in those applications. 

Interfacing the 8086 CPU to the Z-BUS allows 8086 users to 
take advantage of the very powerful Z8000 peripheral and 
memory support circuits that are available. The Z8000 periph- 
eral circuits in particular offer the user higher throughput rates, 
simpler control software and less system overhead require- 
ments than any previous generation peripheral family for 
any CPU. 

Design Requirements 

The 8086 CPU can operate in two different modes. In minimum 
mode, it generates all the bus control and timing signals for the 
8086 (8085, 8088) buses directly on-chip. In maximum mode, 
the CPU puts out status information early in each bus cycle and 
relies on an external bus controller chip, the 8288, to generate 
timing and control signals.This implementation uses the CPU in 
maximum mode and replaces the 8288 with a PAL16R8A that 
generates the Z-BUS timing and control signals from the status 



signals provided by the CPU. It also makes use of the AmZ81 27 
clock generator to allow precise timing resolution by providing 
an oscillator signal at 3 times the CPU clock frequency. The 
AmZ8127 provides all the clock generation functions of an 
8284A as well as several additional functions. Either clock chip 
will work in this system. 

The bus controller provides the following functions: 



—Generates AS, DS, INTACK and R/W with proper timing rela- 
tive to address and data. 
—Provides simultaneous assertion of AS and DS during reset. 
— Automatic insertion of 1 wait state for all I/O cycles. 

— Synthesizes a single Z-BUS interrupt acknowledge cycle from 
the 8086 lACK cycles. 

Figure 1 shows the circuit interconnection diagram. The system 
uses a h igh-speed PAL16R8A to generate AS, DS, R/W and 
INTACK of the Z-BUS, ROY for wait state generation, and three 
internal state variables. The registers are clocked with the 15- 
MHz OSC signal from the 81 27. The five input signals to the PAL 
device are 5-MHz CPU Clock Signal (CLK^, System Reset 
(RESET), and the three CPU Status States (Sq, S,, Sg). 
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Figure 1. Circuit Interconnection Diagram 
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The CPU indicates the start of a bus cycle by bringing at least one 
of the status lines lowf rom the idle high state (see Figure 2). This 
starts an internal timing sequence within the PAL device which 
corresponds closely to the various T states of a bus cycle. AS is 
asserted during the time CLK is LOW during Ti. DS is asserted 
at the start of T3. If it is an I/O cycle, then RDY would be disabled 
for one CLK period straddling T2 and T3 causing the 8127 to 
request 1 wait state after T3. In either case, DS remains as- 
serted until after the first 1 /3 of T4, which is identified by the 
status lines returning to the idle_state during the previous cycle. 
R/W is generated by sampling Sq and S, during AS. 



It is in the realm of Interrupts where the Z8000 peripherals shine 
over other peripherals. Each peripheral can identify many differ- 
ent exception conditions during its operation. The occurrence of 
one or more of these conditions causes activation of a single 
interrupt request line. The peripheral wants the CPU to respond 
with a single interrupt acknowledge cycle, during which the pe- 
ripheral resolves priority and provides the CPU with enough 
status and vector information to allow it to respond to the excep- 
tion without any further interrogation of the peripheral. This al- 
lows interrupt driven systems to achieve very high data 
throughput rates. 
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Figure 2. Memory and I/O Timing Diagram 
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The 8086 CPU responds to an interrupt request with a sequence 
of two interrupt acl<nowledge cycles, and only in the second is 
any data read off the bus. As stated before, the Z-BUS peripher- 
als require only one acknowledge cycle. The timing of this has to 
be such that there is enough delay between AS going HIGH and 
DS going LOW to allow any prioritizing daisy chains to settle, 
and DS has to be wide enough to allow the peripheral time to 
place vector or status information on the bus. Figure 3 shows 
how these two requirements are accomplished by turning the 
two acknowledge cycles into one. The first cycle allows only AS 
and the second asserts only DS and does so for the complete 
cycle. This appears to the peripheral as one very long bus cycle 
which is ide ntified a s an interrupt acknowledge cycle by the 
assertion of INTACK. 



Design Approach 

To implement this design in a PAL16R8A requires recognizing 
the Z-BUS timing characteristics in Figures 2 and 3. The major 
characteristic to consider is counting the phases of a bus cycle. 
Internal state variables Pq and Pi are the result (see Figure 4). 
An additional internal state variable (I2) is necessary to count the 
second bus cycle of an interrupt ack nowledge sequence. As 
shown in Figure 5, I2 in conjunction with INTACK allows AS to be 
asserted only in the first interrupt acknowledge cycle and DS 
only in the second. The RESET input is used to initialize the 
internal variables and assert AS and DS. Note also that Sq and 
Si are included in the DS equation to prevent DS from being 
asserted during a halt cycle. 
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Figure 3. Interrupt Acknowledge Timing Diagram 
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The fact that PAL devices are user programmable allows a great 
deal of flexibility for the designer. Minor timing changes are 
easily implemented by simply adding or changing a term in the 
logic equations and reprogramming the device. In this system, 
we have timing resolution to 67 ns. This same configuration can 
be used with a 24-MHz crystal for 8-MHz CPU chips. The logic 
equations would change because the OSC period would be 
42 ns. The only hardware change would be the crystal. 

An additional PAL device could also perform chip select decod- 
ing based on both address and status signals. 



Conclusion 

We have seen how a properly programmed PAL device can be 
used to replace a specialized bus controller chip and allow an 
8086 CPU to interface directly to 2-BUS peripheral(s) and/or 
memory systems. This brings all the advantages of the superior 
Z8000 peripheral family in terms of both throughput and ease of 
use to 8086 users with no increase in chip count while still 
allowing a wide range of design flexibilty. The PALASM software 
equations for the PAL16R8A 8086 to Z-BUS interface chip are 
shown in Figure 6. 
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TITLE 8086 TO Z-BUS INTERFACE 

PATTERN AMP3-195 

REVISION 01 

AUTHOR NICK ZWICK 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/15/87 



CHIP ZBUSl PAL16R8 

CK RESET CLK /SO /SI /S2 NC NC NC GND 
/E /PO /PI /RW /DS /AS /INTACK RDY /1 2 VCC 

EQUATIONS 

; INTERNAL STATE VARIABLES 

PO := /RESET* S0*/P0*/P1*/CLK 

+ /RESET* S1*/P0*/P1*/CLK 

+ /RESET* S2*/P0*/P1*/CLK 

+ /RESET* P0*/CLK 

+ /RESET* PI* CLK* SO 

+ /RESET* PI* CLK* SI 

+ /RESET* PI* CLK* S2 

PI := /RESET* P0*/P1* CLK 

+ /RESET* P1*/CLK 

+ /RESET* PI* CLK* SO 

+ /RESET* PI* CLK* SI 

+ /RESET* PI* CLK* S2 

12 := /RESET* INTACK*/I2* CLK* P0*.P1 

+ /RESET* I2*/P1 

+ /RESET* I2*/P0 

+ /RESET* I2*/CLK* PO* PI 

; Z-BUS OUTPUT SIGNALS 

AS := RESET 

+ /P0*/P1* CLK*/I2 
+ AS*/P0*/I2*/DS 

DS := RESET 

+ /INTACK*/P0* PI* CLK*SO 
+ /INTACK*/PO* PI* CLK*S1 
+ 12* SO* SI* S2 
+ DS* PO* PI 

RW := AS* S0*/S1 
+ RW*/AS 

INTACK:= /RESET*AS*S0*S1*S2 

+ /RESET*INTACK*/I2*P1 
+ /RESET*I2 

/RDY := /RESET* S0*/S1* S2*/P0* PI 
+ /RESET*/SO* SI* S2*/P0* PI 



Figure 6. Source Listing fortlie Z-Bus to 8088/8086 interface 
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A High-Performance Bus 

The VMEbus is a high-performance asynchronous bus capable 
of supporting multiple 16- and 32-bit processors. Its asynchro- 
nous nature permits easy implementation of hardware control- 
lers that reduce the time required for bus control tasks such as 
bus arbitration. By offloading these tasks to hardware, bus ar- 
bitration time is minimized which consequently maximizes the 
overall bus transfer rate. 

This article describes PLD implementations of two such control- 
lers in a typical VMEbus system: the bus arbiter and the inter- 
rupter that comply with the VMEbus protocol. These state 
machines used to require multi-chip designs consisting of se- 
quencers, microprogrammed control stores, and other MSI/LSI 
chips. By using the PAL22V1 programmable array logic device, 
you can reduce substantially the number of chips needed for 
these state-machine implementations. 

Functional Modules 

A typical VMEbus-based computer system contains a bus ar- 
biter and an interrupt handler board. A processor/master in- 
itiates a data transfer to a slave by requesting control of the data 
bus from the bus arbiter. Once bus control has been granted to a 
master, control signals to and from the master and slave are 
exchanged which follow predefined protocols. These protocols 
guarantee an orderly transfer of data between the communicat- 
ing modules. Interrupt servicing capability is provided by the 
interrupt handler board (see Figure 1). 



Steps in Designing the 
Bus Controllers 

VMEbus protocols outline the steps to perform any bus-related 
operation, such as transferring data over the bus between two 
modules. These protocols are described using flow diagrams 
with interactions between the communicating modules shown 
through various interface signals. For example, the priority bus 
arbiter flow diagram (Figure 2) shows how bus requests be- 
tween two modules using the same request line are resolved. 

The flow diagrams are analyzed to pick out the functions that 
can be incorporated into PLDs, and then state machines are 
created for these functions. These state machines can be 
described logically, or with flowchart symbols such as rectan- 
gular blocks (to symbolize events or control signals) and deci- 
sion diamonds (to determine control program flow). Logic 
equations are then written for these state machines. 

This methodical process is used in designing the two PLD- 
based VMEbus controllers: the bus arbiter and interrupt handler 
controller. 

A PLD Bus Arbiter 

Before any module can perform a data transfer, it must request 
control of the data transfer bus from the bus arbiter. This design 
uses a priority option bus arbiter implemented on a single 
PAL22V10 that monitors the four bus request lines BR0-3(L) 
with BR3(L) assigned the highest priority. 
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Figure 2. Bus Arbitration Flow Diagram 
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Based on the flow diagram (Figure 2), the arbiter grants the bus 
to the requesting module using the highest active request line, in 
this case BR1(L). A bus grant signal is daisy-chained to all the 
devices using this request line to resolve simultaneous bus re- 
quests from two or more modules. This dictates that the arbiter 
be in the first slot of the VMEbus system, and that the module 
physically closest to the arbiter through the daisy-chain will get 
the bus grant. 

All bus requests are processed in parallel by the AND-OR array 
of the PLD. This allows priority arbitration to occur in a single 
clock cycle, which minimizes arbitration time and maximizes the 
data transfer rate on the VMEbus. 

When the bus is free, the module using the highest active bus 
request line is granted the bus. This is described with Boolean 
logic notation where registered outputs are defined with ':=': 



IF(/BBSY*(BR0+BR1-HBR2 
+ BR3)) THEN BEGIN 
IF (BR3) THEN BG3IN := 1 ; 

IF (/BR3*BR2) THEN 

BG2IN:= 1; 
IF (/BR3*/BR2*BR1)THEN 

BG1IN:= 1; 



IF (/BR3*/BR2*/BR1*BR0) 

THENBGOIN := 1; 
END ; 



if bus not busy and a 
request line is active 

if BR3 is active, grant 
bus to device on BR3 

activate bus grant daisy 
chain 2 

if BR1 is active and BR3 
and BR2 are not, then 
grant bus to device 
using request line 1 

BRO-3 and BGOIN to 
BG3IN are active low 



The PAL22V10 can be programmed with normal or inverted 
outputs and the logic software can support active high or low 
Input polarities. 

BGIIN(L) is asserted until the requesting module asserts 
BBSY(L). This is defined in logic equation form as: 



IF(/BBSY*BG1IN)THEN 
BG1IN:= 1; 



continue asserting 
BGl'lN until BBSY 
becomes active 



Priority Arbitration Options 

It may be necessary for the arbiter to force the current bus 
master to relinquish bus control for certain conditions, such as if 
a higher priority operation must be attended to. This is done by 
asserting the bus clear signal BCLR(L) based on the priority of 
the current bus master and the bus clear conditions. 

The PLD arbiter keeps track of the current bus master's priority 
by recording which bus request line was used to gain control of 
the bus. For example, two output registers called BUS-MAS- 
TER will be set to the binary value "2" if the current bus master 
used BR2(L) to gain control of the bus. 

In this design, BCLR(L) wil be activated under two conditions: 

1 . If the BUS-MASTER is 2,1 or and the active bus request 
line is 3 or 2 

2. If BUS-MASTER is 0, and any bus request line is active. 

When BUS-MASTER is 3, then the arbiter will not honor any 
bus requests until the bus busy line BBSY{L) is low. The above 
conditions are expressed logically as: 



IF (BBSY*(BR0+BR1-^BR2-^BR3)) THEN 
BEGIN IF (BR3*(/MASTER[1:0] = 3) ) ) THEN 

BGLR := 1 ; "assert BCLR if MASTER < > 3" 
IF (/BR3*BR2*(/MASTER[1:0] = 3) ) ) THEN 

BGLR := 1; 
IF(MASTER[1:0] = 0) THEN 

BCLR:= 1; 
END; 

Once the BCLR(L) line is active, then it should be held until the 
current bus master releases BBSY(L). This is logically ex- 
pressed as BCLR = BCLR*BBSY, or in a high-level syntax, 

IF (BCLR'BBSY) THEN 
BCLR := 1; 

The BCLR(L) signal is defined such that uninterruptible devices 
use BR3(L). Devices that can be temporarily suspended to ac- 
commodate interrupts and higher-priority operations are as- 
signed to bus request line 0. The BCLR conditions will vary with 
your application, but any modifications will only require redefini- 
tions of the high-level logic expressions. 

The PAL22V10 is ideal as the bus arbiter because of the input/ 
output signal requirements and the large number of product 
terms needed to logically define the bus grant and bus clear 
signals. 

Processing interrupts 

When real-time or urgent response is needed by a processor, it 
generates an interrupt request signal and waits for the interrupt 
acknowledge signal lACK(L) and lACKIN(L) daisy-chain signal. 
A 3-bit value is then read from the VMEbus when the data 
strobes are active; this value indicates which interrupt request 
line was acknowledged. These three-bits are decoded to deter- 
mine if it matches the processor's request level. 

If the interrupt is acknowledged, then the interrupting processor 
puts its status or ID byte on the data bus for the interrupt hand- 
ler. This data is used by the interrupt handler as an interrupt 
vector. This processor can then wait in a loop until the lACK(L) 
signal from the interrupt handler is driven HIGH to signify inter- 
rupt service completion. 

Interrupt Handling: The Interrupt 
Handler Preprocessor (IHP) 

To handle external I/O or special system events (e.g., timeout, 
overflow), interrupts are supported on the VMEbus through an 
interrupt handler (IH) module. 

The logiccomplexity of the IH is reduced by offloading some of the 
initial interrupt recognition tasks to a PLD. A PAL22V10 can be 
programmed as an IHP to preprocess interrupt requests, obtain 
the bus and handle data transfers (such as interrupt acknowledge 
signals). Control is passed to the interrupt handler only when the 
IHP latches the interrupt vector. 

Interrupt request processing, bus acquisition, and the interrupt 
vector transfer phase are all specified using logic equations 
which are written using a high-level Boolean notation. The 
PAL22V10 monitors seven interrupt request lines and five other 
control inputs, and generates ten registered outputs to send 
control signals to the interrupt handler and the VMEbus drivers. 



Monolithic 



m 



Memories 



n 



2-349 



VME Bus Control Simplified with PLDs 



Interrupt Logic 

All interrupt request lines are monitored according to the follow- 
ing logic equation: 

IF (IR1 + IR2 + IR3 + IR4 + IRS + IR6 + IR7) THEN 
BR3 := 1 ; this interrupt handier uses 

the BR3 request line 

If any of the interrupt lines are active, then BR3(L) is asserted. 
This initiates the bus acquisition phase. 

The next step is to wait for the bus grant in signal. Only when 
BGIN3(L) is active will BBSY(L) be active. This is expressed 
logically as: 



IF (BR3*/BG3IN) THEN — 

BR3 := 1; — 

IF (BR3*BG3IN + BBSY*/SERVICE_DONE) 

THEN - 

BBSY := 1 ; 



[A] 
[B] 



Equation [A] continually asserts BR3(L) as long as BG3IN(L) is 
not active, while [B] asserts BBSY(L) only when request line 
BR3(L) and BG3IN(L) are active, or if service has not been 
completed once the IHP asserts BBSY(L). 

When the IHP is the bus master, it puts the 3-bit interrupt-line 
acknowledge value on the bus and applies the data strobes. 
When the DTACK(L) signal is received from the interrupter, the 
IHP then strobes the status byte from the bus into a register on 
the interrupt handler card. The IHP informs the interrupt handler 
that a status/ID byte is ready and to commence interrupt 
servicing. 

The IHP takes only a single clock cycle to resolve interrupt 
priorities because all interrupt/input signal lines are processed 
in parallel by the PLD. This is expressed in a logic description 
language as follows: 



IF (BBSY) THEN 

BEGIN 
(1) IF(IR7)THEN 

INTR[2:0] := 7; 

IF (/IR7*IR6) THEN 
INTR[2:0] := 6; 

IF (/IR7*(IR6*IR5) THEN 
INTR[2:0] := 5; 



IR7 was active 

3-bit value acknowledging 

interrupt line 7 

IR6 active, IR7 inactive 

acknowledge interrupt line 6 

IRS highest priority 

interrupt line active 



(2) IF (/IR77IR6*/IRS*/IR4*/IR3VIR2*IR1) THEN 

INTR[2:0] := 1 ; acknowledge interrupt line 1 

END; 

As noted above, if both the iR7(L) and BBSY(L) signal are ac- 
tive in (1), then regardless of the value of the other interrupt 
lines, the three-bit value generated will be "111" or 7. In (2), if 
IR1(L) and BBSY(L) are active and the other six control signals 
are inactive, then the three-bit output value will be "001 " or 1 . In 
(2), IR1(L) is the highest priority line active. 

The remaining preprocessing involves completing the interrupt 
vector byte transfer. This is logically expressed as: 

IF (BBSY) THEN 
BEGIN 

lACK := 1; begin interrupt acknowledge 

(X) IF (DTACK) THEN daisy chain; if device sends 

LATCH_STATUS := 1; data transfer acknowledge 
END; (DTACK) signal, then latch 

IF (lACK) THEN the status 

AS := 1 ; assert the address strobe signal to inform 
the interrupter that the interrupt 
acknowledge level is ready 

Once the 8-bit status of ID byte is transferred successfully via 
the DTACK(L) signal (X), the IHP PAL device informs the inter- 
rupt handler module to begin the interrupt service routine: 

IF (BBSY*LATCH_STATUS +BBSY*START_SERVICE* 
/SERVICE_DONE) THEN 
START_SERVICE := 1; 

In this design, the START_SERVICE signal is constantly as- 
serted until the interrupt handler generates a SERVICE_DONE 
signal, at which point START_SERVICE is brought LOW. The 
above equations will generate the timing diagram in Figure 3. 
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Figure 3. Timing Diagram 
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DEVICE ARBITER (AMPAL22V10) " VMEbus Priority Arbiter Option 
For this application, if the bus is busy, then bus requests will be 
deferred until the BBSY(L) (bus busy line active LOU) goes HIGH, 
unless the bus request is of a higher priority. In this case, the 
bus clear signal BCLR(L) is asserted and the current bus master 
relinquishes control. If the bus is free and 2 or more masters 
request service, then the bus is granted to the master with the 
higher priority request line. " 



PIM CLK 




= 1 


/BBSY 




= 2 


/BRO 




= 3 


/BR1 




= 4 


/BR2 




= 5 


/BR3 




= 6 


/BCLR 




= 23 


/BGOIN 




= 22 


/BG1IN 




= 21 


/BG2IN 




= 20 


/BG31N 




= 19 


MASTER [1 


0] 


= 18,17 



BEGIN 

"if a bus request occured and bus = free, then activate the bus grant in 
line corresponding to the highest priority bus request line " 

IF </B8SY*(BR0+BR1+BR2+BR3)) THEN 
BEGIN 
IF (BR3) THEN 

BG3IN := 1 ; 
IF (/BR3*BR2) THEN 

BG2IH := 1 ; 
IF (/BR3*/BR2*BR1) THEN 

BG1IN := 1 ; 
IF (/BR3*/BR2*/BR1*BR0) THEN 

BGOIN := 1 ; 
END ; 

" while the bus grant in line is active and the bus is still busy, 
then latch the bus grant signals until BBSY(L) becomes active" 



when requester responds with BBSY active, then " 
MASTER is set to the line currently controlling" 
the bus; this is used internally for future " 
bus priority resolution " 



IF (BG3IN*/B8SY) THEN 

BG3IN := 1 ; 
IF {BG2IN*/BBSY) THEN 

BG2IN := 1 ; 
IF (BG1INVBBSY) THEN 

BGIIN := 1 ; 
IF (BGOINVBBSY) THEN 

BGOIN := 1 ; 

IF (BG3IN*BBSY) THEN 

MASTER [1:0] := 3 ; 
IF (BG2IN*BBSY) THEN 

MASTER [1:0] := 2 ; 
IF (BG1IN*BBSY) THEN 

MASTER [1:0] := 1 ; 
IF (BG0IN*BBSY) THEN 

MASTER [1:0] := ; 



IF (BBSY) THEN " when bus busy, then remember current bus master line " 
MASTER [1:0] := MASTER [1:0] ; 

"if a higher priority bus request occurs and the bus is busy, then begin 
bus resolution (by using bus clear BCLR) under the following conditions : 

- if BR3 and present bus master is not using bus line 3; i.e., once a device 
uses BR3, then no one can force it to relinquish the bus or 

- if BR2 and present bus master is not 3; i.e., BR2 can assert bus clear 
except when the presetn bus master obtained the bus using bus line 3 or 

- if BR1 and the bus master obtained the bus using bus request line 0, then 
assert BCLR 

==> if BRO is activated when the bus is busy, then do not assert BCLR; BRO 
devices cannot force anyone off the bus. This is one possible priority 
implementation. 

IF (BBSY*(BR0+BR1+BR2+BR3)) THEN "if a bus request occured and bus = busy" 
BEGIN 

IF (BR3*(/MASTER[1]+/MASTER[0])) THEN "if BR3 and present master is not " 
BEGIN 

BCLR := 1 ; " using line 3, then assert BCLR <• 

IF (BCLR*BBSY) THEN 

BCLR := 1; 
END; 
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IF (/BR3*BR2*(/MASTER[1]+/HASTER[0])) THEN 
BEGIN 

BCLR := 1 ; 
IF (BCLR*BBSY) THEN 

BCLR := 1; 
END; 
IF (/BR3*/BR2*BR1*/HASTER[1]*/MASTERtO]) THEN 
BEGIN 

BCLR := 1 ; 
IF (BCLR*BBSY) THEN 

BCLR := I.- 
END.- 
END ; 
END. 

"Test vectors used for simulation and testing" 
TEST_VECTORS 

IN CLK , /BBSY , /BRO , /BRi , /BR2 , /BR3 ; 
OUT /BCLR , /BGOIN , /BG1IN , /BG2IN , /BG3IN , MASTER [1:0] 
BEGIN 

" I I I I I I I I I I 

"cbbbbb bbbbbmm" 
"Isrrrr cggggss" 
'! k y 1 2 3 r 1 2 3 1 0" 



C 1 1 1 

C 1 1 1 

C 1 1 1 1 

C 1 1 

colli 

C 1 1 1 
C 1 1 1 



C 1 1 1 

C 1 1 1 1 

C 1 1 1 1 

C 1 1 1 1 

colli 

C 1 1 

C 1 1 

C 1 1 

C 1 1 

C 1 1 1 
END. 



XXX 
XXX 
XXX 
XXX 
XXX 
XXX 
XXX 
XXX 
XXX 
XXX 
XXX 
XXX 
XXX 
XXX 
XXX 
XXX 
XXX 



X X X X 

X X X X 

X X X X 

X X X X 

X X X X 

X X X X 

X X X X 

X X X X 

X X X X 

X X X X 

X X X X 

X X X X 

X X X X 

X X X X 

X X X X 

X X X X 

X X X X 



Figure 4. PLPL Specifications for the Exampie of Figure 1 (Cont'd.) 
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DE.VICE INTR_HMD_CTRLR (AMPAL22V10) " VHEbus interrupt handler preprocessor 

This PAL serves as an interface between the interrupt servicing logic in the 
interrupt handler and the bus arbiter: when an interrupt is detected, the 
PAL requests the bus from the arbiter. When the bus is granted, the bus 
busy BBSY(L) is asserted. The 3 bit code corresponding to the highest 
priority interrupt request is put on the bus, and both lAClC(L) and AS(L) 
are also asserted. The PAL waits for DTACIC(L) LOU and then latches the 
status/ID byte. The START_SERVICE signal informs the interrupt handler 
to begin the interrupt service sequence with the status/ID byte latched in. 
Note : this interrupt handler interface PAL handles all interrupts using 
bus request line BR3; this is application-dependent " 



PIN 
CLK 
/IR1 
/IR2 
/IR3 
/IR4 
/IR5 
/IR6 
/IR7 



= 1 
= 2 
= 3 
= 4 
= 5 
= 6 
= 7 
= 8 



"interrupt request lines 1 to highest priority 7" 



/DTACK = 9 



/BG3IN = 10 
SERVICE DONE = 13 



LATCH_STATUS 
/BBSY = 22 
/lACK = 21 
/AS = 20 
/BR3 = 19 
GET_BUS 
INTR[2:0] 



= 23 



= 18 



"data transfer ack; used by interrupter to indicate 

status/ID data byte ready on bus " 
"bus grant in signal from arbiter" 
"signal from interrupt handler indicating 

service complete" 

"latches data from bus for interrupt service routine" 
"bus busy" 

"interrupt acknowledge" 
"address strobe" 

"bus request signal to bus arbiter" 
"used internally by PAL " 



= 15:17 "3-bit level code put on bus 



START SERVICE = 14 ; "sent from PAL to intr handler 



BEGIN 



"if any interrupt request line is active, then try getting the bus. All the 
interrupts handled by this PAL will use the bus request line BR3. A 
request can occur only when no other interrupt is being serviced (i.e., 
bus should not be busy and bus grant in should not be active) " 



IF (/BBSY*/BG3IN*(IR1 + IR2 + IR3 + IR4 + IR5 + IR6 + IR7)) THEN 
BEGIN 

GETBUS := 1 ; 

BR3 ;= 1 ; "subject to application" 
END ; 

"if interrupt handler wants the bus, the bus grant line is still inactive, 
and the interrupt request is still active, then continue requesting the bus " 
IF (GET_BUS*BR3*/BG3IN*{IR1 + IR2 + IR3 + IR4 + IRS + IR6 + IR7)) THEN 

BEGIN 

BR3 := 1 ; 

GET_BUS := GET_BUS ; 

END; 

"if bus grant in line is active, then assert the bus busy signal" 
IF (GET_BUS*BG3IN + BBSY*/SERVICE_DONE) THEN 
BBSY := 1 ; 

"if interrupt handler received bus, then acknowledge the interrupt" 
IF (BBSY) THEN 
BEGIN 

IF (/START_SERVICE) THEN 
I ACK := 1 ; 

"when the interrupter responds with DTACIC(L), then " 
"latch the status/ID byte into the interrupt handler" 



IF (DTACK) THEN 
LATCH STATUS := 1 



"IR7 line is highest priority" 



IF.(IR7) THEN 

INTR [2:0] := 7 ; 
IF (/IR7*IR6) THEN 

INTR [2:0] := 6 ; 
IF (/IR7*/IR6*IR5) THEN 

INTR [2:0] := 5 ; 
IF (/IR7*/IR6*/IR5*IR4) THEN 

INTR [2:0] := 4 ; 
IF (/IR7*/IR6*/IR5*/IR4*IR3) THEN 

INTR [2:0] := 3 ; 
IF (/IR7*/IR6*/IR5*/IR4VIR3*IR2) THEN 

INTR [2:0] := 2 ; 
IF (/IR7*/IR6*/IR5*/IR4*/IR3*/IR2*IR1) THEN 

INTR[2:0] := 1 ; "IR1 line is lowest priority" 

END; 
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IF (lACKVAS + lACKVDTACK) THEN "hold the interrupt acknowledge level until" 
INTRt2:0] := INTRt2:0] ; "strobed by address strobe signal or until 
the interrupter acknowledges the 3-bit 
level with DTACK " 

"if the lACK line is active and interrupt servicing has not yet begun 
activate the address strobe line AS " 
IF (IACK*/START_SERV1CE) THEN 
AS := 1 ; 

"send the start service signal until service is done" 
IF (BBSY*LATCH_STATUS + BBSY*START_$ERVICE*/SERVICE_DONE) THEN 
START SERVICE := 1 ; 



END. 



TEST_VECTORS 

IN CLK , /IR1 , /1R2 , /IR3 , /IR4 

/OTACK , /BG3IN , SERVICE_DONE; 
OUT START_SERVICE , GET_BUS , INTR[2:0] 

/AS , /lACK , /BBSY , /BR3 ; 
BEGIN 

"C///////// 
" L I I I I I I I D B 
"KRRRRRRRTG 
" 1234567A3 
■I C I 

" K_N 

1 



/IR5 , /1R6 , /IR7 
LATCH STATUS , 



C 1 1 

C 1 1 

C 1 1 

C 1 1 

C 1 1 

C 1 1 

C 1 1 

C 1 1 

C 1 1 

C 1 1 

C 1 1 
END. 
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Figure 5. PLPL Specifications for thie Exampie of Figure 1 (Cont'd.) 
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Communications 



Introduction to Communication 
Systems 



Modern comm unications has developed from two major sources: 
telephones and computers. Within telephony, what was once 
straightforward engineering in the analog telephone system has 
grown over 1 00 years into the sophisticated discipline of telecom. 
Within computers, communication needs have expanded rapidly 
over the last 20 years from simple asynchronous terminals 
clustered around a host to also include networking of PCs, work- 
stations and minicomputers across rooms and continents. Both 
areas can be expected to merge into complex information net- 
works which handle all aspects of communications over the next 
1 0-20 years. A graphic portrayal of a hypothetical future system 
is shown in Figure 1. 

Historically, the system architecture, engineering and compo- 
nents of these two converging areas have been dissimilar. As 



telephony systems become digital-based and computer systems 
use more of the telephony system for interaction, this dissimilarity 
is eroding. In particular, digital design techniques developed in 
computer engineering are used increasingly in telecom. The 
original application was in the trunk links between exchanges 
where digital time division multiplexing allowed a much higher 
density of circuits over a given pair of wires. This spread into the 
switch, line card and other areas of the exchange until only the 
subscriber line to the exchange remains largely analog. Develop- 
ment of the Integrated Services Digital Network (ISDN) interna- 
tional standard will make this digital as well, enabling data and 
video as well as voice to be handled in a uniform and flexible 
manner. Future telecom engineering will therefore have much in 
common with the digital design techniques used in computers 
and the distinction between them will become minimal. 
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Along with the strong trend in equipment design towards digital 
systems, there is a parallel move towards a more global market 
for equipment. The development costs for major pieces of tele- 
com systems, such as central office switches, require a large 
market in order to amortize the costs. This implies a requirement 
for standards beyond national boundaries, permitting sales into 
foreign markets with minimum engineering changes. This in- 
creased market size on top of increased complexity of services is 
realizable only through widespread standardization. It also 
makes such a universal system more complex and therefore 
more elusive. 

The computer area has evolved around a number of proprietary 
computer families whose architectures and operating systems 
were incompatible to a large degree. The need to interconnect 
multiple computer systems with high-speed links led to a polari- 
zation around proprietary standards as local area network (LAN) 
interconnect standards were evolved by each major manufac- 
turer. Recent moves towards decentralized computing, based on 
personal workstations of varying power, has crystallized users 
around a few LAN standards, but the dream of universal LAN 
interconnectivity among equipment is still far off. 

Communications therefore requires widespread standards for 
any meaningful inter-vendor or international operation. Teleph- 
ony is the more advanced in this area. Standards efforts by the 
Bell system in the US and the local PTT (the governmental Post, 
Telephone and Telegraphy authority) in other industrial countries 
has ensured that any equipment which connects to the public 
telephone network operates fully with any other piece of equip- 
ment. 

The main international body which regulates that is the CCITT 
(Consultative Committee forTelephony and Telegraphy) through 
a series of "recommendations", which are effectively standards. 
In the USA, before deregulation, AT&T published technical stan- 
dards or "Bell Pubs" which specified the operating characteristics 
for equipment in a manner similar to and sometimes based on the 
CCITT recommendations. Since deregulation, AT&T continues 
to issue such documents as it pertains to their equipment and the 
RBOCs (Regional Bell Operating Companies) now issue their 
own either directly or through their joint research organization, 
BellCoRe (Bell Communications Research). 

Despite strenuous efforts among governing bodies, problems of 
compatibility have existed. Deregulation, the increase in the 
number of suppliers, international traffic and equipment sales, 
and most of all, the massive increase in equipment complexity 
have led to a growing demand for flexibility within telecom 
systems to handle compatibility issues. Fully compatible systems 
on a global scale remain an unrealistic goal before the billenium. 
Most systems are variants on a few widespread standards. The 
small deviations among manufacturers or countries can fre- 
quently be handled by a minimal amount of flexibility in the 
system design, such as that provided by PLDs. 

The situation on the computing side remains even more chaotic. 
Other than IBM, no vendor has managed to establish a wide- 
spread standard based on their own proprietary network. All 
others have turned to variants of the newer LAN standards, such 
as Ethernet, but even here, compatibility at the hardware levels 
does not necessarily mean compatibility at the upper or software 
levels. 



Features of Programmable Logic 
in Communications 

Programmable logic has already demonstrated its usefulness in 
computer systems in a variety of roles, including address decod- 
ing, state machines and "garbage collection". All these applica- 
tions exist in communications, but others particular to this market 
also exist. 

The problems of interconnecting dissimilar systems will remain in 
communications for a number of years. Equipment manufactur- 
ers are therefore faced with a dilemma of how to combine the 
economies of scale associated with volume production with the 
need to interface with multiple vendor equipment, some of which 
deviates from the standards. PLDs offer a unique method of 
customizing logic to conform to deviations while maintaining 
identical hardware layout and parts. 

Power and space requirements are considerations in most 
computer systems, but normally are relegated to being secon- 
dary issues when compared to performance. The opposite is true 
of telephony, where these factors assume paramount importance 
over speed. In order to satisfy requirements from both areas, 
components must therefore be available in both high-speed and 
low-power versions. 

In telephony, certain high-volume functions such as line card 
interfaces justify VLSI devices. But a large numberof other lower- 
volume functions differ among equipment vendors and applica- 
tions. Such design problems cannot be easily solved with dedi- 
cated VLSI components. With the introduction of CMOS PLDs, an 
opportunity now exists to combine higher integration with low 
power and the ability to modify designs without hardware compo- 
nent or board alteration. 

in computers, the proliferation of decentralized computing power, 
particularly in the form of PCs and workstations, has brought the 
power issue into focus as engineers attempt to minimize power 
consumption and the associated heat generation. While minis 
and mainframes continue to strive for increased speed, the bulk 
of applications now have a need for both lower-power compo- 
nents and a flexibility which permits each unit to communicate 
with as many of the office automation interconnection schemes 
as possible. Again, the advent of CMOS PLDs offers a solution to 
the system design problems associated with this. 

Basic Teiepiiony 

Basic telephony is loosely defined as the classic telephone 
system. It includes such major components as the voice tele- 
phone with twisted pair interface to the exchange, as well as the 
Private Branch Exchange (PBX), the Central Office (CO) and the 
trunk lines which interconnect them. The voice telephone has little 
scope for PLDs due to high volume and extremely low cost 
requirements. From an architecture point of view, both the PBX 
and the CO can be considered similar and divided into several 
main components: 

• The line cards which interface to subscriber lines 

• The central switching facility which routes calls through the 
exchange 

• The trunk interface which connects to other exchanges 
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A simplified structure of a PBX is shown in Figure 2. 
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Line cards are the most cost- and power-sensitive components of 
the basic telephony system. They are usually fabricated in 
volume and incorporate VLSI devices available from several 
semiconductor houses. The scope for PLDs is limited, due to 
these constraints and the limited customization needs of the 
simpler functions of the voice telephone. 

The central switch in modern PBX and CO exchanges is a digital 
memory-based system controlled by a minicomputer. As such, it 
contains many familiar opportunities for PLD applications from 
the computer industry, but a limited number that are distinctly 
communication-oriented. 

The trunk interface is an area that received little attention until 
recently as the interface was buried in the heart of the telephony 
system. Its moderate unit volume attracted little interest in dedi- 
cated silicon development. Deregulation of the Bell system in the 
US has resulted in a proliferation of equipment complying with the 
widespread T1 standard developed by AT&T for Multiplexers and 
other Channel Service Units (CSUs) which use trunk links. 

From its beginnings as a means of allowing several voice circuits 
to share the same cable, T1 has developed in application to the 
stage where it is being used in many cases where up to 24 x 64 
Kb/sec channels are required in a point-to-point configuration 
both within and between isolated customer sites. This application 
allows data and video, as well as the original voice connections, 
to be multiplexed over existing standardized links provided by 
both AT&T and the RBOCs. It is one of the fastest growing areas 



of modern telecom and is spawning related equipment such asT1 
switching nodes and T1 testers. 

The basic interface to the T1 connection can be engineered in a 
number of ways but basically consists of a transmit and receive 
section, as shown in Figure 3. 

The transmit section of a T1 mux must buffer and rate-adapt the 
bit stream of incoming channels which are to be multiplexed, 
perform a 24-to-1 multiplex function and then encode the result- 
ing bits with appropriate framing onto the line using a bipolar line 
driver. The receive section of a mux contains a corresponding line 
receiver and decoder. The recovered NRZI data is then fed into 
a deframer which detects frame synchronization and allows 
channel demultiplexing and buffering. 

Several of these functions can be efficiently implemented using 
programmable logic. The encoder and decoderfor both the North 
American T1 B8ZS and the European HDB3 coding schemes can 
be implemented in such devices (pages 2-362 and 2-384, respec- 
tively). The deframing problem for the most common current T1 
framing scheme (D4 Superframes) can also be solved with a few 
PAL devices (page 2-404). In the newer T1 Extended Super- 
Frame (ESF) standard, an error detection scheme is superim- 
posed on the framing bit pattern using a CRC-6 scheme. The 
CRC generator for this fits nicely within a single PALdevice (page 
2-431). 

Two applications are also shown for the LCA (Logic Cell Array) 
device. Page 2-435 shows a 32:1 mux for a T1 link, and page 
2-444 shows a format converter for a PBX. 
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Data Communications 

Unlike telephony, data communications is a relatively new disci- 
pline and equipment designed to interconnect data processing 
equipment evolved in parallel with mainframe computers. Such 
equipment tends to be high-speed and less concerned with power 
requirements than telephony equipment. Its main function is to 
transmit binary data in a reliable manner across a room or across 
a continent between computers compatible enough to exchange 
data. A wide variety of methods exist to do this at speeds ranging 
from 300 bits per second to 100,000,000 bits per second. As 
distance increases, the speed options available become limited. 
Generally, the options can be considered in a number of catego- 
ries, in order of increasing speed, cost and sophistication: 

• Asynchronous terminals and connections using existing 
analog phone lines 

• Synchronous terminals and dedicated data links 

• High-speed point-to-point links, including microwave and 
satellite 

• Multipoint, medium-speed local area networks 

• Multipoint backbone, high-speed local area networks 

The slowest and most widely available category includes most 
"dumb" terminals, which are normally linked to a host system 
either directly via an RS-232C/V.24 serial link or over a standard 
telephone line via a voiceband modem. Maximum data rates 
available are 9600 bits/sec and the typical modem rates are 
currently 300 and 1 200 baud. Devices used in building eithertype 
of link are typically LSI low-cost NMOS devices with most new 
devices being VLSI CMOS. Where PALdevices offerthe greatest 
advantage is in higher-performance applications for which VLSI 
solutions do not yet exist. 

There are, however, a number of additional control functions 
required around standard UART/USARTs which can be optimally 
implemented in a single PAL device, such as providing address 
latching and an asynchronous divider around a Motorola 6850 
Asynchronous Communications Interface Adapter (page 2-453). 

In the case of modems, however, several highly sophisticated 
techniques beyond that found in the standard modem "box" can 
be used to improve throughput on the link. One technique used 
to increase the bit rate across the link both in the slow, voiceband 
modems and in the higher-speed satellite modems is Quadrature 
Amplitude Modulation (QAM). Another technique uses a convo- 
lutional encoding scheme to increase the reliability of data trans- 
mitted across a noisy link. This cuts down the requirement for 
retransmission of lost data and effectively increases the through- 
put of the link. 

Both schemes can be implemented efficiently in PAL devices as 
described in the application notes on pages 2-456 and 2-463. 



Synchronous terminal links usually comply with one of a number 
of standards for which low-cost LSI devices exist. These provide 
for data rates above that of async links and generally can perform 
upto 1 Mbit/sec and beyond, but only overa limited distance, such 
as within a building. Data rates above 2400 baud over long 
distances are currently expensive and limited in popularity. 

One scheme which is expected to overcome this and gain wide 
application over the coming years is the provision of 64Kbit/sec 
data across the new digital telephone system which will be 
available on the international Integrated Services Digital Network 
(ISDN) "S" and "U" interfaces to the phone line. This is a standard 
to which virtually every telephony authority and a large number of 
major computer manufacturers have given their support. Among 
the difficulties which will be encountered is the problem of 
adapting existing terminal devices to communicate with the wave 
of VLSI CMOS devices which are being released for use on the 
ISDN "S" and "U" interface link. 

The "S" interface is being addressed by a number of VLSI 
solutions. The recently standardized "U" interface has no such 
devices and has an additional encoding/decoding requirement 
similar to trunk lines, due to the longer distances involved to the 
central office. A 4B3T coding scheme is used, which can be im- 
plemented using PLDs, as shown in the application note on page 
2-475. 

Further applications in data communications can be found in 
LANs. While several VLSI chip sets exist for standards such as 
Ethernet, a number of proprietary systems are in use and their 
numbers are increasing steadily. Some of the more basic func- 
tions required in the LAN interface, such as the Manchester 
encoding function of Ethernet, can be provided in a single PAL 
device (page 2-499). 

Conclusion 

The opportunities for the use of PLDs in communications is 
widespread and is becoming increasingly important forthefollow- 
ing reasons: 

• Power and space limitations. 

• Increasing adherence to multiple standards, yet the need 
for... 

• Standardized hardware to permit low-cost 
volume production. 

• Need to reconfigure equipment in the field 
at the minimum overhead. 

Where speed is an issue, the broad range of fast bipolar PAL 
devices provides a range of solutions. But in the more recent trend 
towards low-power circuitry, the need for which is most apparent 
in equipment such as in fanless or portable personal computers. 
The family of low-power CMOS devices provides the same func- 
tionality at a fraction of the power budget. 
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Bipolar with eight zero substitution (B8ZS) coding provides 
clear channel capability, which is required in an Integrated 
Service Digital Network (ISDN) or an open network. A (B8ZS) 
encoder and decoder can be implemented in CMOS PAL 
devices. The encoder and decoder convert between non- 
return-to-zero (NRZ) data and B8ZS while providing a flexible 
interface to framing chips. 

Principles of Coding 

Communication lines require a data stream with imbedded 
clocking information. T1 digital communication lines transmit 
Alternate Mark Inversion (AMI) coded voice and data. For AMI, 
logic "1's" are encoded as alternating positive and negative 
pulses, while logic "O's" are encoded as zero voltage. This 
coding scheme ensures that consecutive pulses alternate in 
polarity. The pulse edges provide the clocking. A pair of 
consecutive pulses with the same polarity is called a bipolar 
violation. 

Since a string of zeros in AMI coding does not contain any 
transitions, a clock cannot be derived. If this string of O's is 
too long, the system will lose synchronization. Therefore, the 
AT&T T1 standard limits the maximum number of consecutive 
O's to fifteen. For voice transmission this restriction is 
acceptable; however, for data it is not. A variation of AMI could 
provide clocking information while not restricting user data. 

B8ZS coding is such an improvement on the AMI code; logic 
"1 's" are encoded as alternating positive and negative pulses, 
while logic "O's" are encoded as zero voltage. Additionally, 
B8ZS substitutes a string of eight O's with a code word as 
shown in Table 1. The polarity of the preceding pulse 
determines which B8ZS code word is substituted. The B8ZS 
code words contain bipolar violations in the fourth and seventh 
bit positions. 

Since B8ZS coding allows strings of O's for any length, it 
supplies clear channel capability. Clear channel capability is 
required for an open network and ISDN. With B8ZS coding, the 
maximum length of consecutive zeros is seven bits. 



PRECEDING PULSE 


B8ZS CODE WORDS 


+ 


000+-0-+ 


- 


00O-+0+- 



Table 1. B8ZS Coding 



B8ZS coding is used in the following applications: 

• T1 multiplexers 

• T1 repeaters 

• T1 channel banks 

• T1 network equipment 

• Video CODECS 

Functional Description 

Implementing B8ZS coding in such equipment requires 
converting between NRZ and B8ZS format. In this design, 
CMOS PAL devices encode and decode data while providing 
flexibility. Current single chip solutions do not interface to all 
framing devices. Using the PAL devices, B8ZS line coding can 
be added to any framing technique. This design performs two 
operations: encoding and decoding. The encoding operation 
converts NRZ data into positive and negative pulses with B8ZS 
code words. Decoding converts the positive and negative 
pulses into NRZ data and substitutes eight O's for the B8ZS 
code words shown in Table 1 . 



B8ZS Encoder 

The B8ZS encoder converts NRZ data to B8ZS data. Each 
string of eight consecutive O's is replaced with a B8ZS code 
word. 

Two PAL devices can be configurated to encode the data 
stream. Figure 1 shows the PAL devices, designated, PAL_A 
and PAL_B- Signals on Figure 1 are explained in Table 2. 
PAL_A, a PALC16R8Z, implements the 3-bit counter, which 
searches for strings of eight O's. An 8-bit string of O's is 
detected by C2-C0 and NRZ_IN. PAL_A also incorporates a 
5-stage shift register, which delays the NRZ_IN data stream to 
ensure proper alignment with the counter's output. The 
delayed data, NRZ_DELAY, is encoded by PAL_B. 



PIN 


DESCRIPTION 


RST 


Active low master reset 


NRZ_CK 


External NRZ clock 


NRZ_IN 


NRZ input data stream 


C(2-0) 


Counter output 


NRZ_DELAY 


Delay NRZ data stream 


CS 


Current state 


PPO 


Positive pulse output 


NPO 


Negative pulse output 



Table 2. B8ZS Encoder Pin Description 
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Figure 1. B8ZS Encoder 



PAL_B, a PALC16R6Z, encodes the data using two state 
machines; one generates the B8ZS sequence, and the other 
generates the positive and negative pulse outputs. Figure 2a 
shows the B8ZS Sequence State Machine, which generates 
the B8ZS sequence. If an 8-bit string of O's is detected, the 
B8ZS Sequence State IVIachine forces the B8ZS code word to 

C2+C1+C0+NRZ IN 



be encoded. Otherwise, B8ZS Encoder State (Vlachine 
performs normal AMI-type encoding. The B8ZS Encoder State 
Machine encodes NRZ_DELAY into positive and negative 
pulses based on the B8ZS Sequence states, Q2-Q0 (see 
Figure 2b). The positive and negative pulses (PPO and NPO) 
occur six clock cycles after the input NRZJN. 




Figure 2a. B8ZS Sequence State Machine 



Q2*Q1*Q0*NRZ DELAY + Q2*Q1*Q0 
Q2*Q1*Q0*NRZ DELAY + Q2*Q1*Q0 V ^\ Q2*Ql*Q0*NRZ_DELAY + Q1*Q0 



ai*Qo 




(5l*Q0 



Q2*Q1*Q0*NRZ DELAY + Q2*Q1*Q0 



Figure 2b. B8ZS Encoding State Maciiine 
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B8ZS Decoder 

Decoding is performed by a PALC16R6Z and a shift register 
(see Figure 3). The PAL device searches for incoming B8ZS 
code words. When a B8ZS code word is detected, the PAL 
device substitutes an 8-bit string of O's. The output, NRZ_OUT, 
is delayed eight clocl^ cycles from the positive and negative 
pulse inputs (PPI and NPI). The device flags the B8ZS code 
words and any bipolar violations. If two bipolar violations occur 
in a non-B8ZS 8-bit data string, the device will only flag a single 
bipolar violation. The B8ZS Decoding State fvlachine shown in 
Figure 4 searches for the two B8ZS code words. The state 
machine has two groups of states; SN(0-7) searches for 
000-^-0-^- and SP(0-7) searches for 000+-0-+. The decoder's 
state machine shows reduced equations from the B8ZS and 
BPV (Bipolar Violation) outputs. Note: PPI and NPI are never 
both HIGH. 



PIN 


DESCRIPTION 


RST 


Active low master reset 


T1_CK 


External T1 clock 


PPI 


Positive pulse input 


NPI 


Negative pulse input 


DATA 


NRZ data stream 


NRZ_OUT 


NRZ output data stream 


B8ZS 


B8ZS flag 


BPV 


BPV (Bipolar Violation) flag 



Table 3. B8ZS Decoder Pin Description 
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Figure 3. B8ZS Decoder 
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PPI'NPI I 



FPI'NPI / BPV = PPI \ PPI'NPI 



B82S = PPI'NPI/ PPI'NPI 




PPI'NPI [ BPV = NPI > PPI'NPI 



Figure 4. B8ZS Decoder State Machine 
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TITLE PAL_A 

PATTERN B8ZS ENCODER PAL A 

REVISION 1.02 

AUTHOR THERESA SHAFER 

COMPANY MMI 

DATE 6/6/87 

CHIP PAL A PAL16R8 



; PINS 
;l 2 
NRZ_CK /RST 
;11 12 
/OE CO 



3 4 
NRZ_IN NC 
13 14 
CI C2 



5 6 

NC NC 
15 16 
NRZ DELAY R3 



7 

NC 
17 
R2 



8 
NC 
18 
Rl 



9 

NC 
19 
RO 



GND 
20 
VCC 



INPUTS : 



OUTPUTS : 



NRZ_CK 
/OE 
/RST 
NRZ IN 



NRZ_DELAY 
C2 - CO 



EXTERNAL CLOCK 

ACTIVE LOW OUTPUT ENABLE SIGNAL 
ACTIVE LOW MASTER RESET SIGNAL 

SERIAL NRZ DATA STREAM WHICH IS ENCODED 
AND TRANSMITTED 

DELAY NRZ DATA WHICH IS INPUT FOR PAL B 
COUNTER OUTPUTS (111) INDICATES 
AN 8 -BIT STREAM OF ALL ZEROS 



EQUATIONS 

; 3 -BIT COUNTER WITH SYNCHRONOUS RESET AND ENABLE 



/C2 := C2 * CI * CO 
+ /C2 * /CI 
+ /C2 * /CO 
+ RST 
+ NRZ IN 



MASTER RESET 

CLEAR COUNTER WHEN NRZ IN = 1 



/CI := /CI * /CO 
+ CI * CO 
+ RST 
+ NRZ_IN 

/CO := CO 
+ RST 
+ NRZ_IN 

; 5 -STAGE PIPELINE DELAY 

/NRZ_DELAY := /R3 + RST 



/R3 
/R2 
/Rl 
/RO 



= /R2 + RST 
= /Rl + RST 
= /RO + RST 
= /NRZ IN + RST 



MASTER RESET 

CLEAR COUNTER WHEN NRZ IN = 1 



MASTER RESET 

CLEAR COUNTER WHEN NRZ IN = 1 
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SIMULATION 

TRACE_ON NRZ_CK /OE /RST NRZ_IN 

C2 CI CO NRZ DELAY R3 R2 Rl RO 



SETF OE 

RST 
CLOCKF NRZ CK 



ENABLE OUTPUT 
RESET 



SETF NRZ_IN 

/RST 
CLOCKF NRZ CK 



INITIALIZE 



SETF /NRZ_IN 

FOR J: = TO 8 DO 

BEGIN 

CLOCKF NRZ_CK 

END 



PERFORM ZERO SUBSTITUTION 



SETF NRZ_IN 
CLOCKF NRZ_CK 
SETF NRZ_IN 
CLOCKF NRZ_CK 
SETF /NRZ_IN 
CLOCKF NRZ_CK 
SETF NRZ_IN 
CLOCKF NRZ CK 



ALTERNATE POSITIVE AND NEGITIVE PULSES 



SETF /NRZ_IN 

FOR J: = TO 6 DO 

BEGIN 

CLOCKF NRZ_CK 

END 



PERFORM ZERO SUBSTITUTION 



SETF NRZ_IN 
CLOCKF NRZ_CK 

TRACE OFF 
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\C) - COPYRIGHT MONOLITHIC MEMORIES INC, 1936 
PAL ASM SIMULATION SELECTIVE TRACE LISTING 



Title : PAL. ...A 

Pattern : BBZS ENCODER PAL A 

Revision : 1.02 



AuthcDT 

Companv 

Date 



THERESA SHAFER 

MM I 

6/6/87 



f"AL...,A 
Paqe : 
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TITLE PAL B 








PATTERN B8ZS ENCODER PAL B 




REVISION 1. 


04 






AUTHOR THERESA SHAFER 






COMPANY MMI 








DATE 6/6/87 








CHIP PAL_B 


PAL16R6 






;PINS 








;i 2 


3 


4 5 6 7 8 9 




NRZ CK /RST NRZ IN 


C2 CI CO NRZ DELAY NC NC 


GND 


; 11 12 


13 


14 15 16 17 18 19 


20 


/OE NC 


CS 


NPO PPO QO Ql Q2 NC 


VCC 


; INPUTS: 


NRZ CK 


EXTERNAL CLOCK 






/OE 


ACTIVE LOW OUTPUT ENABLE SIGNAL 






/RST 


ACTIVE LOW MASTER RESET SIGNAL 






NRZ_DELAY 


SERIAL NRZ DATA STREAM WHICH WAS DELAYED 
BY PAL A 






C2 - CO 


COUNTER OUTPUTS 






NRZ_IN 


NRZ DATA STREAM INDICATES AN 








8 -BIT STREAM OF ALL ZERO WHEN 

COUNTER OUTPUTS ARE 111 AND NRZ_IN = 




; OUTPUTS: 


CS 


CURRENT STATE 






PPO 


POSITIVE B8ZS CODE 






NPO 


NEGATIVE B8ZS CODE 






Q2 - QO 


B8ZS SEQUENCE STATE MACHINE STATES 




EQUATIONS 









B8ZS ENCODING STATE MACHINE EQUATIONS 

STATE VARIATES = [CS, PPO, NPO] 

WITH THE FOLLOWING STATE ASSIGNMENT 

P_ZERO = [0,0,0] 

NEGATIVE = [1,0,1] 

N_ZERO = [1#0,0] 

POSITIVE = [0,1,0] 



/CS := /CS * /NPO * /Ql * QO 
+ CS * /PPO * Ql * /QO 
+ /CS * /NPO * /NRZ_DEIAY * /Ql 
+ /CS * /NPO * Q2 * /Ql 
+ CS * NRZ^DELAY * /PPO * /Q2 * /QO 
+ RST 



/PPO := CS * /PPO * QO * /Ql 
+ /CS * /NPO * /QO 
+ CS * /NRZ_DELAY * /PPO * 
+ CS * /PPO * Q2 * /Ql 
+ RST 



/Ql 
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/NPO := /CS * /NPO * /Ql * QO 



+ /CS * /NPO * /NRZ_DELAY * /Ql 
+ /CS * /NPO * Q2 * /Ql 
+ CS * /PPO * /QO 
+ RST 



; B8ZS 


SEQUENCE STATE MACHINE EQUATIONS 


; STATE VARIATES 


[Q2,Q2 


.,Q0] 


; WITH 


THE FOLLOWING STATE ASSIGNMENTS 


A 


= [0,0,0] 






B 


= [0,0,1] 






C 


= [0,1,0] 






D 


= [1,0,0] 






E 


= [1,0,1] 






F 


= [1/1,0] 






/Q2 : = 


/Q2 * /Ql 






+ 


Q2 * Ql 






+ 


RST 






/Ql : = 


/QO 






+ 


Q2 * Ql 






+ 


RST 






/QO : = 


/Q2 * /Ql 


* /QO 


* /C2 


+ 


/Q2 * /Ql 


* /QO 


* /CI 


+ 


/Q2 * /Ql 


* /QO 


* /CO 


+ 


/Q2 * /Ql 


* /QO 


* NRZ IN 


+ 


QO 






+ 


Ql 







+ RST 



SIMULATION 

TRACE_ON NRZ_CK /OE /RST NRZ_DELAY NRZ_IN C2 CI CO 
CS PPO NPO Q2 Ql QO 

SETF OE ; ENABLE OUTPUT 

RST ; RESET 

CLOCKF NRZ_CK 

; INITIALIZE 
SETF NRZ_DELAY 

/RST 

NRZ_IN /C2 /CI /CO 
CLOCKF NRZ_CK 

; PERFORM NORMAL ENCODING 
SETF /NRZ_DELAY 

/NRZ_IN /C2 /CI /CO 
CLOCKF NRZ_CK 
SETF NRZ DELAY 
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/C2 /CI CO 
CLOCKF NRZ_CK 
SETF NRZ_DELAY 

/C2 CI /CO 
CLOCKF NRZ_CK 
SETF NRZ_DELAY 

/C2 CI CO 
CLOCKF NRZ_CK 
SETF /NRZ_DELAY 

C2 /CI /CO 
CLOCKF NRZ_CK 
SETF /NRZ_DELAY 

C2 /CI CO 
CLOCKF NRZ_CK 
SETF /NRZ_DELAY 

C2 CI /CO 
CLOCKF NRZ_CK 
SETF /NRZ_DELAY 

C2 CI /CO 
CLOCKF NRZ_CK 
SETF C2 CI CO 
CLOCKF NRZ_CK 
SETF /C2 /CI /CO 
CLOCKF NRZ_CK 
FOR I:=0 TO 5 DO 

BEGIN 

CLOCKF NRZ_CK 

END 



SETF /C2 /CI /CO 

NRZ_DELAY 
CLOCKF NRZ_CK 
SETF /NRZ_DELAY 
CLOCKF NRZ_CK 
SETF /NRZ_DELAY 
CLOCKF NRZ_CK 
SETF /NRZ_DELAY 
CLOCKF NRZ_CK 
SETF C2 CI CO 

/NRZ_DELAY 
CLOCKF NRZ_CK 
SETF /C2 /CI /CO 

/NRZ_DELAY 
CLOCKF NRZ_CK 
FOR I:=0 TO 5 DO 

BEGIN 

CLOCKF NRZ_CK 

END 



; PERFORM B8ZS SUBSTITUTION 
; FORCE VIOLATION 



; FORCE ONE 

; FORCE ZERO 

; FORCE VIOLATION 



; FORCE ONE 



; PERFORM NORMAL ENCODING 



SETF /NRZ_DELAY 

NRZ_IN C2 CI /CO 
CLOCKF NRZ_CK 
SETF C2 CI CO 
CLOCKF NRZ_CK 
SETF /C2 /CI /CO 
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CLOCKF NRZ_CK 
FOR I:=0 TO 5 DO 

BEGIN 

CLOCKF NRZ_CK 

END 



; PERFORM B8ZS SUBSTITUTION 



TRACE OFF 
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PAL ASM SIMULATION, V2-22 - MARKET RELEASE (11-19-86) 
'.O - COPYRIGHT MONOLITHIC MEMORIES INC, 1986 
PAL ASM SIMULATION SELECTIVE TRACE LISTING 



Title : 


PAL B 


Author 


: THERESA SHAf-ER 


Pattern : 


BBZS ENCODER PAL B 


Camp any 


: MM I 


RevisicDn : 


1.04 


Date 


: 6/6/87 


PAL B 








Page : 1 
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PAL....B 
Paqe : 
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TITLE B8ZS_DEC0DER 

PATTERN B8ZS DECODER PAL 

REVISION 1.05 

AUTHOR THERESA SHAFER 

COMPANY MMI 

DATE 6/6/87 

CHIP B8ZS DECODER PAL16R6 



B8ZS Coding 



; PINS 

;1 2 3 4 5 6 7 
T1_CK /RST PPI NPI NC NC NC 

;11 12 13 14 15 16 17 

/OE NC QO Ql Q2 Q3 /BPV 



8 9 

NC NC GND 

18 19 20 

/B8ZS DATA VCC 



INPUTS: T1_CK EXTERNAL CLOCK 

/OE ACTIVE LOW OUTPUT ENABLE SIGNAL 

/RST ACTIVE LOW MASTER RESET SIGNAL 
PPI SERIAL POSITIVE PUSLE DATA STREAM 

TO BE DECODED 
NPI SERIAL NEGATIVE PUSLE DATA STREAM 
TO BE DECODED 



OUTPUTS: DATA DATA WHICH IS LOADED INTO THE SHIFT 

REGISTER 
Q3 - QO STATE MACHINE OUTPUTS 
/BPV LOW ACTIVE SIGNAL WHICH INDICATES THAT 

BIPOLAR VIOLATION HAS OCCURRED 
/B8ZS LOW ACTIVE SIGNAL WHICH INDICATES THAT 
ZERO SUBSTITUTION HAS OCCURRED 



EQUATIONS 

; DATA OUTPUT 

/DATA = /PPI * /NPI; 

; /BPV AND /B8ZS FLAGS 

BPV := PPI * /Q2 * Ql 

+ PPI * /Q3 * Q2 * /QO 
+ NPI * Q2 * Ql 
+ NPI * /Q3 * Ql * /QO 
+ /PPI * /Q2 * /Ql * QO 
+ /PPI * Q3 * /Ql * QO 
+ /NPI * /Q3 * Q2 * /Ql 
+ /NPI * Q2 * /Ql * /QO; 

B8ZS := /PPI * NPI * /Q3 * Q2 * /Ql * QO 
+ PPI * /NPI * Q3 * Q2 * /Ql * QO; 

; STATE MACHINE WHICH DETECTS B8ZS SEQUENCE 

; STATE VARIABLES [Q3,Q2,Q1,Q0] 

; WITH THE FOLLOWING STATE ASSIGNMENTS 
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SPO 
SPl 
SP2 

SP3 
SP4 
SP5 
SP6 
SP7 
SNO 
SNl 
SN2 
SN3 
SN4 
SN5 
SN6 
SN7 



[0,0,1,1] 
[1,0,1,1] 
[1,0,1,0] 



[0,0, 
[0,1, 
[0,0, 
[1,1, 
[1/1, 
[1,1/ 
[0,1, 
[1/1/ 
[1/0, 
[1/0, 
[0/1/ 
[0,0, 
[0,1, 



0/0] 
0/0] 
1/0] 
0,0] 
0/1] 

1/1] 
1/1] 

1/0] 
0/0] 
0,1] 
1/0] 
0/1] 
0/1] 



/Q3 := PPI * /RST 
+ /NPI * /Q3 
+ /NPI * 
+ /NPI * 
+ /NPI * 
+ /NPI * 



/Q2 * /Ql * /QO * /RST 
Q3 * /Q2 * Ql * /QO * /RST 
/Q3 * Q2 * Ql * /QO * /RST 
Q3 * /Ql * QO * /RST 
Q3 * Q2 * QO * /RST 



+ NPI * /Q3 * Q2 */Ql * /QO * /RST; 



/Q2 : = 



PPI * Ql * /RST 
PPI * Q2 * /RST 
/NPI * /Q3 * Q2 * /QO * /RST 

* Q3 * Ql * /QO * /RST 

* /NPI * /Q3 * /Ql * /RST 

* /Q2 * Ql * QO * /RST 
/Q2 * /Ql * /QO * /RST 
Q2 * /Ql * /QO * /RST; 



/NPI 
/PPI 
/NPI 
Q3 * 
/Q3* 



/Ql := PPI * /Q3 * /Q2 * /Ql * /RST 

+ /PPI * /NPI * /Q2 * /QO * /RST 

+ /PPI * /NPI * Ql * /QO * /RST 

+ /PPI * NPI * Q3 * /Ql * /QO * /RST; 



/QO : = 

+ 
+ 
+ 
+ 
+ 
+ 
+ 



PPI 

PPI 

/PPI 

/PPI 

/PPI 

/PPI 

/PPI 

/PPI 



r /Q3 * 
r Q3 * 

* /NPI 

* /NPI 

* /NPI 

* /NPI 

* /NPI 

* NPI ^ 



/Q2 * /Ql * /QO * /RST 
/Q2 * /Ql * QO * /RST 



/Q2 * /QO * /RST 
Q3 * /QO * /RST 
/Q3 * /Q2 * /Ql * /RST 
Q3 * /Q2 * Ql * /RST 
/Q3 * Q2 * Ql * QO * /RST 
/Q3 * Q2 * /Ql * /QO * /RST; 
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SIMULATION 

TRACE_ON T1_CK /OE /RST PPI NPI 
Q3 Q2 Ql QO /BPV /B8ZS 



SETF OE 



ENABLE OUTPUT 



RST 

/PPI /NPI 
CLOCKF T1_CK 
SETF RST 
CLOCKF T1_CK 
SETF /PPI /NPI 

/RST 
CLOCKF Tl CK 



; RESET 



; NORMAL DECODING 



SETF PPI /NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF /PPI NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF PPI /NPI 
CLOCKF T1_CK 
SETF /PPI NPI 

CLOCKF T1_CK 
SETF PPI /NPI 
CLOCKF Tl CK 



SETF /PPI /NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF PPI /NPI 
CLOCKF T1_CK 
SETF /PPI NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF /PPI NPI 
CLOCKF T1_CK 
SETF PPI /NPI 
CLOCKF T1_CK 

SETF /PPI NPI 
CLOCKF T1_CK 

SETF /PPI /NPI 
CLOCKF Tl CK 



• DETECT B8ZS SEQUENCE 
; + SEQUENCE 



; SET-UP FOR NEXT SEQUENCE 



; - SEQUENCE 
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SETF /PPI /NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF /PPI NPI 
CLOCKF T1_CK 
SETF PPI /NPI 

CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF PPI /NPI 
CLOCKF T1_CK 
SETF /PPI NPI 
CLOCKF Tl CK 



SETF /PPI /NPI 
CLOCKF T1_CK 
SETF PPI /NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF PPI /NPI 
CLOCKF T1_CK 
SETF /PPI NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF /PPI NPI 
CLOCKF T1_CK 
SETF PPI /NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF PPI /NPI 
CLOCKF Tl CK 



; DETECT BIPOLAR VIOLATIONS 



TRACE OFF 
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PAL ASM SIMULATION, y2.22 - MARKET RELEASE (11-19-86) 
■:C) - COPYRIGHT MONOLITHIC MEMORIES INC, 1986 
PAL ASM SIMULATION SELECTIVE TRACE LISTING 



Title 
F-'attern 
Revi si on 



B8ZS DECODER 
B82S DECODER PAL 
1 . 05 



Author 

Companv 

Date 



THERESA SHAF-ER 

MM I 

6/6/37 



BS2S DECODER 

Page 5 1 
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B8ZS._.DEC0DER 
F'aqe : 2 
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Introduction 

A digital communication trunk system consists essentially of two 
channel banks and several repeaters on the transmission lines 
(Figure 1). A channel bank is terminal equipment which multi- 
plexes more than one channel onto a single transmission line 
using Time Division Multiplexing (TDM) technique. This TDM 
technique assigns an equal individual time slot for each channel 
to allow several channels to share a transmission line sequen- 
tially. 




Figure 1. Digital Communication Trunk System 

H\gh Density Bipolar (HDB3) code is a modified bipolar code. A 
bipolar code uses alternating polarities for successive logic 
ones. The signal edges provide the timing information. An error 
occurs when two contiguous logic ones have the same polarity; 
this is defined as a "bipolar violation." HDB3 code permits a legal 
bipolar violation to prevent four or more contiguous zeros in the 
data stream. Three is the maximum number of contiguous zeros 
allowed before a violation is inserted to maintain timing informa- 
tion for a long string of logic zeros. 

HDB3 Code 

Code conversion rules for trunk lines are defined in the CCITT 
G703 recommendation, which includes two standards. One 
standard is T-carrier standard for North America and Japan; the 
other standard is CERT standard for Europe and elsewhere. 
HDB3 code is recommended by CCITT for the CEPT standard, 
at the 2,048, 8,448, or 34,368 kbps rates. 

Violation patterns used for four-zeros-substitution in HDB3 are 
shown in Figure 2. Four possible HDB3 substitution codes are: 
■t-00+, -00-, 000-H, or 000-. The "+" indicates the positive voltage. 



"-" indicates the negative voltage, and "0" indicates the zero volt- 
age. The polarity of the preceding pulse and the number of bipo- 
lar pulses after the previous bipolar violation select the substitu- 
tion code. 



POLARITY 

PRECEDING 

PULSE 


NUMBER OF BIPOLAR PULSE 


ODD 


EVEN 


Positive 
(+) 


000+ 


-00- 


Negative 
(-) 


000- 


+00+ 



Figure 2. HDB3 Substitution Table 

HDB3 Coding Example 

An example of converting from NRZ (Non-Return-Zero) data to 
HDB3 code is shown in Figure 3. There are four sets of four suc- 
cessive zeros in the data stream. In the first group the previous 
signal is positive and the number of logic ones is odd, so the 
"000+" pattern is generated. In the second group the previous 
signal is positive and the number of logic ones is even, so the "- 
00-" pattern is generated. In the third group the previous signal is 
negative and the number of logic ones is even, so the "+00+" 
pattern is generated . A "000-" pattern is generated in the fourth 
group because the previous signal is negative and the number 
of logic ones is odd. 

HDB3 substitution codes force the fourth bit to be a bipolar viola- 
tion. The violation is chosen by the following scheme, based on 
the number of bits since the previous violation: 

(1) When the number is even, HDB3 substitution code forces the 
first zero to be a bipolar code (B) and the fourth zero to be a bi- 
polar violation (V). Therefore, four successive zeros is converted 
to a bipolar signal, following two continuous zeros, and a bipolar 
violation with the first bipolar signal (BOOV). The polarity of the 
bipolar violation (V) is the same as the polarity of the first bipolar 
signal (B). 



THE PREVIOUS 
SIGNAL 



NRZ SIGNAL STRING 
POLARITY 
NUMBER 



HDB3 SUBSTITUTION CODE 



+ 



- 



+ 00 + 



-^001-0000-11-0000-11-0000-1-0000 
+ 

2 







-0 0+ + 



- 



+00+ 000 



0> V^ ^w* ^ 

GROUP 1 GROUP 2 GROUP 3 GR0UP4 

Figure 3. An Example of HDB3 Coding 
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(2) When the number is odd, HDB3 substitution code forces the 
fourth zero to be a bipolar violation (V). Therefore, HDB3 is con- 
verted as three continuous zeros followed by bipolar violation 
with the previous bipolar signal (OOOV). The polarity of the bipo- 
lar violation is the same as the polarity of the previous bipolar 
signal. 



HDB3 Encoder Implementation 

Figure 4 shows the block diagram of an HDB3 encoder, which 
includes three state machines and one 4-bit shift register. A DE- 
TECT_4_ZER0S state machine detects four successive zeros. 
An ODD_EVEN state machine checks the number of logic ones 
after the last bipolar violation. An HDB3 encoding state machine 



generates the HDB3 code and a RESET signal which clears the 
ODD_EVEN state machine. A 4-bit shift register delays the in- 
coming NRZ data in order to synchronize with the 4-ZEROS sig- 
nal from the DETECT_4_ZER0S state machine. 

The HDB3 encoding state machine generates HDB3 code, de- 
pending on the DSIN, 4-ZEROS, ODD_EVEN, and POS_NEG 
signals. When DSIN is false and 4-ZEROS is false, the output is 
a zero. When DSIN is true and 4-ZEROS is false, the output 
generates a signal that alternates between the positive and neg- 
ative signals. When DSIN is true and 4-ZEROS is true, an error 
flag is raised. HDB3 substitution code is generated when DSIN 
is false and 4-ZEROS is true. Four possible HDB3 substitution 
codes are generated, depending on POS_NEG and 
ODD_EVEN signals. These are summarized in Figure 5. 



RESET 













EVEN 
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ODD_ 


ODD _ EVEN 
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GENERATE 

HDB3 

CODING 

STATE 

MACHINE 








( + ^ 
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DETECT 4 ZEROS 


4-ZEROS 




-PHDB 






























S3 


S2 


SI 


SO 






•NHDB 
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* 






" 




4 -BIT 




(-) 


SHIFT REGISTER 













Figure 4. Block Diagram of an HDB3 Encoder 
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POS_NEG 


ODD_EVEN 
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Zero output 
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1 








1 








1 








+00+ 





1 
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000- 
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1 


000+ 





1 


1 





-00- 













Alternating 
PHDB (+) and NHDB (-) 
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1 







1 
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Error 
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1 




1 
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1 




1 


1 






Figure 5. HDB3 Encoding State Table 
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The three Programmable Array Logic (PAL) devices used to im- 
plement the entire HDB3 encoder are a 16RA8, a 20RS8, and a 
16R4 (see Figure 6). 

The 16RA8 includes two state machines: ODD_EVEN and DE- 
TECT_4_ZER0S. The ODD_EVEN's output signal is fed into 
the HDB3 decoding state machine. If the ODD_E\/EN state ma- 
chine is triggered by the same clock edge as the decoding state 
machine, the wrong result is generated because the encoding 
state machine always gets the ODD_EVEN's output signal one 
clock early. Therefore, the ODD_EVEN state machine must be 
triggered by the falling edge of the clock to compensate for this 
timing problem. However, the DETECT_4_ZER0S state ma- 
chine is triggered by the rising edge of the clock. Both rising and 
falling edges are needed for these two state machines. The 
16RA8 device offers the programmable clock feature because 
one of the product terms in each cell is the clock input. The c lock 
flexibility of the 16RA8 allows the falling-edge clock (CLK) for 
the ODD_EVEN state machine and the rising- edge clock (CLK) 
for the DETECT_4_ZER0S state machine in a single 20-pin 
PAL device. 



The HDB3 encoding state machine diagram is shown in Figure 
7. Four registers are used for fifteen states. Two of the four reg- 
isters have ten product terms, one register has eight product 
terms, and the final register has seven product terms. Both the 
20RS4 and 20RS8 have the product-sharing feature. The 
20RS8 is used to implement this HDB3 encoding state machine 
because the product sharing feature is sixteen product terms 
shared by two registers exclusively, and in this case at least sev- 
enteen product terms are required per register pair. The design 
fits in the 20RS8 by using the product terms from two registers 
for one output, and leaving the other output as a no-connect. In 
addition to the HDB3 encoding state machine, a reset signal and 
an error message are generated in this 20RS8. The reset signal 
clears the ODD_EVEN state machine. The error message oc- 
curs when DSIN and 4_ZER0S both are high simultaneously. 

The 16R4 has four registered outputs and four combinatorial 
outputs. A 4-bit shift register is implemented using four regis- 
tered outputs. The positive HDB3 signal (PHDB) and negative 
HDB3 signal (NHDB) are generated using two combinatorial 
outputs. 




U 
E 



VCC 2£l~^'^° 

PHDB ]?] 

QINO NHDB jj] 

QIN1 QO 1^ 

Q1N2 Qi ^ 

QIN3 Q2 ^ 

DSIN iTj 

NC ^j] 



MR 
NC 
NC 
GND 



NC 



E 



->-PHDB 
->>NHDB 



Figure 6. HDB3 Encoder Implemented by Three PAL Devices 
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/DSIN* ZEROS 
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/DSIN* 
7FROS* 






























S4(0100) 
PHDB = 1 
NHDB = 
RESET = 1 
ERROR = 


/ODDEVEN 




813(0101) 
PHDB=0 
NHDB=0 
RESET = 
ERROR =0 


/ZEROS* 


812(0001) 
PHDB=0 
NHDB=1 
RESET = 1 
ERROR = 






















/DSIN* 




/DSIN* 
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ZEROS* 
/ODDEVEN 
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/ODDEVEN 
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/ZEROS 
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/DSIN* 
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/DSIN* 
ZEROS* 
ODDEVEN 












85(1100) 
PHDB=0 
NHDB=0 
RESET =0 
ERROR = 


/ZEROS* 


SO (0000) 
PHDB=0 
NHDB=0 
RESET = 
ERROR = 


/DSIN* 




83(1110) 
PHDB=0 
NHDB=1 
RESET = 
ERROR = 








811 (0011) 
PHDB=0 
NHDBrO 
RESET =0 
ERROR =0 




(^ 


/ZEROS* 




























USIN- 
/ZEROS 








DSIN* 
/ZEROS 
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I 






t 








DSIN* 
/ZEROS 






DSIN* /ZEROS 


DSIN* 
/ZEROS 








w 
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< 






1 
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/DSIN* ^_ 
/ZEROS ( 


^ 














S6(1101) 
PHDB=0 
NHDB=0 
RESET =0 
ERROR = 






81 (1000) 
PHDB a 1 
NHDB=0 
RESET = 
ERROR =0 


/DSIN* 
/ZEROS 


V 


82(1010) 
PHDB = 
NHDB=0 
RESET = 
ERROR = 






810(0010) 
PHDB=0 
NHDB=0 
RESET = 
ERROR =0 












/DSIN* 
ZEROS* 


















/ODDEVEN 
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1 
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I 






















/DSIN* 
ZEROS* 
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/DSIN* 
ZEROS* 
ODDEVEN 


fi 


LL DSIN 


*ZER 


38 


/ODDEVEN 




87(1111) 
PHDB=1 
NHDB=0 
RESET = 1 
ERROR =0 






88(0111) 
PHDBrO 
NHOB=0 
RESET =0 
ERROR =0 






814(1001) 
PHDB=0 
NHDB=0 
RESET = 
ERROR = 1 






89(0110) 
PHDB = 1 
NHDB=0 
RESET = 1 
ERROR =0 






' w^ 








/DSIN* 












/ZE 


ROS' 








/DSIN* Z 


=ROS 














, 


I 



Notes; 

1. Inputs: DSIN, ZEROS, and ODDEVEN. 

2. State Outputs: PHDB, NHDB, RESET and ERROR. 



Figure 7. HDB3 Encoding State Machine 
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HDB3 Decoder Implementation 

The block diagram of the HDB3 decoder is shown in Figure 8. A 
decoding state machine detects the HDB3 code, then either a 
Data Serial OUT (DSOUT) signal or a RESET signal is generat- 
ed. These two signals are fed into the 4-bit shift register. Two 
error flags are generated: VERROR (Violation ERROR) and ER- 
ROR. 



RESET 



4-BIT 
SHIFT REGISTER 



DSOUT 

ERROR- 

VERRORx 



DETECT 

HDB3 CODING 

STATE 

MACHINE 



Figure 8. Block Diagram of an l-iDB3 Decoder 



A single 24-pin PAL device (20RS4) and a 4-bit shift register 
(HCT194) are used to implement the HDB3 decoder (see Figure 
9). The decoding state machine diagram is shown in Figure 10. 
Four registers are used for thirteen states. Two of the four reg- 
isters have eight product terms, one register has nine product 
terms, and the final register has five product terms. Because 
the total product terms of two registers can be less than six- 
teen product terms, these four registers can be implemented 
using a 20RS4 device rather than a 20RS8. 

When an HDB3 substitution code is detected, a reset signal is 
generated to load four zeros to the HCT194. Otherwise, 
DSOUT is generated to the shift right serial input (SR SER) of 
the HCT194. When any two sequential signals of the same po- 
larity are detected except the HDB3 substitution code, the 
bipolar violation signal is generated (VERROR). The ERROR 
signal is generated when both positive HDB3 signal and nega- 
tive HDB3 signal are high simultaneously. 



PHDB 
NHDB 



I 



E 

■*E 

-E 

*E 

E 

E 

E 

E 

E 

E 

-E 



CLK 

NC 

PHDB 

NHDB 

MR 

NC 
P> 

NC 
NC 
NC 
NC 
NC 
GND 



■\y 



VCC 

RESET 

DSOUT 

NC 

QO 

Q1 

0RS4 

02 
03 
NC 
VERROR 
ERROR 
OE 



24 
22} 

E 

20] 

H 

i] 

15 



VERROR 
ERROR 




1 



Figure 9. HDB3 Decoder Implemented by a PAL Device and a Shift Register 
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Notes: 

1. Inputs: PHDB and NHDB. 

2. State Outputs: RESET, VERROR and ERROR. 
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Title HDB3 encoder logic 

Pattern HDB3ENl.pds 

Revision A 

Author Cindy Lee 

Company Monothic Memories Inc., Santa Clara, Ca 

Date 3/4/87 

CHIP HDB3EN1 PAL16RA8 



;PIN1 PIN2 PIN3 PIN4 PINS PIN6 PIN7 PINS PIN9 PINIO 
/PL CLK PHDB NHDB NRZIN RESET MR NC NC GND 



;PIN11 PIN12 PIN13 PIN14 PIN15 PIN16 
/OE ZQO ZQl ZQ2 ZEROS NC 

INPUT SIGNALS 



PIN17 PIN18 PIN19 PIN2 
OQO OQl ODDEVEN VCC 



/PL 

CLK 

PHDB 

NHDB 

NRZIN 

RESET 

MR 

/OE 



ASSERTIVE LOW PRELOAD 

EXTERNAL CLOCK 

POSITIVE HDB3 SIGNAL 

NEGATIVE HDB3 SIGNAL 

NOR-RETURN-ZERO INPUT SIGNAL 

RESTE SIGNAL TO RESET THE ODD_EVEN STATE MACHINEG 

MASTER REST 

ASSERTIVE LOW OUTPUT ENABLE 



OUTPUT SIGNALS 
ZQO, ZQl, ZQ2 : DETECT_4_ZER0S STATE VARIABLES 
ZEROS : DETECT_4_ZER0S STATE OUTPUT 
OQO, OQl : ODD_EVEN STATE VARIABLES 
ODDEVEN : ODD EVEN STATE OUTPUT 



EQUATIONS 

;16RA8 PROVIDES TWO STATE MACHINES 



; (1) THE ODD_EVEN STATE MACHINE CHECKS THE NUMBER OF BIPOLAR PULSES 
AFTER THE PREVIOUS BIPOLAR VIOLATION IS ODD OR /EVEN 



ODDEVEN 
OQl 



OQl.SETF 
OQl . CLKF 



= OQO*/OQ1 

:= NHDB*OQO*/OQ1*/RESET 
+ PHDB*OQO*/OQ1*/RESET 
+ /NHDB*/PHDB*OQO*OQ1*/RESET 
= MR 
= /CLK 



; ODD_EVEN STATE OUTPUT 
; MSB STATE VARIABLE 



MASTER RESET 

THE FALLING EDGE OF 

THE CLOCK 



OQO 



OQO . SETF 
OQO . CLKF 



:= NHDB*/0Q1*/RESET 
+ PHDB*/0Q1*/RESET 
+ OQ0*/RESET 
= MR 
= /CLK 



LSB STATE VARIABLE 
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7(2) THE 


DETECT_ 


4_ZER0S 


STATE MACHINE 


DETECTS 


FOUR CONTINUOUS ZEROS 


ZEROS 


= 


/ZQO * 


/ZQl * ZQ2 




r STATE OUTPUT 


ZQ2 


J = 


/NRZIN 


* /ZQO * ZQl * 


' /ZQ2 ; 


; MSB STATE VARIABLE 


ZQ2.SETF 
ZQ2.CLKF 


= 


MR 
CLK 




1 


; MASTER RESET 
f EXTERNAL CLOCK 


ZQl 

ZQl.SETF 

ZQl.CLKF 


'[ 


/NRZIN 

MR 

CLK 


* ZQO * /ZQ2 






ZQO 

ZQO.SETF 
ZQO.CLKF 


+ 


/NRZIN 
/NRZIN 
MR 
CLK 


* /ZQO * /ZQl 

* /ZQl * /ZQ2 


i 


■ LSB STATE VARIABLE 



SIMULATION 

TRACE_ON CLK MR NRZIN PHDB NHDB RESET 

SETF /CLK MR /NRZIN /PHDB /NHDB /RESET 
SETF CLK /MR 

SETF /CLK /MR /NRZIN /PHDB /NHDB /RESET 
SETF CLK 

SETF /CLK /NRZIN /PHDB /NHDB /RESET 
SETF CLK 

SETF /CLK /NRZIN /PHDB /NHDB /RESET 
SETF CLK 

SETF /CLK /NRZIN /PHDB /NHDB /RESET 
SETF CLK 

SETF /CLK NRZIN PHDB /NHDB RESET 
SETF CLK 

SETF /CLK NRZIN /PHDB /NHDB /RESET 
SETF CLK 

SETF /CLK NRZIN /PHDB /NHDB /RESET 
SETF CLK 

SETF /CLK /NRZIN PHDB /NHDB RESET 
SETF CLK 

SETF /CLK NRZIN /PHDB NHDB /RESET 
SETF CLK 

SETF /CLK /NRZIN PHDB /NHDB /RESET 



ODDEVEN ZEROS 

OE /PL ; RESET CONDITION 

;/DSIN, /ZEROS, /ODDEVEN 

; VECTOR 1, STATE 
;/DSIN, /ZEROS, /ODDEVEN 

; VECTOR 2, STATE 
;/DSIN, /ZEROS, /ODDEVEN 

; VECTOR 3, STATE 
;/DSIN, /ZEROS, /ODDEVEN 

; VECTOR 4, STATE 
;/DSIN, ZEROS, /ODDEVEN 

; VECTOR 5, STATE 4 
;/DSIN, /ZEROS, /ODDEVEN 

; VECTOR 6, STATE 5 
;/DSIN, /ZEROS, /ODDEVEN 

; VECTOR 7, STATE 6 
;/DSIN, /ZEROS, /ODDEVEN 

; VECTOR 8, STATE 7 
;DSIN, /ZEROS, /ODDEVEN 

; VECTOR 9, STATE 3 
7DSIN, /ZEROS, ODDEVEN 

; VECTOR 10, STATE 1 
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SETF CLK 

SETF /CLK 
SETF CLK 

SETF /CLK 
SETF CLK 

SETF /CLK 
SETF CLK 

SETF /CLK 
SETF CLK 

SETF /CLK 
SETF CLK 

SETF /CLK 
SETF CLK 

SETF /CLK 
SETF CLK 

SETF /CLK 
SETF CLK 

SETF /CLK 
SETF CLK 

SETF /CLK 
SETF CLK 

SETF /CLK 
SETF CLK 

SETF /CLK 
SETF CLK 

SETF /CLK 
SETF CLK 

SETF /CLK 
SETF CLK 

SETF /CLK 
SETF CLK 

SETF /CLK 
SETF CLK 

SETF /CLK 
SETF CLK 

SETF /CLK 
SETF CLK 



NRZIN /PHDB NHDB /RESET 
/NRZIN /PHDB /NHDB /RESET 
/NRZIN PHDB /NHDB /RESET 
/NRZIN /PHDB /NHDB /RESET 
/NRZIN /PHDB NHDB /RESET 

NRZIN /PHDB /NHDB /RESET 

NRZIN /PHDB /NHDB /RESET 
/NRZIN /PHDB /NHDB /RESET 
/NRZIN /PHDB NHDB RESET 
/NRZIN PHDB /NHDB /RESET 
/NRZIN /PHDB NHDB /RESET 

NRZIN PHDB /NHDB RESET 
/NRZIN /PHDB /NHDB /RESET 

NRZIN /PHDB /NHDB /RESET 
/NRZIN PHDB /NHDB RESET 

NRZIN /PHDB NHDB /RESET 

NRZIN /PHDB /NHDB /RESET 
/NRZIN PHDB /NHDB /RESET 



DSIN, /ZEROS, /ODDEVEN 

VECTOR 11, STATE 3 
/DSIN, /ZEROS, ODDEVEN 

VECTOR 12, STATE 
DSIN, /ZEROS, ODDEVEN 

VECTOR 13, STATE 1 
/DSIN, /ZEROS, /ODDEVEN 

VECTOR 14, STATE 2 
DSIN, /ZEROS, /ODDEVEN 

VECTOR 15, STATE 3 
/DSIN, ZEROS, ODDEVEN 

VECTOR 16, STATE 13 
/DSIN, /ZEROS, ODDEVEN 

VECTOR 17, STATE 10 
/DSIN, /ZEROS, ODDEVEN 

VECTOR 18, STATE 11 
/DSIN, /ZEROS, ODDEVEN 

VECTOR 19, STATE 12 
DSIN, /ZEROS, /ODDEVEN 

VECTOR 20, STATE 1 
DSIN, /ZEROS, ODDEVEN 

VECTOR 21, STATE 3 
/DSIN, ZEROS, /ODDEVEN 

VECTOR 22, STATE 4 
/DSIN, /ZEROS, /ODDEVEN 

VECTOR 23, STATE 5 
/DSIN, /ZEROS, /ODDEVEN 

VECTOR 24, STATE 6 
/DSIN, /ZEROS, /ODDEVEN 

VECTOR 25, STATE 7 
DSIN, /ZEROS, /ODDEVEN 

VECTOR 26, STATE 3 
/DSIN, /ZEROS, ODDEVEN 

VECTOR 27, STATE 
DSIN, /ZEROS, ODDEVEN 

VECTOR 28, STATE 1 
/DSIN, /ZEROS, /ODDEVEN 
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SETF /CLK /NRZIN /PHDB /NHDB /RESET 
SETF CLK 



/VECTOR 29, STATE 2 
;DSIN, /ZEROS, /ODDEVEN 



SETF /CLK /NRZIN /PHDB NHDB /RESET 
SETF CLK 



/•VECTOR 30, STATE 3 
;DSIN, /ZEROS, ODDEVEN 



SETF /CLK /NRZIN PHDB /NHDB /RESET 
SETF CLK 



; VECTOR 31, STATE 1 
;/DSIN, ZEROS, /ODDEVEN 



TRACE OFF 
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Title HDB3 encoder 

Pattern HDB3EN2 . pds 

Revision A 

Author Cindy Lee 

Company Monolithic Memories Inc., Santa Clara, Ca 

Date 3/9/87 

CHIP HDB3EN2 PAL20RS8 

;PIN1 PIN2 PIN3 PIN4 PINS PIN6 
CLK DSIN ZEROS ODDEVEN NC MR 

;PIN7 PINS PIN9 PINIO PINll PIN12 
NC NC NC NC NC GND 

;PIN13 PIN14 PIN15 PIN16 PIN17 PINIB 
/OE ERROR NC Q3 NC Q2 

;PIN19 PIN20 PIN21 PIN22 PIN23 PIN24 
NC Ql NC QO RESET VCC 



INPUT SIGNALS 



CLK 

DSIN 

ZEROS 

ODDEVEN 

MR 

/OE 



EXTERNAL CLOCK 

SERIAL DATA INPUT SIGNAL 

FOUR ZEROS DETECTION FLAG INPUT 

ODD OR /EVEN SIGNAL 

MASTER RESET 

ASSERTIVE LOW OUTPUT ENABLE 



OUTPUT SIGNALS 

ERROR : ERROR FLAG 
Q3, Q2, Ql, QO : HDB3 DECODING STATE MACHINE STATE VARIABLES 
RESET : RESET SIGNAL 



EQUATIONS 

;20RS8 GENERATES TWO SIGNALS AND ONE STATE MACHINE 

;(1) A RESET SIGNAL CLEARS THE ODD_EVEN STATE MACHINE 

RESET = /QO * Q2 * /Q3 ; STATE 4 AND STATE 9 

+ QO * /Ql * /Q2 * /Q3 ; STATE 12 

+ QO * <21 * Q2 * Q3 ; STATE 7 



;(2) A ERROR SIGNAL IS RAISED WHEN DSIN AND 4-ZEROS BOTH ARE 
HIGH AT THE SAME TIME 

ERROR = QO * /Ql * /Q2 * Q3 ; STATE 14 



;(3) HDB3 ENCODING STATES MACHINE 

Q3 := /QO * /Ql * Q2 * /MR ; MSB STATE VARIABLE 
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+ /QO * /Q2 * Q3 * /ZEROS * /MR 

+ DSIN * /QO * ZEROS * /MR 

+ DSIN * Q2 * Q3 * /MR 

+ DSIN * /Ql * /Q2 * /Q3 * /MR 

+ DSIN * /Ql * ZEROS * /MR 

+ /Ql * Q2 * Q3 * /MR 

+ DSIN * /Q3 * ZEROS * /MR 

+ QO * Ql * Q2 * /Q3 * /MR 

+ QO * Q2 * Q3 * /ZEROS * /MR 



Q2 := /DSIN * /QO * /Ql * Q2 * /MR 

+ /DSIN * QO * Ql * Q2 * /Q3 * /MR 

+ /QO * /Ql * Q2 * /ZEROS * /MR 

+ /Ql * Q2 * Q3 * /ZEROS * /MR 

+ QO * Ql * Q2 * /Q3 * /ZEROS * /MR 

+ DSIN * /QO * /Q2 * Q3 * /ZEROS * /MR 

+ DSIN * QO * Q2 * Q3 * /ZEROS * /MR 

+ /DSIN * /QO * Q3 * ZEROS * /MR 

+ /DSIN * /Ql * /Q2 * /Q3 * ZEROS * /MR 

+ /DSIN * Q2 * Q3 * ZEROS * /MR 



Ql := /DSIN * /QO * Ql * /Q3 * /MR 
+ /DSIN * QO * /Ql * Q2 * /MR 
+ /QO * Ql * /Q3 * /ZEROS * /MR 
+ QO * /Ql * Q2 * /ZEROS * /MR 
+ /QO * /Q2 * Q3 * /ZEROS * /MR 
+ QO * Q2 * Q3 * /ZEROS * /MR 
+ /DSIN * /QO * /Q2 * Q3 * /MR 
+ /DSIN * QO * Q2 * Q3 * /MR 



QO := DSIN * /QO * ZEROS * /MR 7 LSB STATE VARIABLE 
+ DSIN * /Ql * ZEROS * /MR 
+ /Ql * Q2 * Q3 * /MR 
+ Ql * /Q2 * /Q3 * /MR 
+ DSIN * Q2 * ZEROS * /MR 
+ ODDEVEN * /QO * Q3 * ZEROS * /MR 
+ ODDEVEN * /QO * /Q2 * ZEROS * /MR 

SIMULATION 

TRACE_ON MR CLK DSIN ZEROS ODDEVEN Q3 Q2 Ql QO RESET ERROR 

SETF OE MR DSIN ZEROS ODDEVEN ; RESET CONDITION 
CLOCKF CLK 



SETF /MR 

SETF DSIN 
CLOCKF CLK 



ZEROS /ODDEVEN 



SETF /DSIN /ZEROS /ODDEVEN 
CLOCKF CLK 



;VECTOR 1, STATE 14, 1001 
; ERROR 

/VECTOR 2, STATE 0, 0000 



SETF DSIN /ZEROS /ODDEVEN 
CLOCKF CLK 



;VECTOR 3, STATE 1, 1000 
; PHDB 
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SETF /DSIN 
CLOCKF CLK 

SETF /DSIN 
CLOCKF CLK 

SETF /DSIN 
CLOCKF CLK 

SETF /DSIN 
CLOCKF CLK 

SETF DSIN 
CLOCKF CLK 

SETF DSIN 
CLOCKF CLK 

SETF /DSIN 
CLOCKF CLK 

SETF /DSIN 
CLOCKF CLK 

SETF /DSIN 
CLOCKF CLK 

SETF /DSIN 
CLOCKF CLK 

SETF DSIN 
CLOCKF CLK 

SETF DSIN 
CLOCKF CLK 

SETF /DSIN 
CLOCKF CLK 

SETF /DSIN 
CLOCKF CLK 

SETF /DSIN 
CLOCKF CLK 

SETF /DSIN 
CLOCKF CLK 

SETF DSIN 
CLOCKF CLK 

SETF /DSIN 
CLOCKF CLK 

SETF /DSIN 
CLOCKF CLK 



ZEROS ODDEVEN 



/ZEROS 
/ZEROS 
/ZEROS 
/ZEROS 
/ZEROS 

ZEROS 
/ZEROS 
/ZEROS 
/ZEROS 
/ZEROS 
/ZEROS 

ZEROS 
/ZEROS 
/ZEROS 
/ZEROS 
/ZEROS 

ZEROS 
/ZEROS 



ODDEVEN 

ODDEVEN 

ODDEVEN 

/ODDEVEN 

ODDEVEN 

/ODDEVEN 

/ODDEVEN 

/ODDEVEN 

/ODDEVEN 

/ODDEVEN 

ODDEVEN 

/ODDEVEN 

/ODDEVEN 

/ODDEVEN 

/ODDEVEN 

/ODDEVEN 

ODDEVEN 

ODDEVEN 



;VECTOR 4, STATE 8, 0111 



; VECTOR 5, STATE 5, 1100 



; VECTOR 6, STATE 6, 1101 



; VECTOR 7, STATE 7, 1111 
;PHDB, RESET 

; VECTOR 8, STATE 3, 1110 
;NHDB 

; VECTOR 9, STATE 1, 1000 
; PHDB 

; VECTOR 10, STATE 9, 0110 
;NHDB, RESET 

/•VECTOR 11, STATE 10, 0010 



; VECTOR 12, STATE 11, 0011 



/VECTOR 13, STATE 12, 0001 
;NHDB, RESET 

; VECTOR 14, STATE 1, 1000 
; PHDB 

; VECTOR 15, STATE 3, 1110 
;NHDB 

; VECTOR 16, STATE 4, 0100 
;PHDB, RESET 

; VECTOR 17, STATE 5, 1100 



; VECTOR 18, STATE 6, 1101 



; VECTOR 19, STATE 7, 1111 
;PHDB, RESET 

; VECTOR 20, STATE 3, 1110 
;NHDB 

/VECTOR 21, STATE 13, 0101 



/VECTOR 22, STATE 10, 0010 
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SETF /DSIN /ZEROS ODDEVEN ; VECTOR 23, STATE 11, 0011 
CLOCKF CLK 

SETF /DSIN /ZEROS ODDEVEN ;VECTOR 24, STATE 12, 0001 
CLOCKF CLK ;NHDB, RESET 

TRACE OFF 



PALASM SIMULATION, V2.22 - MARKET RELEASE (11-19-86) 
(C) - COPYRIGHT MONOLITHIC MEMORIES INC, 1986 
PALASM SIMULATION SELECTIVE TRACE LISTING 
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Title HDB3 shift register 

Pattern HDB3EN3 . pds 

Revision A 

Author Cindy Lee 

Company Monothic Memories Inc. , Santa Clara, Ca 

Date 2/24/87 

CHIP HDB3EN3 PAL16R4 



;PIN1 PIN2 PIN3 PIN4 PINS PIN6 PIN7 PINS PIN9 PINIO 
CLK NRZIN QINO QINl QIN2 QIN3 MR NC NC GND 

;PIN11 PIN12 PIN13 PIN14 PIN15 PIN16 PIN17 PIN18 PIN19 PIN20 
/OE NC NC DSIN Q2 Ql QO NHDB PHDB VCC 

INPUT SIGNALS 

CLK : EXTERNAL CLOCK 
NRZIN : NON-RETURN- ZERO INPUT SIGNAL 
QINO, QINl, QIN2, QIN3 : FOUR STATE VARIABLE INPUTS 
MR : MASTER RESET 
/OE : ASSERTIVE LOW OUTPUT ENABLE 

OUTPUT SIGNALS 
DSIN, Q2, Ql, QO : 4-BIT SHIFT REGISTER OUTPUT SIGNALS 
PHDB : POSITIVE HDB3 SIGNAL 
NHDB : NEGATIVE HDB3 SIGNAL 



EQUATIONS 

;16R4 GENERATES TWO HDB SIGNALS AND PROVIDES A 4-BIT SHIFT REGISTER 

;(1) A POSITIVE HDB3 AND A NEGATIVE HDB3 SIGNALS ARE GENERATED 



/PHDB = QINO * /QINl 

+ /QINO * QIN2 * QIN3 
+ QINl * QIN2 * /QIN3 
+ QINl * /QIN2 
+ /QIN2 * /QIN3 

/NHDB = /QINO * /QINl 
+ QINl * /QIN2 
+ /QIN2 * QIN3 
+ QINO * QIN2 



THE PHDB IS GENERATED AT 
STATE 1, STATE 4, OR STATE 7 
THIS EQUATION IS THE INVERTING 
LOGIC OF THE PHDB SIGNAL 



THE NHDB IS GENERATED AT 
STATE 3, STATE 9, OR AT ATE 12 
THIS EQUATION IS THE INVERTING 
LOGIC OF THE NHDB SIGNAL 



;(2) THIS IS A 4-BIT SHIFT REGISTER WITH MASTER RESET 



/QO 



:= /NRZIN 
+ MR 



LSB OF SHIFT REGISTER 
MASTER RESET 



/Ql := /QO 
+ MR 



/Q2 



:= /Ql 
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MR 



/DSIN := /Q2 
+ MR 



; MSB OF SHIFT REGISTER 



SIMULATION 

TRACE_ON MR CLK QINO QINl QIN2 QIN3 PHDB NHDB 

SETF OE MR QINO /QINl /QIN2 QIN3 NRZIN 
CLOCKF CLK 

SETF /MR QINO /QINl /QIN2 QIN3 NRZIN 
CLOCKF CLK 

SETF /MR QINO /QINl /QIN2 QIN3 NRZIN 
CLOCKF CLK 

SETF /MR /QINO /QINl /QIN2 /QIN3 /NRZIN 
CLOCKF CLK 

SETF /MR /QINO /QINl /QIN2 QIN3 NRZIN 
CLOCKF CLK 

SETF /MR /QINO QINl /QIN2 QIN3 /NRZIN 
CLOCKF CLK 

SETF /MR /QINO /QINl QIN2 QIN3 /NRZIN 
CLOCKF CLK 

SETF /MR QINO /QINl QIN2 QIN3 NRZIN 
CLOCKF CLK 

SETF /MR QINO QINl QIN2 QIN3 NRZIN 
CLOCKF CLK 

SETF /MR QINO /QINl QIN2 /QIN3 /NRZIN 
CLOCKF CLK 

SETF /MR /QINO QINl /QIN2 /QIN3 NRZIN 
CLOCKF CLK 

SETF /MR QINO QINl /QIN2 /QIN3 /NRZIN 
CLOCKF CLK 

SETF /MR QINO /QINl /QIN2 /QIN3 NRZIN 
CLOCKF CLK 



NRZIN DSIN 
; RESET CONDITION 

; VECTOR 1 

; VECTOR 2 

; VECTOR 3 



; VECTOR 4, PHDB 
;DSIN STARTS 

/VECTOR 5 
;DSIN 

; VECTOR 6 
;/DSIN 

/VECTOR 7 
;DSIN 

/VECTOR 8, PHDB 
;/DSIN 

; VECTOR 9 
7 /DSIN 

; VECTOR 10 
;DSIN 

; VECTOR 11 
;DSIN 

/VECTOR 12, NHDB 
//DSIN 



TRACE OFF 
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Title HDB3 decoder 

Pattern HDB3 DE . pds 

Revision A 

Author Cindy Lee 

Company Monolithic Memories Inc., Santa Clara, Ca 

Date 3/12/87 

CHIP HDB3DE PAL20RS4 

;PIN1 PIN2 PIN3 PIN4 PINS PIN6 
CLK NC PHDB NHDB MR NC 

;PIN7 PINS PIN9 PINIO PINll PIN12 
NC NC NC NC NC GND 

;PIN13 PIN14 PIN15 PIN16 PIN17 PIN18 
/OE ERROR VERROR NC Q3 Q2 

;PIN19 PIN20 PIN21 PIN22 PIN23 PIN24 
Ql QO NC DSOUT RESET VCC 



INPUT SIGNALS 



CLK 

PHDB 

NHDB 

MR 

/OE 



EXTERNAL CLOCK 

POSITIVE HDB3 SIGNAL 

NEGATIVE HDB3 SIGNAL 

MASTER RESET 

ASSERTIVE LOW OUTPUT ENABLE 



OUTPUT SIGNALS 

ERROR : ERROR FLAG 

VERROR : 3IP0LAR VIOLATION ERRIR 
Q3, Q2, Ql, QO : HDB3 DECODING STATE VARIABLE OUTPUTS 

DSOUT : DATA SERIAL OUTPUT 

RESET : RESET SIGNAL 



EQUATIONS 

;THE 20RS4 GENERATES ONE SIGNAL, TWO ERROR FLAGS 
;AND A HDB3 DECODING STATE MACHINE 

;(1) A RESET SIGNAL IS GENERATED TO CLEAR 
THE 4-BIT SHIFT REGISTER 



RESET = QO * Q2 * /Q3 * /PHDB * NHDB 
+ Ql * /Q2 * Q3 * PHDB * /NHDB 



STATE 8 OR STATE 9 
WITH /PHDB * NHDB 
STATE 3 OR STATE 4 
WITH PHDB * /NHDB 



;(2) A VERROR SIGNAL IS GENERATED TO INDICATE THE VIOLATION ERROR 

VERROR = /QO * Ql * Q2 * Q3 ; STATE 11 

+ QO * /Ql * Q2 * Q3 ; STATE 12 
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CLOCKF CLK 



SETF /MR PHDB NHDB 
CLOCKF CLK 

SETF /MR /PHDB /NHDB 
CLOCKF CLK 

SETF /MR /PHDB /NHDB 
CLOCKF CLK 

SETF /MR PHDB /NHDB 
CLOCKF CLK 

SETF /MR /PHDB /NHDB 
CLOCKF CLK 

SETF /MR /PHDB /NHDB 
CLOCKF CLK 

SETF /MR /PHDB /NHDB 
CLOCKF CLK 

SETF /MR PHDB /NHDB 
CLOCKF CLK 

SETF /MR /PHDB NHDB 
CLOCKF CLK 

SETF /MR PHDB /NHDB 
CLOCKF CLK 

SETF /MR /PHDB NHDB 
CLOCKF CLK 

SETF /MR /PHDB /NHDB 
CLOCKF CLK 

SETF /MR /PHDB /NHDB 
CLOCKF CLK 

SETF /MR /PHDB NHDB 
CLOCKF CLK 

SETF /MR PHDB /NHDB 
CLOCKF CLK 

SETF /MR /PHDB NHDB 
CLOCKF CLK 

SETF /MR PHDB /NHDB 
CLOCKF CLK 

SETF /MR /PHDB /NHDB 
CLOCKF CLK 



; VECTOR 1, STATE 0, 0000 
; ERROR 

; VECTOR 2, STATE 0, 0000 



; VECTOR 3, STATE 0, 0000 



; VECTOR 4, STATE 1, 1000 



/VECTOR 5, STATE 2, 1001 



; VECTOR 6, STATE 3, 1011 



; VECTOR 1, STATE 4, 1010 



; VECTOR 8, STATE 5, 0010 
; RESET 

;VECTOR 9, STATE 6, 0110 



; VECTOR 10, STATE 1, 1000 



; VECTOR 11, STATE 6, 0110 



; VECTOR 12, STATE 7, 0100 



; VECTOR 13, STATE 8, 0101 



; VECTOR 14, STATE 10, 1111 
; RESET 

; VECTOR 15, STATE 1, 1000 



; VECTOR 16, STATE 6, 0110 



; VECTOR 17, STATE 1. 1000 



; VECTOR 18, STATE 2, 1001 
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;(3) A ERROR SIGNAL IS GENERATED WHEN PHDB AND NHDB BOTH ARE HIGH 
AT THE SAME TIME, 

ERROR = NHDB * PHDB ; BOTH PHDB AND NHDB 

; ARE HIGH 

;(4) A DATA SERIAL OUTPUT SIGNAL IS GENERATED 

DSOUT = NHDB 
+ PHDB 

;(5) THE HDB3 DECODING STATE MACHINE 

Q3 := /NHDB * /PHDB * /Q2 * Q3 * /MR ; MSB STAE VARIABLE 
+ /NHDB * PHDB * /QO * /Q3 * /MR 
+ /NHDB * /QO * Ql * /Q2 * /Q3 * /MR 
+ /NHDB * /QO * Ql * Q2 * Q3 * /MR 
+ /NHDB * /Ql * /Q2 * Q3 * /MR 
+ /NHDB * PHDB * QO * Q2 * /MR 
+ NHDB * /PHDB * Q2 * /Q3 * /MR 
+ NHDB * /PHDB * QO * Q2 * /MR 

Q2 := /NHDB * PHDB * /QO * Ql * /Q2 * /Q3 * /MR 
+ /NHDB * PHDB * /QO * Ql * Q2 * Q3 * /MR 
+ /NHDB * PHDB * /Ql * /Q2 * Q3 * /MR 
+ NHDB * /PHDB * /QO * /Q3 * /MR 
+ NHDB * /PHDB * Ql * Q3 * /MR 
+ NHDB * /PHDB * /Q2 * Q3 * /MR 
+ /PHDB * Q2 * /Q3 * /MR 
+ /PHDB * QO * Q2 * /MR 

Ql := /NHDB * /PHDB * QO * Q3 * /MR 
+ /NHDB * PHDB * /Q2 * Q3 * /MR 
+ /NHDB * PHDB * /QO * Ql * /Q2 * /MR 
+ /NHDB * PHDB * /QO * Ql * Q3 * /MR 
+ NHDB * /PHDB * /QO * /Q2 * /MR 
+ NHDB * /PHDB * /QO * Ql * Q3 * /MR 
+ /PHDB * QO * /Q2 * Q3 * /MR 
+ /PHDB * Ql * /Q2 * Q3 * /MR 
+ /PHDB * QO * Q2 * /Q3 * /MR 

QO := /NHDB * /PHDB * /Ql * /Q2 * Q3 * /MR 

+ NHDB * /PHDB * Q2 * /Q3 * /MR ; LSB STATE VARIABLE 
+ /PHDB * /Ql * Q2 * /Q3 * /MR 
+ /PHDB * QO * Q2 * /Q3 * /MR 
+ NHDB * /PHDB * QO * Q2 * /MR 



SIMULATION 

TRACE_ON MR CLK PHDB NHDB DSOUT Q3 Q2 Ql QO RESET VERROR ERROR 

SETF OE MR PHDB NHDB ; RESET CONDITION 
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SETF /MR /PHDB /NHDB 
CLOCKF CLK 

SETF /MR PHDB /NHDB 
CLOCKF CLK 

SETF /MR /PHDB NHDB 
CLOCKF CLK 

SETF /MR /PHDB /NHDB 
CLOCKF CLK 

SETF /MR /PHDB /NHDB 
CLOCKF CLK 

SETF /MR /PHDB NHDB 
CLOCKF CLK 

SETF /MR /PHDB NHDB 
CLOCKF CLK 

TRACE OFF 



/VECTOR 19, STATE 3, 1011 



•VECTOR 20, STATE 5, 0010 
; RESET 

; VECTOR 21, STATE 6, 0110 



; VECTOR 22, STATE 7, 0100 



; VECTOR 23, STATE 8, 0101 



;VECTOR 24, STATE 9, 0111 



; VECTOR 25, STATE 10, 1111 
; RESET 
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With the rapid increase in popularity of T1 trunk links following 
the deregulation of AT&T, the transparent nature of the 24 data 
channels must be supported by handling of the framing bits. 
These must comply with certain standards, the most common 
of which is known as D4. 

T1 -Carriers 

l\/lore than 100 million miles of T-carrier lines are installed in the 
U.S. and Canada. They are full-duplex digital communication 
links with a specific bit rate and format. T-carrier lines' 
transmission quality and reliability exceeds that of analog 
lines. T-carrier lines are available in a range of transmission 
rates between 1 .544 Mbps and 274 Mbps, designated T1 , T1C, 
T2, T3, and T4. 

The most common standard, T1 can be considered a building 
block for the others. T1 transmits at 1.544 Mbps. AT&T 
specifications define D1, D1D, D2, D3, D4 and ESF framing 
structures for T1 carriers. These standards specify the 
framing-bit pattern and frame synchronization. D1 was 
established in 1969 and has been superseded by the later 
standards. D1 D, D2, D3, and D4 framing formats group twelve 
frames into a superframe. Today, the D4 framing format is the 
most prevalent. The future format is ESF, also called Fe, which 
groups twenty-four frames into an extended superframe. It is 
presently under definition, and will be supported in the future. 

D4 Framing Format 

The T1 standard for D4 framing defines a superframe as a 
group of twelve frames. A frame consists of twenty-four 8-bit 
channels plus one framing bit for a total of 193 bits. The 8-bit 
channel consists of one polarity bit and seven magnitude bits; 
however, in frames six and twelve, the eighth bit in each 
channel carries signaling information. Since this bit cannot 
transmit user data (either voice or data) it is called robbed-bit 
signaling. The signaling bit transmits requests for services and 
calling information. At the 1 .544 Mbps transmission rate, 8 
kbps are overhead for framing; so the user transmission rate is 
1.536 Mbps. Because the user transmission rate is divided 
into twenty-four channels, each channel transmits at 64 kbps. 

A frame bit should occur every 193rd bit in the data stream. 
Frame detection occurs when a sequence of sampled frame 
bits matches the frame pattern. However, occasionally data 
can produce the same pattern. False framing occurs when the 
frame synchronization circuitry mistakes data bits for the 
framing bits. 

A frame-bit error occurs when the incoming frame bit does not 
match the expected framing bit. If two out of four consecutive 
frame bits are in error, then frame synchronization is "lost." 
Once frame synchronization is "lost," a search for the frame 
pattern is started. AT&T T1 standards specify the maximum 
time to search and re-synchronize as 50 ms. This time period 
is called the reframe time. 

For the D4 frame format, the framing bits are divided into two 



groups: Ft and Fs. The terminal framing bit (Ft) identifies frame 
boundaries. The signaling frame bit (Fs) identifies the 
superframe boundary and the frames containing signaling bits. 
Ft and Fs are interleaved to create the 12-bit framing pattern 
shown in Table 1 This pattern is repeated every twelve frames. 



FRAME 


TERMINAL 

FRAMING 

Ft 


SIGNALING 

FRAMING 

Fs 


SUPER- 
FRAMING 
SF 


1 


1 


— 


1 


2 


— 








3 





- 





4 


— 








5 


1 


- 


1 


6 


— 


1 


1 


7 





- 





8 


— 


1 


1 


9 


1 


- 


1 


10 


— 


1 


1 


11 





— 





12 


- 









Table 1. Framing Pattern 

Hardware Implementation 

The frame synchronization logic was implemented in PAL 
devices because current single-chip solutions do not offer 
enough flexibility in line coding. Using PAL devices, this 
design has the flexibility to interface various line coding 
devices. 

The PAL devices implement the frame synchronization logic, 
which can be divided into three sections. The first section 
detects the D4 framing pattern. The second section searches 
for frame synchronization. The third section provides the su- 
perframe timing. These are implemented using four CMOS PAL 
devices, a shift register, and a counter as shown in Figure 1 . 

The frame detection hardware samples every 193rd bit of the 
data stream. The data is sampled on the positive edge of 
T1_CK. Either a shift register or a state machine can be used 
to sampje the framing bits. The data stream is shifted into a 
(193x23J-bit shift register with taps every 193 bits. The taps 
are fed into two identical Frame Pattern Detection (FPD) PAL 
devices. The more taps sampled, the less likely a data pattern 
will reproduce the framing pattern. In this design, twenty-four 
taps ensure reasonable prevention of false framing. 
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Each FPD device monitors twelve framing bits. If tfie framing 
pattern is found, a FP_DETB signal is generated. From these 
signals, the controller determines the correct framing pattern. 
The FPD's FRM1 B indicates the start of a superframe. 

The first FPD's inputs are the first eleven taps from the shift 
register plus the data input stream. The second FDP monitors 
the remaining taps from the shift register as shown in Figure 1. 

The control section is a controller and a counter, which search 
for frame synchronization. The controller, SYNC PAL, uses the 
193-bit counter to determine if the frame detection signals are 
spaced every 193 bits. First the controller searches for the 
first framing pattern. During the search operation, SYNC PAL 



waits for the FPD's frame detection signals. When a framing 
pattern is found, the counter is reset. If 193 bits later, the next 
expected framing bit is found, the controller is "in sync." It 
continues checking the framing pattern every 193 bits. If the 
framing pattern is not found, it assumes that the incoming 
framing bit is in error. If the next four framing bits are correct, 
the controller remains "in sync." If, during this time, a second 
frame-bit error occurs, the framing is "lost." The controller, 
then, asserts the lost frame signal, NO_FRM. Once the framing 
is "lost," the controller begins searching for the correct frame 
pattern. While the controller searches for the correct frame 
pattern, the previous frame synchronization is maintained until 
the controller re-synchronizes. 
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Figure 1. Frame Synchronization Logic 
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Figure 2 shows the sync, state machine for the controller. 
Signals FPD1B and FPD2B (from the FPDs) indicate a T1 frame 
pattern detection, while SOF (from the counter) indicates 193 
counts. Beginning in states F and G, the controller searches 
for the frame pattern. Once the framing has been found, the 
state machine enters state A. At state A SUNK is asserted, 
which indicates "in sync." The state machine remains in state A 
until a single frame-bit error occurs. Upon the first frame-bit 
error, the state machine enters state B. If the next four framing 
bits are correct, the state machine sequences through states 
C, D, and E, which returns to state A. However, if during this 
time, a second frame bit error occurs, the state machine enters 
state F. Loss of T1 carrier (N0_T1CB) or reset (RSTB) forces 
the state machine into the searching state. 

SYNC PAL performs two other miscellaneous functions. When 
a single framing-bit error is encountered, the device corrects 
the bit. The corrected version of the input data stream, BIT, is 
fed into the shift register. The device also generates the 
BYT_CK. A single pulse is asserted for every channel for 
external serial-to-parallel (S/P) data conversion. For this imple- 
mentation BYT CK is asserted when the counter's three least 



significant bits are all zeros. Depending on the requirements of 
the S/P converter, any combination of the counter's three least 
significant bits could generate BYT_CK for the correct 
alignment with the channel sample. 

SUPER FRAME PAL generates the superframe timing. It 
generates FRM_6 and FRM_12 by counting the frames. These 
signals indicate the frames containing the signaling bits. The 
CMOS PAL device implements a 4-bit counter. When the frame 
is "in sync," SOF increments the counter, while FRM_1 clears 
it. If the controller is "out of frame sync," the counter continues 
counting, based on the last known frame position. 



Design Recommendations 

While this design meets the D4 specification, one design 
improvement might be useful. A signal flagging single-framing 
bit errors could be added. Monitoring this signal indicates the 
amount of random noise on the line. This signal could warn that 
a line is noisy even though frame synchronization was never 
lost. 



FPD1 B * FPD2B • SOF + SOF 




FPD1B«FPD2B*S0F 



FPD1B*FPD2B'S0F 



NOTE: "BITA = FPD1B * FPD2B * SOF * DATA + 
SOF*DATA+ 

FPD1B*S0F*DATA+ 
FPD2B* SOF 'DATA 



Figure 2. Sync State Machine 
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TITLE FRAME_DETECTION_PAL 

PATTERN Tl FRAME DETECTION EAL FOR Tl INTERFACE 

REVISION PI. 04 

AUTHOR STEVE PATTERSON AND THERESA SHAFER 

COMPANY 

DATE 6/4/87 

This PAL device monitors 12 193rd bits in the incoming Tl NRZ data s 
It detects any valid Frame Pattern (start of any Frame) and the start 
of Frame 1. 



CHIP FPD PAL16L8 



; PINS 

;i 

SRDO 

;ii 

SRD9 



2 

SRDl 
12 
FRMIB 



3 

SRD2 
13 
SRDIO 



4 

SRD3 
14 
SRDll 



5 

SRD4 
15 
NC 



6 7 
SRD5 SRD6 
16 17 
FRMl 6B NC 



8 

SRD7 
18 
SOF 



9 10 
SRD8 GND 
19 20 
FP DETB VCC 



INPUTS : 



OUTPUTS ; 



SRD(ll-O) 
SOF 

FP_DETB 

FRMIB 

FRMl 6B 



FRAMING PATTERNS 



FRM_1 

FRM_2 

FRM_3 

FRM_4 

FRM_5 

FRM_6 

FRM_7 

FRM_8 

FRM_9 

FRM_10 

FRM_11 

FRM 12 



S S S 

R R R 

D D D 

119 

1 

1,0,0, 

0,1,0, 

0,0,1, 

1,0,0, 

1,1,0, 

1,1,1, 

0,1,1, 

1,0,1, 

1,1,0, 

0,1,1, 

0,0,1, 

0,0,0, 



FRAMING BITS 

LAST KNOWN START OF FRAME 

ACTIVE LOW SIGNAL INDICATING FRAMING 

PATTERN DETECTED 
ACTIVE LOW SIGNAL INDICATING START OF 

FRAME 1 
ACTIVE LOW SIGNAL INDICATING FRAME 1 

TO 6 DETECTED 



SSSSSSSSS 
RRRRRRRRR 
DDDDDDDDD 
876543210 



0,1,1, 
0,0,1, 
0,0,0, 
1,0,0, 
0,1,0, 
0,0,1, 
1,0,0, 
1,1,0, 
1,1,1, 
0,1,1, 
1,0,1, 
1,1,0, 



0,1, 
1,0, 

1,1, 

0,1, 
0,0, 
0,0, 
1,0, 
0,1, 
0,0, 
1,0, 

1,1, 
1,1, 



1,1,0,0 
1,1,1,0 
0,1,1,1 
1,0,1,1 
1,1,0,1 
0,1,1,0 
0,0,1,1 
0,0,0,1 
1,0,0,0 
0,1,0,0 
0,0,1,0 
1,0,0,1 



EQUATIONS 

/FRMIB = SOF * SRDll * /SRDIO * /SRD9 * /SRD8 * SRD7 * SRD6 * /SRD5 
* SRD4 * SRD3 * SRD2 * /SRDl * /SRDO 



/FRM1_6B = SRDll * /SRDIO * /SRD9 * /SRD8 * SRD7 * SRD6 * /SRD5 
* SRD4 * SRD3 * SRD2 * /SRDl * /SRDO 
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/SRDll * SRDIO * /SRD9 * /SRD8 * /SRD7 * SRD6 * SRD5 
* /SRD4 * SRD3 * SRD2 * SRDl * /SRDO 

/SRDll * /SRDIO * SRD9 * /SRD8 * /SRD7 * /SRD6 * SRD5 
SRD4 * /SRD3 * SRD2 * SRDl * SRDO 
/SRDIO * /SRD9 * SRD8 * /SRD7 * /SRD6 * /SRD5 
SRD4 * SRD3 * /SRD2 * SRDl * SRDO 
SRDIO * /SRD9 * /SRD8 * SRD7 * /SRD6 * /SRD5 
/SRD4 * SRD3 * SRD2 * /SRDl * SRDO 
SRDIO * SRD9 * /SRD8 * /SRD7 * SRD6 * /SRD5 
/SRD4 * /SRD3 * SRD2 * SRDl * /SRDO 



+ SRDll 



+ SRDll 



+ SRDll 



+ SRDll 



+ SRDll 



/FP_DETB = /FRM1_6B 

+ /SRDll * SRDIO * SRD9 * SRD8 * /SRD7 

* /SRD4 * /SRD3 * /SRD2 * SRDl 
/SRDIO * SRD9 * SRD8 * SRD7 * 
SRD4 * /SRD3 * /SRD2 * /SRDl 
SRDIO * /SRD9 * SRD8 * SRD7 * 

* /SRD4 * SRD3 * /SRD2 * /SRDl 
/SRDll * SRDIO * SRD9 * /SRD8 * SRD7 

* /SRD4 * /SRD3 * SRD2 * /SRDl 
/SRDll * /SRDIO * SRD9 * SRD8 * /SRD7 

* SRD4 * /SRD3 * /SRD2 * SRDl * 
/SRDll * /SRDIO * /SRD9 * SRD8 * SRD7 

* SRD4 * SRD3 * /SRD2 * /SRDl * 



* /SRD6 * SRD5 

* SRDO 

/SRD6 * /SRD5 

* SRDO 

SRD6 * /SRD5 

* /SRDO 

* SRD6 * SRD5 

* /SRDO 

* SRD6 * SRD5 
/SRDO 

* /SRD6 * SRD5 
SRDO 



FRM 2 
FRM 3 
FRM 4 
FRM 5 
FRM 6 

FRM 1-6 
FRM 7 

FRM 8 

FRM 9 

FRM 10 

FRM 11 

FRM 12 



SIMULATION 

TRACE_ON 

SOF SRDll SRDIO SRD9 SRD8 SRD7 SRD6 SRD5 SRD4 SRD3 SRD2 SRDl SRDO 

FP_DETB FRMIB FRM1_6B 

SETF SOF 



SETF 

SRDll /SRDIO /SRD9 /SRD8 SRD7 SRD6 /SRD5 SRD4 SRD3 
SETF 

/SRDll SRDIO /SRD9 /SRD8 /SRD7 SRD6 SRD5 /SRD4 SRD3 
SETF 

/SRDll /SRDIO SRD9 /SRD8 /SRD7 /SRD6 SRD5 SRD4 /SRD3 
SETF 

SRDll /SRDIO /SRD9 SRD8 /SRD7 /SRD6 /SRD5 SRD4 SRD3 
SETF 

SRDll SRDIO /SRD9 /SRD8 SRD7 /SRD6 /SRD5 /SRD4 SRD3 
SETF 

SRDll SRDIO SRD9 /SRD8 /SRD7 SRD6 /SRD5 /SRD4 /SRD3 
SETF 

/SRDll SRDIO SRD9 SRD8 /SRD7 /SRD6 SRD5 /SRD4 /SRD3 
SETF 

SRDll /SRDIO SRD9 SRD8 SRD7 /SRD6 /SRD5 SRD4 /SRD3 
SETF 

SRDll SRDIO /SRD9 SRD8 SRD7 SRD6 /SRD5 /SRD4 SRD3 
SETF 
/SRDll SRDIO SRD9 /SRD8 SRD7 SRD6 SRD5 /SRD4 /SRD3 





; FRAME 1 


SRD2 


/SRDl /SRDO 




; FRAME 2 


SRD2 


SRDl /SRDO 




; FRAME 3 


SRD2 


SRDl SRDO 




; FRAME 4 


/SRD2 


SRDl SRDO 




; FRAME 5 


SRD2 


/SRDl SRDO 




; FRAME 6 


SRD2 


SRDl /SRDO 




; FRAME 7 


/SRD2 


SRDl SRDO 




; FRAME 8 


/SRD2 


/SRDl SRDO 




; FRAME 9 


/SRD2 


/SRDl /SRDO 




; FRAME 10 


SRD2 


/SRDl /SRDO 
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SETF ; FRAME 11 

/SRDll /SRDIO SRD9 SRD8 /SRD7 SRD6 SRD5 SRD4 /SRD3 /SRD2 SRDl /SRDO 
SETF ; FRAME 12 

/SRDll /SRDIO /SRD9 SRD8 SRD7 /SRD6 SRD5 SRD4 SRD3 /SRD2 /SRDl SRDO 



SETF 
















; NOT 


/SRDll /SRDIO 
SETF 


/SRD9 


/SRD8 


/SRD7 


/SRD6 


/SRD5 


/SRD4 


/SRD3 


/SRD2 
; NOT 


/SRDll /SRDIO 
SETF 


SRD9 


/SRD8 


/SRD7 


/SRD6 


/SRD5 


/SRD4 


/SRD3 


/SRD2 
; NOT 


/SRDll /SRDIO 
SETF 


/SRD9 


/SRD8 


/SRD7 


/SRD6 


/SRD5 


/SRD4 


/SRD3 


/SRD2 

; NOT 


/SRDll SRDIO 
SETF 


SRD9 


/SRD8 


SRD7 


/SRD6 


/SRD5 


/SRD4 


/SRD3 


/SRD2 
; NOT 


SRDll SRDIO 


SRD9 


SRD8 


/SRD7 


SRD6 


SRD5 


SRD4 


SRD3 


SRD2 


SETF 
















; NOT 


SRDll SRDIO 


SRD9 


SRD8 


SRD7 


SRD6 


SRD5 


SRD4 


SRD3 


SRD2 


TRACE OFF 



















FRAMING PATTERN 
/SRDl /SRDO 
FRAMING PATTERN 
/SRDl /SRDO 
FRAMING PATTERN 
/SRDl /SRDO 
FRAMING PATTERN 
/SRDl /SRDO 
FRAMING PATTERN 

SRDl SRDO 
FRAMING PATTERN 

SRDl SRDO 
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PALASM SIMULATION, V2.22 - MARKET RELEASE (11-19-86) 
(C) - COPYRIGHT MONOLITHIC MEMORIES INC, 1986 
PALASM SIMULATION SELECTIVE TRACE LISTING 



Title : FRAME_DETECTION_PAL Author 
Pattern : Tl FRAME DETECTION PAL FCompany 
Revision : PI. 04 Date 



STEVE PATTERSON AND THER 
6/4/87 



FPD 






Page : 


1 






gggggggggg 


ggggggggg 


SOF 


HHHHHHHHHH 


HHHHHHHHH 


SRDll 


XHLLHHHLHH 


LLLLLLLHH 


SRDIO 


XLHLLHHHLH 


HLLLLLHHH 


SRD9 


XLLHLLHHHL 


HHLLHLHHH 


SRD8 


XLLLHLLHHH 


LHHLLLLHH 


SRD7 


XHLLLHLLHH 


HLHLLLHLH 


SRD6 


XHHLLLHLLH 


HHLLLLLHH 


SRD5 


XLHHLLLHLL 


HHHLLLLHH 


SRD4 


XHLHHLLLHL 


LHHLLLLHH 


SRD3 


XHHLHHLLLH 


LLHLLLLHH 


SRD2 


XHHHLHHLLL 


HLLLLLLHH 


SRDl 


XLHHHLHHLL 


LHLLLLLHH 


SRDO 


XLLHHHLHHL 


LLHLLLLHH 


FP DETB 


XLLLLLLLLL 


LLLHHHHHH 


FRMIB 


XLHHHHHHHH 


HHHHHHHHH 


FRMl 6B 


XLLLLLLHHH 


HHHHHHHHH 
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TITLE SYNC_PAL 

PATTERN Tl FRAME SYNC PAL FOR Tl INTERFACE 

REVISION PI. 06 

AUTHOR STEVE PATTERSON AND THERESA SHAFER 

COMPANY 

DATE 6/4/87 

; This PAL device decides whether the Tl Interface is in Frame Sync, 
; provisional Sync, or Out of Sync. It controls the Frame Sync 
; process. 

CHIP SYNC PAL20R4 



;PINS 

;l 2 

T1_CKB RSTB 

;9 10 

BCO SOF 



713 
OEB 

;2i 

BYT CK 



14 
NC 
22 
BIT 



3 4 

N0_T1CB DATA 

11 12 

NC GND 

15 16 

NO_FRM SUNK 

23 24 

NC VCC 



5 
FPDIB 



17 
LDB 



6 
FPD2B 



18 
QO 



7 
BC2 



19 
Ql 



8 
BCl 



20 
Q2 



INPUTS : T1_CKB 
RSTB 
N0_T1CB 
FPD(1-2)B 
BC(2-0) 
SOF 
DATA 
OEB 

OUTPUTS : BIT 
LDB 
SUNK 
NO_FRM 
Q(2-0) 
BYT CK 



ACTIVE LOW EXTERNAL Tl CLOCK 

ACTIVE LOW MASTER RESET 

NO Tl SIGNAL ACTIVE LOW INPUT 

ACTIVE LOW FRAME PATTERN DETECT INPUTS 

193 COUNTER 3 LSBs INPUTS USED FOR BYT_CK 

LAST KNOWN START OF FRAME 

INPUT DATA STREAM 

ACTIVE LOW OUTPUT ENABLE INPUT 

NEXT Tl BIT STREAM 

ACTIVE LOW LOAD COUNTER SIGNAL 

IN SYNC STATE AND NO FRAME BIT ERROR 

FRAME LOST, IN SEARCH STATE 

STATE VARIABLES 

8 -BIT CHANNEL SAMPLE CLOCK 



EQUATIONS 



; STATE MACHINE OUTPUTS AND STATE VARIABLES 



/BIT = DATA * /Q2 * /Ql * QO 
+ DATA * /Q2 * /Ql * QO 
+ /DATA * /SOF 
+ /DATA * /FPDIB * /FPD2B 
+ /DATA * Q2 
+ /DATA * Ql 
+ /DATA * /QO 



* FPDIB * SOF 

* FPD2B * SOF 



/LDB := Q2 * Ql * QO * /FPDIB * /FPD2B + SOF 
/SUNK = /Q2 * /Ql * /QO + Ql * QO 



IN FRAME SYNC 
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/NO_FRM = /Q2 + /Ql + /QO 
USING THE FOLLOWING STATE ASSIGNMENTS 



SEARCHING FOR FRAME SYNC 



A 


= 


001 


B 


= 


010 


C 


= 


110 


D 


= 


100 


E 


= 


101 


F 


= 


111 


G 


= 


oil 


H 


= 


000 



IN SYNC STATE 

SINGLE FRAME BIT ERROR STATE 



SEARCHING STATE 
SEARCHING STATE 
UNUSED STATE - GOES TO F 



/Q2 := /FPD2B * /FPDIB * Ql * QO * RSTB * N0_T1CB 
+ /Q2 * Ql * /SOF * RSTB * N0_T1CB 
+ /Q2 * /Ql * QO * RSTB * N0_T1CB 
+ /FPD2B * /FPDIB * QO * SOF * RSTB * N0_T1CB 

/Ql := /Ql * QO * /SOF * RSTB * N0_T1CB 
+ Q2 * /Ql * /SOF * RSTB * N0_T1CB 

+ /FPD2B * /FPDIB * Q2 * /QO * SOF * RSTB * N0_T1CB 
+ /FPD2B * /FPDIB * Q2 * /Ql * RSTB * N0_T1CB 
+ /FPD2B * /FPDIB * /Q2 * QO * SOF * RSTB * N0_T1CB 

/QO := Ql * /QO * /SOF * RSTB * N0_T1CB 
+ Q2 * /QO * /SOF * RSTB * N0_T1CB 
+ FPDIB * /Q2 * /Ql * QO * SOF * RSTB * N0_T1CB 
+ FPD2B * /Q2 * /Ql * QO * SOF * RSTB * N0_T1CB 
+ /FPD2B * /FPDIB * Ql * /QO * RSTB * N0_T1CB 

; BYTE CLOCK 



/BYT_CK = BC2 + BCl + BCO 



SIMULATION 

TRACE_ON T1_CKB RSTB N0_T1CB DATA FPDIB FPD2B SOF 
Q2 Ql QO BIT LDB SUNK NO_FRM 
BYT CK BC2 BCl BCO 



SETF /OEB 

/RSTB /N0_T1CB 
CLOCKF Tl CKB 



; ENABLE OUTPUT 
; RESET REGISTERS 



SETF RSTB N0_T1CB 
DATA 

/BC2 /BCl /BCO 
FPDIB FPD2B SOF 

CLOCKF Tl CKB 



; RESET REGISTERS 

; STATE MACHINE - GO TO STATE F 



SETF FPDIB /FPD2B SOF 
CLOCKF Tl CKB 



; LOOPS AT STATE F 
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SETF /FPDIB FPD2B SOF 
CLOCKF Tl CKB 



; LOOPS AT STATE F 



SETF FPDIB /FPD2B /SOF 
CLOCKF T1_CKB 

SETF /FPDIB FPD2B /SOF 
CLOCKF Tl CKB 



; LOOPS AT STATE F 



; LOOPS AT STATE F 



SETF FPDIB FPD2B /SOF 
CLOCKF T1_CKB 

SETF /FPDIB /FPD2B SOF 
CLOCKF Tl CKB 



; LOOPS AT STATE F 



; GO TO STATE G 



SETF FPDIB FPD2B SOF 

/BC2 /BCl BCO 
CLOCKF T1_CKB 

SETF /FPDIB /FPD2B 
/BC2 BCl /BCO 
CLOCKF Tl CKB 



; GO TO STATE F 



; GO TO STATE G 



SETF /SOF 

/BC2 BCl BCO 
CLOCKF T1_CKB 
SETF FPDIB /FPD2B SOF 

BC2 /BCl /BCO 
CLOCKF T1_CKB 

SETF /FPDIB /FPD2B /SOF 

BC2 /BCl BCO 
CLOCKF T1_CKB 
SETF /FPDIB /FPD2B SOF 

BC2 BCl /BCO 
CLOCKF Tl CKB 



; LOOP AT STATE G 



; GO TO STATE F 



; GO TO STATE G 



; GO TO STATE A 



SETF /SOF 

BC2 BCl BCO 
CLOCKF T1_CKB 
SETF /FPDIB /FPD2B SOF 

/BC2 /BCl /BCO 
CLOCKF Tl CKB 



; LOOP AT STATE A 



; LOOP AT STATE A 



SETF /FPDIB FPD2B SOF 

CLOCKF T1_CKB 

SETF FPDIB FPD2B /SOF 

/BC2 /BCl BCO 
CLOCKF T1_CKB 
SETF /FPDIB /FPD2B SOF 
CLOCKF Tl CKB 



; GO TO STATE B 

; LOOP AT STATE B 
; GO TO STATE C 



SETF /SOF 
CLOCKF Tl CKB 



; LOOP AT STATE C 
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SETF /FPDIB /FPD2B SOF 
CLOCKF T1_CKB 

SETF /SOF 

CLOCKF T1_CKB 

SETF /FPDIB /FPD2B SOF 

CLOCKF T1_CKB 

SETF /SOF 

CLOCKF T1_CKB 

SETF /FPDIB /FPD2B SOF 

CLOCKF T1_CKB 

SETF FPDIB /FPD2B SOF 
CLOCKF T1_CKB 
SETF FPDIB /FPD2B SOF 
CLOCKF T1_CKB 

SETF /FPDIB /FPD2B /SOF 

CLOCKF T1_CKB 

SETF /FPDIB /FPD2B SOF 

CLOCKF T1_CKB 

SETF /FPDIB FPD2B SOF 

CLOCKF T1_CKB 

SETF /FPDIB /FPD2B SOF 

FOR I:=l TO 1 DO BEGIN 

CLOCKF T1_CKB 
END 

SETF FPDIB SOF 
CLOCKF T1_CKB 

SETF /FPDIB /FPD2B /SOF 

CLOCKF T1_CKB 

SETF /FPDIB /FPD2B SOF 

CLOCKF T1_CKB 

SETF /FPDIB FPD2B SOF 

CLOCKF T1_CKB 

SETF /FPDIB /FPD2B SOF 

FOR I:=l TO 2 DO BEGIN 

CLOCKF T1_CKB 
END 

SETF FPDIB /SOF 
CLOCKF T1_CKB 
SETF FPDIB SOF 
CLOCKF T1_CKB 

SETF /FPDIB /FPD2B /SOF 
CLOCKF T1_CKB 
SETF /FPDIB /FPD2B SOF 
CLOCKF T1_CKB 
SETF /FPDIB FPD2B SOF 
CLOCKF T1_CKB 
SETF /FPDIB /FPD2B SOF 
FOR I:=l TO 3 DO BEGIN 
CLOCKF Tl CKB 



; GO TO STATE D 

; LOOP AT STATE D 
; GO TO STATE E 

; LOOP AT STATE E 
; GO TO STATE A 

; GO TO STATE B 
; GO TO STATE F 

; GO TO STATE G 
; GO TO STATE A 
; GO TO STATE B 

; GO TO STATE C 

; GO TO STATE F 

; GO TO STATE G 
; GO TO STATE A 
; GO TO STATE B 

; GO TO STATES C,D 

; LOOP AT STATE D 
; GO TO STATE F 

; GO TO STATE G 
; GO TO STATE A 
; GO TO STATE B 

; GO TO STATES C,D,E 
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END 

SETF FPDIB SOF 

CLOCKF T1_CKB 

SETF /DATA 

SETF /FPDIB /FPD2B /SOF 

CLOCKF T1_CKB 

SETF /FPDIB /FPD2B SOF 

CLOCKF T1_CKB 

SETF /FPDIB FPD2B SOF 

CLOCKF T1_CKB 

SETF /FPDIB /FPD2B SOF 

FOR I:=l TO 4 DO BEGIN 

CLOCKF T1_CKB 
END 

SETF FPDIB SOF 
CLOCKF T1_CKB 
TRACE OFF 



GO TO STATE F 
INVERT DATA 
GO TO STATE G 
GO TO STATE A 
GO TO STATE B 

; GO TO STATES C,D,E,A 

; GO TO STATE B 
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PALASM SIMULATION, V2.22 - MARKET RELEASE (1:1-19-36) 
■;C) - COPYRIGHT MONOLITHIC MEMORIES INC, 1936 
PALASM SIMULATION SELECTIVE TRACE LISTING 



Title : SYNC.,.PAL Author 

Pflttern : Tl FRAME SYNC PAL FOR Tl Company 
R G V i s ion : F' 1 . 6 D a t b 



STEVE PATTERSON AND THER 



6/4/87 



SYNC 
Page 



T1...CKB 

RSTB 

N0...T1CB 

DATA 

FPDIB 

FPD2B 

SOF 

02 

Dl 

DO 

BIT 

LDB 

SLINK 

ND..FRM 

BYT..CK 

BC2 

BCl 

BCO 



Q >^Q c 


Q (=q cc 


? (= 


g eg eg eg c 


Q c 


9 ^■ 


1 1 n M 1 1 1 1 1 1 1 1 


III 


MilM 

:n:n 


MM 


i<MMM 


MM 


' ' 








I I 




; ; 


1 ; 






, 1 , , 
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SYNC 
PaoG 



T 1 .... CKB 
RSTB 

NO TICB 

DATA 

FPDIB 

FPD2B 

SOF 

Q2 

Ql 

DO 

BIT 

LDB 

SUNK 

NO...FRM 
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SYNC 
F'aqe 



Tl CKB 

RSTB 

NO TICB 

DATA 

FPDIB 

FPD2B 

SGF 

Q2 

Ql 

QO 

BIT 

LDB 

SUNK 

NO._FRM 

BYT...CK 

BC2 

BCl 

BCO 
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3YNC 
F'age : 4 



T1...CKB 

RGTB 

N0_.T1CB 

DATA 

FPDIB 

Ff='D2B 

SOF" 

(32 

Ql 

QO 

BIT 

LDB 
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BYT...CK 
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BCl 

BCD 
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SYNC 
Paqe : 



T1._CKB 

N0..T1CB 

DATA 

FPDIB 

FPD2B 

SDF 

D2 

Ql 

QO 

BIT 

LDB 

SUNK 

ND_FRM 

BYT._CK 

BC2 

BCl 

BCO 
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SYNC 
F'aqe 



CQ 



Tl ..CKB 

RSTB 

Na...TlCB 

DATA 
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SOF 
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PALASM SIMULATION, V2.22 - MARKET RELEASE (11-19-86) 
(C) - COPYRIGHT MONOLITHIC MEMORIES INC, 1986 
PALASM SIMULATION SELECTIVE TRACE LISTING 



Title : SYNC_PAL Author 

Pattern : Tl FRAME SYNC PAL FOR TlCompany 
Revision : PI. 06 Date 



STEVE PATTERSON AND THER 



6/4/87 



SYNC 
Page : 

T1_CKB 
RSTB 



g eg eg 
XHHLLHHLLH 
LLLLHHHHHH 
NO TICB LLLLHHHHHH 



DATA 

FPDIB 

FPD2B 

SOF 

Q2 

Ql 

QO 

BIT 

LDB 

SUNK 

NO_FRM 

BYT_CK 

BC2 

BCl 

BCO 



XXXXHHHHHH 
XXXXHHHHHH 
XXXXHHHHLL 
XXXXHHHHHH 
XXHHHHHHHH 
XXHHHHHHHH 
XXHHHHHHHH 
XXXXHHHHHH 
XXXXXXLLLL 
XXLLLLLLLL 
XXHHHHHHHH 
XXXXHHHHHH 
XXXXLLLLLL 
XXXXLLLLLL 
XXXXLLLLLL 



eg eg eg 

LLHLLHHLLH 

HHHHHHHHHH 

HHHHHHHHHH 

HHHHHHHHHH 

HLLLHHHHLL 

LHHHLLLLHH 

HHHHLLLLLL 

HHHHHHHHHH 

HHHHHHHHHH 

HHHHHHHHHH 

HHHHHHHHHH 

LLLLLLHHHH 

LLLLLLLLLL 

HHHHHHHHHH 

HHHHHHHHHH 

LLLLLLLLLL 

LLLLLLLLLL 

LLLLLLLLLL 



eg eg eg 

LLHLLHHLLH 

HHHHHHHHHH 

HHHHHHHHHH 

HHHHHHHHHH 

LHHHLLLLHH 

HHHHLLLLHH 

LLLLHHHHHH 

HHHHHHLLLL 

HHHHHHHHHH 

HHHHHHHHHH 

HHHHHHHHHH 

HHHHHHLLLL 

LLLLLLLLLL 

HHHHHHLLLL 

HHHHHHHHLL 

LLLLLLLLLL 

LLLLLLLLLL 

LLLLLLLLHH 



eg eg e 
HLLHHLLHHL 
HHHHHHHHHH 
HHHHHHHHHH 
HHHHHHHHHH 
HHLLLLLLLL 
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HHHHHHHHHH 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLHH 
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LLLLLLLLLL 
LLHHHHHHHH 
HHLLLLHHHH 



SYNC 
Page : 

T1_CKB 

RSTB 

N0_T1CB 

DATA 

FPDIB 

FPD2B 

SOF 

Q2 

Ql 

QO 

BIT 

LDB 

SUNK 

NO_FRM 

BYT_CK 

BC2 

BCl 

BCO 



g eg eg 

LHHLLHHLLH 

HHHHHHHHHH 

HHHHHHHHHH 

HHHHHHHHHH 
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LLLLLLLLLL 
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LLLLLLLLLL 

LLHHHHLLLL 

LLLLLLLLLL 
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LLLLLLLLHH 

LLLLHHHHLL 



eg eg e 
HLLHHLLHHL 
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LLLLLLLLLL 
HHLLLLHHHH 
LLLLLLLLLL 
LLLLLLLLLL 
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LLLLHHHHLL 
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HHHHHHHHHH 
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SYNC 
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DATA 
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SYNC 
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DATA 
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LLLLLLLLLL 
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FPDIB 
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LLLLLLLLLL 


FPD2B 


HHLLLLLLLL 


LLLLLLLLLL 


LLLLLHHHHL 


LLLLLLLLLL 


SOF 


HHHHHHHHHH 


HHHHHHHLLL 


LHHHHHHHHH 


HHHHHHHHHH 


Q2 


LLLLHHHHHH 


HHHHHHHHHL 


LLLLLLLLLL 


LHHHHHHHHH 


Ql 


HHHHHHHLLL 


LLLLHHHHHH 


HHHLLLLHHH 


HHHHLLLLLL 


QO 


LLLLLLLLLL 


HHHHHHHHHH 
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BIT 
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LLLLLLLLLL 
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BYT CK 
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DATA 


LLLLLL 








FPDIB 


LLHHHH 
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TITLE SUPER_FRAME_PAL 

PATTERN SUPER FRAME PAL FOR Tl INTERFACE 

REVISION PI. 03 

AUTHOR STEVE PATTERSON AND THERESA SHAFER 

COMPANY 

DATE 6/4/87 

This PAL device counts the Tl Frames and controls the Signal Bits 
extraction process, including Fly Wheeling. It also provides variou 
other signals which indicate the frames with signal bits. 
The counter is reset with either RSTB or when frame detection is SUNK 
and frame 1 occurs from two different sources (FRMl & SOF) . 



CHIP SUPER FRAME PAL16R6 



; PINS 
;l 2 
Tl CKB RSTB 



FRMIB 



SUNK 



SOF 



NC 



NC 



NC 



NC 



10 
GND 



;ii 

OEB 



12 

NC 



13 
Q3 



INPUTS :T1_CKB 
RSTB 
SOF 

FRMIB 

SUNK 
OEB 

OUTPUTS :Q (3-0) 
FRM_6 
FRM 12 



14 
Q2 



15 
Ql 



16 
QO 



17 
FRM 6 



18 

FRM 12 



19 
NC 



20 
VCC 



ACTIVE LOW EXTERNAL Tl CLOCK 

ACTIVE LOW MASTER RESET 

ACTIVE HIGH INPUT SIGNAL INDICATING 

LAST KNOWN START OF FRAME 
ACTIVE LOW INPUT SIGNAL INDICATING 

START OF FRAME 1 
ACTIVE HIGH INPUT SIGNAL INDICATING "IN FRAME SYNC" 
ACTIVE LOW OUTPUT ENABLE INPUT 

STATE VARIABLES 

CLOCK SIGNAL WHICH INDICATES SIGNAL BIT A 

CLOCK SIGNAL WHICH INDICATES SIGNAL BIT B 



EQUATIONS 



/Q3 := /Q2 * Ql * QO 
+ /Q3 * /Q2 
+ /Q3 * /Ql 
+ /Q3 * /QO 
+ /FRMIB * SOF * SUNK 
+ /RSTB 



r RESET WHEN SUNK AND FRAME 1 
MASTER RESET 



OCCURS 



/Q2 := Q2 * Ql * QO 
+ /Q2 * Q3 
+ /Q2 * /Ql 
+ /Q2 * /QO 
+ /FRMIB * SOF * SUNK 
+ /RSTB 
+ /RSTB 



RESET WHEN SUNK AND FRAME 1 OCCURS 
MASTER RESET 

MASTER RESET 
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/Ql := Ql * QO 

+ /Ql * /QO 

+ /FRMIB * SOF * SUNK 

/QO := QO 

+ /FRMIB * SOF * SUNK 
+ /RSTB 



; RESET WHEN SUNK AND FRAME 1 OCCURS 

; RESET WHEN SUNK AND FRAME 1 OCCURS 
; MASTER RESET 



; SIGNAL BIT EXTRACTION OUTPUTS 

/FRM_6 := Q3 + /Q2 + Ql + QO 

/FRM_12 := /Q3 + Q2 + /Ql + QO 

SIMULATION 

TRACE_ON T1_CKB RSTB FRMIB SOF SUNK 
Q3 Q2 Ql QO FRM_6 FRM_12 

SETF /OEB 

/RSTB 
CLOCKF T1_CKB 
SETF RSTB /SOF FRMIB SUNK 
CLOCKF T1_CKB 

FOR I:=l TO 24 DO 
BEGIN 

CLOCKF T1_CKB 
END 

SETF /SUNK SOF /FRMIB 

CLOCKF T1_CKB 

SETF /SUNK /SOF /FRMIB 

CLOCKF T1_CKB 

SETF /SUNK SOF FRMIB 

CLOCKF T1_CKB 

SETF /SUNK /SOF FRMIB 

CLOCKF T1_CKB 

SETF SUNK SOF /FRMIB 

CLOCKF T1_CKB 

SETF SUNK /SOF /FRMIB 

CLOCKF T1_CKB 

SETF SUNK /SOF FRMIB 

CLOCKF T1_CKB 

SETF SUNK SOF FRMIB 

CLOCKF T1_CKB 

TRACE OFF 



ENABLE OUTPUT 
RESET REGISTERS 



COUNT 



RESET CONDITIONS 
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PALASM SIMULATION, V2.22 - MARKET RELEASE (11-19-86) 
(C) - COPYRIGHT MONOLITHIC MEMORIES INC, 1986 
PALASM SIMULATION SELECTIVE TRACE LISTING 



Title : SUPER_FRAME_PAL Author 
Pattern : SUPER FRAME PAL FOR Tl ICompany 
Revision : PI. 03 Date 



STEVE PATTERSON AND THER 



6/4/87 



SUPER FRAME 



Page 



g eg c 
T1_CKB XHHLLHHLHH 
RSTB LLLLHHHHHH 
FRMIB XXXXHHHHHH 
SOF XXXXLLLLLL 
SUNK XXXXHHHHHH 
Q3 XXLLLLLLLL 
Q2 XXLLLLLLLL 
Ql XXLLLLLLLH 
QO XXLLLLHHHL 
FRM_6 XXXXXXLLLL 
FRM 12 XXXXXXLLLL 



c c c c 
LHHLHHLHHL 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
HHHHHHHHHH 
LLLLLLLLLL 
LLLLLHHHHH 
HHHHHLLLLL 
LLHHHLLLHH 
LLLLLLLLHH 
LLLLLLLLLL 



c c c 
HHLHHLHHLH 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
HHHHHHHHHH 
LLLLLLLHHH 
HHHHHHHLLL 
LHHHHHHLLL 
HLLLHHHLLL 
HLLLLLLLLL 
LLLLLLLLLL 



c c c 
HLHHLHHLHH 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
HHHHHHHHHH 
HHHHHHHHHL 
LLLLLLLLLL 
LLLHHHHHHL 
HHHLLLHHHL 
LLLLLLLLLL 
LLLLLLHHHL 



SUPER_FRAME 

Page : 2 

c c c c 
T1_CKB LHHLHHLHHL 
RSTB HHHHHHHHHH 
FRMIB HHHHHHHHHH 
SOF LLLLLLLLLL 
SUNK HHHHHHHHHH 
Q3 LLLLLLLLLL 
Q2 LLLLLLLLLL 
Ql LLLLLHHHHH 
QO LLHHHLLLHH 
FRM_6 LLLLLLLLLL 
FRM 12 LLLLLLLLLL 



c c c 
HHLHHLHHLH 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
HHHHHHHHHH 
LLLLLLLLLL 
LHHHHHHHHH 
HLLLLLLHHH 
HLLLHHHLLL 
LLLLHHHLLL 
LLLLLLLLLL 



c c c 
HLHHLHHLHH 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
HHHHHHHHHH 
LLLHHHHHHH 
HHHLLLLLLL 
HHHLLLLLLH 
HHHLLLHHHL 
LLLLLLLLLL 
LLLLLLLLLL 



c c c c 
LHHLHHLHHL 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
HHHHHHHHHH 
HHHHHLLLLL 
LLLLLLLLLL 
HHHHHLLLLL 
LLHHHLLLHH 
LLLLLLLLLL 
LLHHHLLLLL 
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Principles of CRC-6 

In digital transmission, bits are grouped into frames. North 
America's standard, T-carrier uses a framing bit to group data 
into frames. In T1 extended-superf rame (ESF or Fe), the frame- 
bit position carries additional information besides framing. One 
such extended functionality provides transmission line error 
detection. I n transmission channels, most errors are bursty in 
nature; the cyclic-redundancy-checking (CRC) codes provide 
sufficient detection for these types of errors. With CRC, the 
block length and the polynomial determine the type and percent- 
age of error w/hich can be detected. For ESF, the block length is 
one extended-superframe or 4,632 bits. The T1 Fe standard 
specifies the CRC-6 polynomial. The CRC-6 = X^ + X + 1 
polynomial detects 98.4% of one or more errors. 



output as a parallel word. The CHECK_BIT signal enables the 
remainder to be shifted out serially; it must be held HIGH until 
the entire 6-bit remainder has been shifted out. When CHECK_ 
BIT = 0, CRC6 echoes the serial data input stream. CB1-CB6are 
the parallel remainder bits where CB1 is the most significant bit 
and CB6 is the least significant bit. 

There are two implementations for error detection at the 
receiver. First, the transmitted CRC-6 value is fed serially along 
with the data. If a non-zero remainder is calculated, an error has 
occurred. In this case, the ERROR flag indicates at least one 
transmission error. Second, the CRC-6 for the extended- 
superframe can be recalculated and externally compared with 
the transmitted CRC-6 value. 



Functional Description 

Figure 1 shows a simple configuration of XOR-gates and 
registers which calculates the CRC-6 polynomial. 




Hg)* * ^ * ^ 



Figure 1. CRC-6 Block Diagram 



The low-power PALC16R6Z implements the CRC-6 polynomial 
for a serial data stream (see Figure 1 ). At the beginning of CRC 
generation, the remainder is preset to all zeros by enabling the 
synchronous CLR. After shifting in the 4,632-bit superframe, the 
result is the CRC-6 remainder. This remainder is transmitted in 
the check-bit positions of the subsequent superframe. The 
remainder can be shifted out serially, most significant bit first or 




Figure 2. CRC-6 Configuration 



PAL® is a registered trademark of Monolithic Memories. 
ZPAL" is a trademarl< of Monolithic Memories. 
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PAL® Device Design Specification 

TITLE CRC6 

PATTERN CRC 6 ERROR DETECTION PAL 

REVISION 1.01 

AUTHOR THERESA SHAFER 

COMPANY MMI 

DATE 9/10/86 

CHIP CRC6 PAL16R6 

CK DATA /CLR CHECK_BIT EOX NC NC NO NO GND 
/OE ERROR CBl CB2 CB3 CB4 CBS CB6 CRC6 VCC 

THE CRC- 6 PAL PERFORMS ERROR DETECTION ON A SERIAL DATA 
STREAM. CRC-6 PAL SUPPORTS THE Tl Fe STANDARD FOR ERROR 
DETECTION. THE CRC RESULT CAN BE OUTPUT EITHER IN SERIAL 
OR IN PARALLEL. 

CRC-6 = -X**6 + X + 1 



INPUTS: CK EXTERNAL CLOCK 

/OE ACTIVE LOW OUTPUT ENABLE SIGNAL 

/CLR ACTIVE LOW CLEAR SIGNAL V7HICH RESETS THE 

CRC 
DATA SERIAL DATA STREAM INPUT 
CHECK_BIT ACTIVE HIGH SELECT INPUT WHICH SWITCHES 

FROM SERIAL DATA STREAM TO CHECK BITS 
EOX ACTIVE HIGH SIGNAL WHICH INDICATES END OF 

EXTENDED SUPERFRAME AND ENABLES THE 
ERROR DETECTION FLAG 



OUTPUTS: CRC6 CRC SERIAL OUTPUT 

ERROR ACTIVE HIGH OUTPUT FLAG WHICH INDICATES 
A TRANSMISSION ERROR 

CBl - CB6 REMAINDER BITS (CHECK BITS) 
CBl - MOST SIGNIFICANT BIT 
CB6 - LEAST SIGNIFICANT BIT 



EQUATIONS 

; CRC-6 PAL 

/CRC 6 = /CBl * CHECK_BIT 
+ /DATA * /CHECK_BIT 

/ERROR = EOX * /CBl * /CB2 * /CB3 * /CB4 * /CB5 * /CB6 
+ /EOX 

/CBl := /CB2 + CLR 

/CB2 := /CB3 + CLR 

/CB3 := /CB4 + CLR 

/CB4 := /CB5 + CLR 
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/CB5 := /DATA * /CBl * /CB6 
+ DATA * CBl * /CB6 
+ DATA * /CBl * CB6 * /CHECK_BIT 
+ /DATA * CBl * CB6 * /CHECK_BIT 
+ /CB6 * CHECK_BIT 
+ CLR 

/CB6 := DATA * CBl * /CHECK_BIT 
+ /DATA * /CBl * /CHECK_BIT 
+ /CBl * CHECK_BIT 
+ CLR 



SIMULATION 

TRACE_ON CK /OE /CLR CHECK_BIT EOX DATA CRC6 ERROR 
CBl CB2 CB3 CB4 CBS CBS 

SETF OE ; ENABLE OUTPUT 

CLR ; CLEAR SHIFT REGISTER 

CLOCKF CK 

; INITIALIZE 
SETF DATA 

/CHECK_BIT 

/EOX 

/CLR 
CLOCKF CK 

; FIND PERIOD 

SETF /DATA 

FOR J:= TO 70 DO 

BEGIN 

CLOCKF CK 

END 

; CALULATE CRC-6 FOR 1110010101 

SETF CLR ; CLEAR SHIFT REGISTER 

DATA 
CLOCKF CK 
SETF /CLR 

DATA 
CLOCKF CK 
SETF DATA 
CLOCKF CK 
SETF DATA 
CLOCKF CK 
SETF /DATA 
CLOCKF CK 
SETF /DATA 
CLOCKF CK 
SETF DATA 
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CLOCKF CK 
SETF /DATA 
CLOCKF CK 
SETF DATA 
CLOCKF CK 
SETF /DATA 
CLOCKF CK 
SETF DATA 
CLOCKF CK 

; CHECK ERROR DETECTION 

SETF EOX 7 CHECK ERROR FLAG 

SETF /DATA ; 001001 

CLOCKF CK 

SETF /DATA 

CLOCKF CK 

SETF DATA 

CLOCKF CK 

SETF /DATA 

CLOCKF CK 

SETF /DATA 

CLOCKF CK 

SETF DATA 

CLOCKF CK 

; READ CRC RESULT 
SETF /EOX 
SETF /DATA 
CLOCKF CK 
SETF DATA 
CLOCKF CK 
SETF CHECK_BIT 

DATA 
FOR J:= TO 6 DO 

BEGIN 

CLOCKF CK 

END 



TRACE OFF 
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Introduction 

The Logic Cell Array (LCA device) implements a multiplexer and 
counter used in time division multiplexing. With the device's 
flexible I/O pins, tine multiplexer and counter are implemented 
into a single CMOS device. This application note covers time 
division multiplexing and the design of a multiplexer and coun- 
ter. Additional information on how to design with an LCA device 
can be found in the LCA design methodology chapter in Mono- 
lithic Memories' LCA Design and Applications Handbook. For 
programming the LCA device, refer to "Configuring the LCA De- 
vice", Monolithic Memories' Application Note 182. 



Principles of i\/iuitiplexing 

Multiplexing efficiently utilizes data communication lines by 
combining multiple low-speed signals into a single, high-speed 
line. The two methods of performing multiplexing are Time Divi- 
sion Multiplexing (TDM) and Frequency Division Multiplexing 
(FDM). TDM divides the transmission bandwidth into equal time 
slots where each input signal is assigned one time slot per time 
cycle. Once assigned, that time slot is not used by any other 
input. Figure 1 shows a multiplexer combining three low-speed 
signals into one high-speed line. Terminal A transmits during 
the first time slot, B transmits during the second time slot, and 
C transmits during the third time slot. This sequence is repeat- 
ed every time cycle. FDM, on the other hand, divides the fre- 
quency spectrum among logical channels where each channel 
has full bandwidth of its assigned frequencies. Analog sys- 
tems usually use FDM. 




A B C A B 



Figure 1. Time Division IVIultipiexing 

There are many classes of TDMs including bit interleave, char- 
acter interleave, statistical TDM, and T1 multiplexers. In bit in- 
terleaved TDM, each input is assigned to one time slot. Each 
time slot's length is one bit. Character interleaved TDM is simi- 
lar to bit interleave, except that each time slot represents one 
character. Statistical multiplexing assigns the bandwidth into 
unequal slots where only the active incoming lines are as- 



signed slots. The slots are of variable length, so each input is 
assigned the amount of bandwidth needed. The T1 standard 
specifies twenty-four 64 kbps channels multiplexed on a 1 .544 
Mbps high-speed link. 

Data Selection/Time Slot 
Assignment 

To transmit data from terminal A in the first time slot as shown 
in Figure 1, data buffering, rate adaption and data selection 
must be performed. Data buffering and rate adaption are re- 
quired since the rates of the lines to be multiplexed are slower 
than the high-speed link. A serializing FIFO or shift register can 
implement the required buffering. Rate adaption is performed 
by a clock/shift scheme tailored to the exact application. Once 
buffered, the data must be selected in the proper order. Sever- 
al methods can be used for data selection; one method is a 32- 
to-1 multiplexer. To implement sequential selection of input 
lines, an on-board counter selects the multiplexer's output. 
However, for random selection needed in statistical multiplex- 
ing, more flexibility is needed. A 2-to-1 multiplexer provides 
this flexibility by selecting between the counter and random 
selection inputs. 

Comparison of Design Methodologies 

The 32-to-1 multiplexer and counter design can be implement- 
ed in several ways. Briefly, we will evaluate discrete logic, PAL 
and LCA device implementations. 

For this design, thirty-eight inputs, five registers, and one out- 
put are required. Conventional PLD devices do not meet this 
large I/O requirement. A single PLD device is not suited for this 
specific application because flexible I/O and buried registers 
are not supported. Instead, the design must be divided into 
four sections. The counter fits in a PALC16R6Z device and the 
32-to-1 multiplexer requires three PALC20L8Z devices. 

Since a large 32-to-1 multiplexer is not available as a discrete 
part, it must be built using smaller multiplexers. The total chip 
count, using discrete logic, is five devices: two 74AS850s, two 
74ALS257S and one 74AS867. A combination of PAL devices 
and discrete logic devices is possible, however, it is also a 
multiple chip solution. 

Monolithic Memories' LCA device is a high-density programma- 
ble CMOS circuit with flexible l/Os. It has forty pins which are 
user-defined as either inputs, outputs, or bidirectional. The 
LCA device meets the I/O requirement for this design. Since 
the LCA circuit allows multiple logic levels, implementing the 
counter does not use up output pins. Moreover an LCA device 
enables the 32-to-1 multiplexer with counter to be implemented 
in a single low-power device. 
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Detailed LCA Design 

The M2064 LCA device in a 48-pin DIP is used in this design, 
although both PLCC and PGA packages are available. Eight of 
the forty-eight pins are reserved for programming, power, and 
ground, and the remaining forty I/O pins are available to the 
user. Table 1 shows the efficient use of the package pinout. 



PIN 


DESCRIPTION 


TOTAL PINS 


ADDR(4:0) 


External Address Inputs 


5 inputs 


CK 


External Clock Input 


1 input 


LD 


Counter Load Input 


1 input 


D(31:0) 


32-to-1 Multiplexer 
Data Inputs. 


32 inputs 


OUT 


Output 


1 output 




TOTAL I/O 


40 pins 



Table 1. 32-to-1 Multiplexer Pins 



Figure 2 shows the block diagram of the 32-to-1 multiplexer and 
the counter. The select lines of the 32-to-1 multiplexer are ei- 
ther the address lines or the output of the counter. When LD is 
deasserted, the 5-bit counter sequentially selects each input. 
Asserting the LD signal loads the counter with the external ad- 
dresses. It also selects the external address for the 32-to-1 
multiplexer's select lines. This permits random input selection 
from the external address lines which supports statistical mul- 
tiplexing. 



INPUTS 


OUTPUT 


FUNCTION 


LD ADDR(4:0) 


OUT 


Counter as Select 
Address as Select 
and Load Counter 


XXXXX 

1 ADDR(4:0) 


D(CNT) 
D(ADDR) 



Table 2. 32-to-1 Multiplexer Function Table 

The design was entered with FutureNet's DASH schematic cap- 
ture package using high-level macros. Then the schematic 
capture file was translated into an LCA design file. The transla- 
tion software, called PIN2LCA, partitions the design into CLBs 
and generates the equations for the CLBs' configuration. The 
translation software produces an unrouted LCA design file. 
This design file requires final placement of CLBs and intercon- 
nect routing which was performed using Monolithic Memories' 
APR software, an automatic place and route program. After the 
design is placed and routed, it was optimized with Monolithic 
Memories' XACT Design Editor System. The XACT system pro- 
vides a graphic interface to manually optimize the CLB logic 
functions and connections. 

The logic functions and interconnects of an LCA device are es- 
tablished with CMOS memory cells, so the array is never physi- 
cally altered, although it is physically programmed. A program- 
mable LCA device can be reconfigured for the prototype. In ad- 
dition, it can be reprogrammed any number of times in the tar- 
get system. With reconfigurable devices such as this one, the 
array can also be programmed on power-up, or whenever the 
design details need to be changed. For volume production, a 
hard-array version will be available. 



LD — r 
CK 



5-BIT 
> COUNTER 



\ 2:1 MUX / 



32:1 
MUX 



Figure 2. Block Diagram 



The select signals for the 32-to-1 multiplexer are determined by 
tfie LD signal. When LD = 0, the counter selects the multiplex- 
er's output. When LD = 1 , the counter is loaded and the exter- 
nal address selects the output. Also, the LD asynchronously 
loads the 5-bit counter. Table 2 summarizes the device opera- 
tion. 



Entering the Design with Futurenet 

The schematic entered in FutureNet's DASH is comprised of 
different components called from the macro library supplied by 
Monolithic Memories. The M8-1 and M4-1 macros are the multi- 
plexers which implement the 32-to-1 multiplexer as shown in 
Figure 3. The 5-bit counter was built from INV, C16BPRD, 
X0R2, and FDM macros. Since the C16BPRD macro is only a 
4-bit parallel-load binary counter, the X0R2 and FDM macros 
were added to form another bit which increased the length to 
five bits.The GMUX macro, a 2-to-1 multiplexer, selects be- 
tween the counter and the external address lines. 

To translate from FutureNet to LCA, the PIN2LCA program is in- 
voked. This program generates an unrouted <filename>.LCA 
from the FutureNet's <filename>.PiN. The PIN2LCA software 
partitions the design into CLBs and generates each CLB's con- 
figuration. 

The PIN2LCA performs logic reduction by eliminating unused 
logic to maximize CLB utilization. With this design, for exam- 
ple, C16BPRD's reset logic is not used, and PIN2LCA elimi- 
nates all the reset logic in the C16BPRD macro. Another means 
to maximize CLB utilization is by combining macro logic. At 
times, this combined logic results in a single CLB which imple- 
ments logic from two different macros. For designs not utilizing 
all the CLBs, this logic reduction is not required and may actu- 
ally hinder placement and delay tradeoffs. If it does produce 
delay problems, it is possible to edit the CLBs in XACT. 
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At this stage in the design flow, the PIN2LCA software can 
generate a P-SILOS simulation file. Since the <filename>.SIM 
was generated from an unrouted LCA design file, the simulation 
only has unit delays and does not have actual routing delays. 
The simulation verifies the logic, but does not give any timing 
information. It may be necessary to make changes in DASH™ 
depending on the simulation results. 



Place and Route Using APR 

Once the logic is finalized, the LCA design must be placed and 
routed. Monolithic Memories' APR software performs the final 
placement of CLBs and interconnect routing. The input to APR 
is an unrouted <filename>.LCA, and a routed LCA design file 
and report document are the outputs. 

At first glance, the APR program seems cumbersome to apply 
to every application. However, it can be used to create an effi- 
cient design. While the software uses random placement, it 
does allow tailoring for specific requirements. 

Initially, the multiplexer and counter design did not route com- 
pletely. To achieve good placement and 100% routing comple- 
tion, several things were tried. Some worked well while others 
did not. An overview of how 100% routing completion was 
achieved is explained below. 

Since this design utilizes only 53% of the LCA device, delay is 
a larger concern than CLB utilization. In this case, placement 
is critical to achieve an efficient design with the desired de- 
lays. The ideal placement would be for the 5-bit counter and 2- 
to-1 multiplexer to be placed near the ADDR(4:0), CK, and LD 
signals. The four M8-1 macros should be near the 8-bit cluster 
of inputs. The remaining M4-1 macro should be placed near the 
OUT signal. 

In FutureNet, the l/Os are assigned pin numbers so that the 
signals are clustered into groups. These I/O assignments re- 
strict the APR's placement of the signals and are found in the 
<filename>.SCP. Other restrictions and options are specified 
when invoking the APR software. The options, "-a", "-g", "-e", 
and "-k", tailor the placement for this specific design. 

The APR's "-a" option specifies the depth of CLB logic levels 
which are considered connected. In this design, "-a2" was se- 
lected because the CI 6BPRD and M8-1 macros are implement- 
ed in two CLB logic levels. 

The "-g" and "-e" options are related. The "-g" option specifies 
the number of CLBs that should be grouped together. The "-e" 
option determines the number of CLBs in a group which should 
be evaluated for all possible placements. Since the C16BPRD 
and M8-1 macros contain four to six CLBs, the options, "-e4" 
and "-g4", were used. This particular value was selected be- 
cause it was the largest possible value which maintained the 
e>=g relationship. Avoiding values where e<g insures as much 
as possible that the CLBs within each group are placed in the 
best possible arrangement. This maintains the groups' integrity. 



The "-k" option specifies the number of shapes per group. By 
trial and error, "-k6" works well for this design. With "-k6", six of 
the best arrangements or shapes were saved for each group of 
CLBs. Every combination of shaped group is tried with all six 
other grouped shapes and evaluated for best placement. Gen- 
erally, the larger values result in better placements, however, 
run-time grows exponentially. While larger values of "-k" were 
tried, they did not improve the placement significantly. 

Running APR with these options and restrictions did not result 
in a 100% routed design. Therefore, more application-specific 
information was required. Noting that the APR software was not 
fully utilizing the high-level macro information, placement could 
still be optimized. The APR's report file shows that the CLBs 
which comprise C16BPRD or the M8-1 macros were not 
grouped together. Instead, the CLBs were randomly spread 
over the device. In a constraint file, <filename>.CST, CLBs 
from the high-level macros could be grouped together (see Ap- 
pendix A). If necessary, the exact CLB placement could be 
specified in the same file. 

To generate the constraint file, CLBs which implement each 
high-level macro were identified using the cross-reference file 
generated by PIN2LCA, <filename>.CRF. The <filename>.CRF 
file is organized into three cross-reference sections: macros, 
CLBs, and lOBs. 

The macro cross-reference section identifies and assigns a 
hierarchical symbol number to each macro. The symbol num- 
ber is used to generate the default CLB names. 

21-1, \M8-1. DWG 
^^^ Path: 



ASSIGNED ^ 
SYMBOL NUMBER 



\USER\THERESA\TDM.DWG(32) 
Title: M8-1 



The CLB cross-reference section of the <filename>.CRF con- 
tains the CLB names. For macros with multiple CLBs, the sys- ' 
tem assigns default names. The default CLB name consists of 
two parts. The first is the macro symbol number as specified in 
the macro section and the second number is the signal name. 
By grouping CLBs with related symbol numbers, all the CLBs 
for a given macro will be placed together. 

CLBBDName = '21-D03': 
F = signal '21-D03', contains: 

symbol •26-OR(2)', output signal = '21-D03' 
symbol '26-AND(3)', output signal = '26-SO' 
symbol '27-OR(2)', output signal = '21-D01' 
symbol '26-AND)1)', output signal = '26-S1' 

The lOB cross-reference section shows the lOB assignments. 
For this design, the assignments are those specified in the Fu- 
tureNet's schematic. 

lOB P30: Name = 'D13', Symbol = 'P1N(40)' 

l = signarD13' 
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Optimizing the Design witii 
Monolithic IMemories' XACT 
Design Editor System 

Monolithic Memories' XACT Design Editor System can increase 
resource utilization and performance by manually modifying 
tfie placement and routing. The XACT system provides a 
graphic interface to specify the CLB design. All CLB logic func- 
tions and connections can be optimized for the designs' 
needs. With XACT, the designer can partition the design and 
optimize the placement of logic blocks to gain the utilization 
and performance required. 

The clock line routing was optimized to reduce clock skew. Us- 
ing the clock buffer resources, the common clock is driven by 
the global clock buffer. To optimize internal routing, "long lines" 
were used for signals with large fanouts such a$ the select 
lines for the multiplexers and the LD signal. Use of the "long 
lines" minimizes the delay for these control signals. The "long 
lines" were selected by manually routing the signal net through 
the programmable interconnect points (PIPs). 

On the device itself, implementing the 5-bit counter and 2-to-1 
multiplexers requires ten CLBs; and the 32-to-1 multiplexer re- 
quires twenty-four CLBs, giving a total of thirty-four CLBs. This 
design utilizes 53% of the LCA device. 



Summary and iVIore Information 

For time division multiplexing systems, data selection can be 
implemented in a single CMOS device. FutureNet's DASH 
schematic capture with Monolithic Memories' LCA macro library 
was used to implement a design in an LCA device. Placement 
of CLBs and signal routing were performed by Monolithic Mem- 
ories' APR software. Optimization, which may be required to 
improve performance, can be done using Monolithic Memories' 
XACT Design Editor System. This is useful for placing and rout- 
ing critical signal nets such as clock or control signals. More 
information can be found in P-SILOS, FutureNet, and LCA user 
guides and handbooks. 

This design, developed with an LCA device, is available upon 
request. The FutureNet drawing, LCA design and bit pattern 
files can be provided for programming the LCA device in an EP- 
ROM. Please ask for design XDES07.LCA. 



Print World! THERESA. LCA (2064PD48-70) , XACT 1.30, 10:15:46 JUL 31, 1987 

IflDD] IflDDl 1913153 ^^ |Dl51|U3]|GFig|5|c]|uIB] i|31^ &it 
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pio[2> — ^ 
piiO — >^ 

peO — ^ 



P5C>— >- 
P130 ^ 



P26C> ^ 

P28C> ^ 

P30[^> 1>- 

P34C> ^ 



P3 0— >- 

p«C>H>- 



p^O — 1> 

P40O ^ 

P38C> ^ 

P36[;;3> — ^ 



' — 5F1 \GMUX 



pisO — 1>- 

P19CZ> — ^ 

P210 — o 
P23ni> — ^ 



•[:>->- 



3 0— >- 



)G>H>- 



P220 — ^ 



=[:>— >- 



P27 ^ 



3[:>— >■ 



P33C> 1>- 



'C:>-l>- 



=c^— 1>- 



P48C>— >■ 



P46G> 1>" 



P44[3> ^ 



P410 >- 



P39 ^ 



P37D> ^ 



^^^ sGMUX 



POUT 

-o-o 

P35 



Figure 3. FutureNet Schematic 
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Appendix A> Constraint File 



; CONSTRAINT FILE GROUPING MACROS IN BLOCKS 

DEFINE GROUP CNT_GROUP 
3-T3 3-T2 CKB 
C4 C2 C3 CI CO 
M3 M2; 

DEFINE GROUP OUT_GROUP 
CLBl 45-D23; 

DEFINE GROUP M1_GR0UP 
47-SO 

19-SO 19-Sl 
13-D03 13-D47; 

DEFINE GROUP M2_GR0UP 
47-Sl 

15-SO 15-81 
21-D03 21-D47; 

DEFINE GROUP M3_GR0UP 
27-SO 27-81 
23-80 23-81 
36-80 36-81; 

DEFINE GROUP M4_GR0UP 
0UTM4 

42-80 42-81 
37-D47 37-D67 
39-80; 
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$ Simulation file for design 'TDM.sim' , type '2064N48-50' 
S Created by PIN2LCAVer. l.Olxat 10:58:00 JUL 14, 1987 

S 

! INPUT TDM.sim 



GLOBALRESET- 

CK.PAD 

LD . PAD 

ADDRO . PAD 

+ 78000 

ADDRl.PAD 

+ 78000 1 

ADDR2 . PAD 

+ 78000 1 

ADDR3 . PAD 

+ 78000 

ADDR4 . PAD 

+ 78000 1 

DO. PAD 

Dl.PAD 

D2 . PAD 

D3.PAD 

D4.PAD 

D5.PAD 

D6.PAD 

D7.PAD 

D8.PAD 

D9.PAD 

DIO.PAD 

Dll.PAD 

D12.PAD 

D13.PAD 

D14.PAD 

D15.PAD 

D16.PAD 

D17.PAD 

D18.PAD 

D19.PAD 

D20.PAD 

D21.PAD 

D22.PAD 

D23.PAD 

D24.PAD 

D25.PAD 

D26.PAD 

D27.PAD 

028 . PAD 

D29 . PAD 

D30.PAD 

D31.PAD 



. CLK SO 1 SI S Initial pulse to reset latches 

. CLK 500 1 1000 . REP 

.CLKO 40000 1 

. CLK 40000 41000 1 42000 . REP 40000 

. CLK 40000 42000 1 44000 .REP 40000 

. CLK 40000 44000 1 48000 . REP 40000 

.CLKO 40000 48000 1 56000 .REP 40000 

. CLK 40000 56000 1 72000 . REP 40000 



.CLKO 
.CLKO 
.CLKO 
.CLK 
.CLKO 
.CLKO 
.CLKO 
.CLKO 
.CLKO 
.CLKO 
.CLKO 
.CLKO 
.CLKO 
.CLK 
.CLKO 
.CLKO 
.CLKO 
.CLKO 
.CLKO 
.CLKO 
■ CLKO 
.CLKO 
.CLKO 
.CLKO 
.CLKO 
, CLK 
, CLK 
CLKO 
CLKO 
CLKO 
CLKO 
CLKO 



1000 
1000 
2000 
3000 
4000 
5000 
6000 
7000 
8000 
9000 
10000 
11000 
12000 
13000 
14000 
15000 
16000 
17000 
18000 
19000 
20000 
21000 
22000 
23000 
24000 
25000 
26000 
27000 
28000 
29000 
30000 
31000 



2000 
3000 
4000 
5000 
6000 
7000 
8000 
9000 
10000 
11000 
1 12000 
1 13000 
1 14000 
1 15000 
1 16000 
1 17000 
1 18000 
1 19000 
1 20000 
1 21000 
1 22000 
123000 
124000 
125000 
126000 
1 27000 
128000 
129000 
1 30000 
1 31000 
1 32000 



40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 
40000 



41000 

1 41000 
1 42000 
1 43000 
1 44000 
1 45000 
1 46000 
1 47000 
1 48000 
1 49000 
1 50000 
1 51000 
1 52000 
1 53000 
1 54000 
1 55000 
1 56000 
1 57000 
1 58000 
1 59000 
1 60000 
1 61000 
1 62000 
1 63000 
1 64000 
1 65000 
1 66000 
1 67000 
1 68000 
1 69000 
1 70000 
1 71000 



42000 1 

43000 1 

44000 1 

45000 1 

46000 1 

47000 1 

48000 1 

49000 1 

50000 1 

51000 1 

52000 1 

53000 1 

54000 1 

55000 1 

56000 1 

57000 1 

58000 1 

59000 1 

60000 1 

61000 1 

62000 1 

63000 1 73000 

64000 1 

65000 1 

66000 1 

67000 1 

68000 1 

69000 1 

70000 1 

71000 1 

72000 1 



. MON I TOR CK . PAD LD . PAD ; ADDR4 . PAD ADDR3 . PAD ADDR2 . PAD ADDRl . PAD ADDRO . PAD ; 
+ 04 C3 C2 CI CO ; 

+ DO . PAD Dl . PAD D2 . PAD D3 . PAD D4 . PAD D5 . PAD D6 . PAD D7 . PAD ; 
+ D8.PAD D9.PAD DIO.PAD Dll.PAD D12 .PAD D13 . PAD D14 . PAD D15 .PAD ; 
+ D16 . PAD 017 . PAD D18 . PAD D19 . PAD D20 . PAD D21 . PAD D22 . PAD D23 . PAD ; 
+ D24 . PAD 025 . PAD D26 . PAD D27 . PAD D28 . PAD D29 . PAD D30 . PAD D31 . PAD ; OUT . PAD 

. TABLE CK . PAD LD . PAD ; ADDR4 . PAD ADDR3 . PAD ADDR2 . PAD ADDRl . PAD ADDRO . PAD ; ; 
+ C4 03 C2 01 CO ; 

+ DO . PAD DI . PAD D2 . PAD D3 . PAD D4 . PAD D5 . PAD D6 . PAD D7 . PAD ; 
+ D8.PAD D9.PADD10.PADD11.PADD12.PADD13.PADD14.PADD15.PAD ; 
+ D16 . PAD 017 . PAD D18 . PAD D19 . PAD D20 . PAD D21 . PAD D22 . PAD D23 . PAD ; 
+ D24 . PAD 025 . PAD D26 . PAD D27 . PAD D28 . PAD D29 . PAD D30 . PAD D31 . PAD ; OUT . PAD 
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SILOS 



OUTPUTS 



10:26:22 



07-23- 



CL AAAAA 
KD DDDDD 
. . DDDDD 
PP RRRRR 
AA 43210 
DD 



TIME 



500 

1000 

1500 

2000 

2500 

3000 

3500 

4000 

4500 

■ 5000 

5500 

GOOO 

6500 

7000 

7500 

8000 

8500 

9000 

9500 

10000 

10500 

11000 

11500 

12000 

12500 

13000 

13500 

14000 

14500 

15000 

15500 

16000 

16500 

17000 

17500 

18000 

18500 

19000 

19500 

20000 

20500 

21000 

21500 

22000 

22500 

23000 

23500 

24000 

24500 

25000 

25500 

26000 

26500 

27000 

27500 

28000 

28500 

29000 

29500 

30000 

30500 

31000 

31500 

32000 

32500 

33000 

39000 

39500 

40000 

40500 

41000 

41500 

42000 

42500 



PPPPP 
AAAAA 
DDDDD 

00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
00 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 
10 00000 
00 00000 

00 00000 

10 00000 

01 00000 

11 00000 
01 00001 
11 00001 
01 00010 
11 00010 



CCCCC DDDDDDDD DDDDDDDD DDDDDDDD DDDDDDDD O 

43210 01234567 89111111 11112222 22222233 U 

012345 67890123 45678901 T 

PPPPPPPP PP 

AAAAAAAA AAPPPPPP PPPPPPPP PPPPPPPP P 

DDDDDDDD DDAAAAAA AAAAAAAA AAAAAAAA A 

DDDDDD DDDDDDDD DDDDDDDD D 



00000 
00000 
00000 
00001 
00001 
00010 
00010 
00011 
00011 
00100 
00100 
00101 
00101 
00110 
00110 
00111 
00111 
01000 
01000 
01001 
01001 
01010 
01010 
01011 
01011 
01100 
01100 
01101 
01101 
OHIO 
OHIO 
01111 
01111 
10000 
10000 
10001 
10001 
10010 
10010 
10011 
10011 
10011 
10100 
10101 
10101 
10110 
10110 
10111 
10111 
11000 
11000 
11001 
11001 
11010 
11010 
11011 
11011 
11100 
11100 
11101 
11101 
11110 
11110 

11111 
11111 

00000 
00000 
00110 
00111 
00111 
01000 
01000 
00001 
00001 
00010 



10000000 
10000000 
01000000 
01000000 
00100000 
00100000 

oooipooo 

00010000 
00001000 
00001000 
00000100 
00000100 
00000010 
00000010 
00000001 
00000001 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
01111111 
01111111 
10111111 
10111111 
11011111 
11011111 



00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
10000000 
10000000 
01000000 
01000000 
00100000 
00100000 
00010000 
00010000 
00001000 
00001000 
00000100 
00000100 
00000010 
00000010 
00000001 
00000001 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 

11111111 
11111111 
11111111 
11111111 
11111111 
11111111 



00000000 
00000000 
00000000 
00000000 
00000000 
00000000 

ooonoooo 

00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
10000000 
10000000 
01000000 
01000000 
00100000 
00100000 
00010000 
00010000 
00001000 
00001000 
00000100 
00000100 
00000010 
00000010 
00000001 
00000001 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 

11111111 
11111111 
11111111 
11111111 
11111111 
11111111 



00000000 1 

00000000 1 

OOOOOOOO 1 

00000000 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

OOOOOOOO 1 

00000000 1 
10000000 1 
lOOOOOOO 1 
01000000 1 
01000000 1 
00100000 1 
00100000 1 
00010000 1 
00010000 1 
00001000 1 
00001000 1 
OOOOOIOO 1 
00000100 1 
00000010 1 
00000010 1 

00000001 1 
00000001 1 
OOOOOOOO 1 
OOOOOOOO 
OOOOOOOO 
OOOOOOOO 
OOOOOOOO 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
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SILOS 



OUTPUTS 



10:26:22 



07-23-87 



TIME 
43000 
43500 
44000 
44500 
45000 
45500 
46000 
46500 
47000 
47500 
48000 
48500 
49000 
49500 
50000 
50500 
51000 
51500 
52000 
52500 
53000 
53500 
54000 
54500 
55000 
55500 
56000 
56500 
57000 
57500 
58000 
58500 
59000 
59500 
60000 
60500 
61000 
61500 
62000 
62500 
63000 
63500 
64000 
64500 
65000 
65500 
66000 
66500 
67000 
67500 
68000 
68500 
69000 
69500 
70000 
70500 
71000 
71500 
72000 
72500 
73000 
73500 
74000 
74500 
75000 
75500 
76000 
76500 
77000 
77500 
78000 
78500 
79000 
79500 
80000 



CL AAAAA 
KD DDDDD 
. . DDDDD 
PP RRRRR 
AA 43210 

DD 

PPPPP 
AAAAA 
DDDDD 

01 00011 
11 00011 
01 00100 
11 00100 
01 00101 
11 00101 
01 00110 
11 00110 
01 00111 
11 00111 
01 01000 
11 01000 
01 01001 
11 01001 
01 01010 
11 01010 
01 01011 
11 01011 
01 01100 
11 01100 
01 01101 
11 01101 
01 OHIO 
11 OHIO 
01 01111 
11 01111 
01 10000 
11 10000 
01 10001 
11 10001 
01 10010 
11 10010 
01 10011 
11 10011 
01 10100 
11 10100 
01 10101 
11 10101 
01 10110 
11 10110 
01 10111 
11 10111 
01 11000 
11 11000 
01 11001 
11 11001 
01 11010 
11 11010 
01 11011 
11 11011 
01 11100 
11 11100 
01 11101 
11 11101 
01 11110 
11 11110 
01 11111 

11 11111 

01 00000 
11 00000 
01 00001 
11 00001 
01 00010 
11 00010 
01 00011 
11 00011 
01 00100 
11 00100 
01 00101 
11 00101 
01 10110 
11 10110 
01 10110 

11 loin 

01 101 -.J 



CCCCC DDDDDDDD DDDDDDDD DDDDDDDD DDDDDDDD 
43210 01234567 89111111 11112222 22222233 U 

012345 67890123 45678901 T 

PPPPPPPP PP 

AAAAAAAA AAPPPPPP PPPPPPPP PPPPPPPP P 

DDDDDDDD DDAAAAAA AAAAAAAA AAAAAAAA A 

DDDDDD DDDDDDDD DDDDDDDD D 



00010 
00011 
00011 
00000 
00000 
00101 
00101 
00110 
00110 
00111 
00111 
01100 
01100 
01001 
01001 
01010 
01010 
01011 
01011 
01000 
01000 
01101 
01101 
OHIO 
OHIO 
01111 
01111 
10100 
10100 
10001 
10001 
10010 
10010 
10011 
10011 
10000 
10000 
10101 
10101 
10110 
10110 
10111 
10111 
11100 
11100 
11001 
11001 
11010 
11010 
11011 
11011 
11000 
11000 
11101 
11101 
11110 
11110 

11111 
11111 

00100 
00100 
00001 
00001 
00010 
00010 
00011 
00011 
00000 
00000 
00101 
00101 
10110 
10110 
10110 
10110 



11101111 
11101111 
11110111 
11110111 
11111011 
11111011 
11111101 
11111101 
11111110 
11111110 

11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 

lllllHl 

11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 

llllllH 

11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 
11111111 

llHllll 

11111111 
11111111 
11111111 

lllllHl 

11111111 
11111111 

lllllHl 
lllllHl 

11111111 
11111111 
11111111 

lllHlll 
lllllHl 

11111111 
11111111 
11111111 
11111111 

llHllll 
lllllHl 



lllllHl 

11111111 

lllllHl 

11111111 

lllllHl 
lllllHl 
lllllHl 

11111111 
11111111 
11111111 

01111111 
01111111 
10111111 
10111111 
11011111 
11011111 
11101111 
11101111 
11110111 
11110111 
11111011 
11111011 
11111101 
11111101 
11111110 
11111110 

11111111 

lllllHl 

11111111 

llHllll 
lllllHl 
llllllH 
11111111 
11111111 
llHllll 
lllllHl 
llllllH 
11111111 
llHllll 
llllllH 
llHllll 
11111111 
11111111 
llHllll 
11111111 
llHllll 
11111111 
llHllll 
llllllH 
11111111 
lllllHl 
11111111 
11111111 
llllllH 
11111111 
lllHlll 
llHllll 
11111111 
11111111 
llllllH 
11111111 
11111111 
lllHlll 
11111111 
11111111 
llllllH 
llllllH 
11111111 
11111111 
11111111 
11111111 
llllllH 
11111111 
11111111 
11111111 



11111111 
lllHlll 
llHllll 
lllllHl 
11111111 
lllllHl 
11111111 
11111111 
11111111 
11111111 
lllHlll 
llHllll 
llllllH 
llHHll 
llllllH 
llllllH 
llllllH 
llllllH 
lllHlll 
lllllHl 
lllllHl 
llllllH 
11111111 
llllllH 
llllllH 
lllllHl 
01111111 
01111111 
lOlHlll 
10111111 
11011111 
11011111 
11101111 
11101111 
11110111 
IHlOlll 
11111011 
11111011 
11111101 
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Introduction 

In a Private Branch Exchange (PBX) or Central Office (CO), 
incoming digitized voice circuits are "switched" or routed to 
their appropriate destination. The "switching" takes place while 
the data is in serial or parallel format. An eight-bit format conver- 
ter circuit, widely used in a variety of PBX and CO architectures, 
translates the serial data streams into parallel data streams and 
vice-versa. 

The eight-bit format converter circuit requires a large number of 
register elements. These elements, namely serial shift registers, 
can be implemented efficiently in an LCA device due to its high 
register content and flexible structure. The LCA M2018 pro- 
grammable device contains one hundred configurable register 
elements, ninety-nine of which are used in the implementation 
of the converter circuit. This circuit is capable of running at 12.5 
MHz and 18.5 MHz in a 50-MHz and 70-MHz LCA device, 
respectively. 

Powerful software tools and circuit uniformity allowed the 
design to be laid out and simulated in just two days. However, if 
different support logic becomes necessary, PBX vendors are 
able to generate a new configuration program in the LCA re- 
programmable device. Since these devices are manufactured in 
a CMOS technology, a complete PBX system with several LCA 
devices can help keep the active power consumption down. 
(Please refer to the LCA Design and Applications Handbook- 
Reference 3 for basic information regarding structure and pro- 
gramming aspects.) 



Overview of a PBX 

Central Offices (CO) and Private Branch Exchanges (PBX) pro- 
vide both telephone and data services. A PBX may be consi- 
dered as a localized telephone exchange serving the intercon- 
nection requirements of users in office environments. Outgoing 
calls are directed out of the PBX and routed through the local 
CO to the far destination. In general terms, a PBX or a CO 
consists of major blocks shown In Figure 1 . Racks of peripheral 
equipment modules containing line cards provide access to 
telephone units, or to data adapter modules which provide data 
communication services. Trunk cards provide high-speed links 
to host computers, or other PBXs. Network switching modules 
interconnect calling parties to answering parties via digital mul- 
tiplexed links. Control modules containing a Central Processor 
Unit (CPU), mass storage and local memory, perform the 
"setup" and "tear-down" of each circuit connection, as well as 
the monitoring of PBX activities. Central resources are modules 
of hardware and software which operate in a time-shared 
manner. The central resources include facilities such as ringing 
tone generator and message recording modules. 
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Figure 1. PBX Communication and Control Architecture 
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Hierarchy of Channel Multiplexing 
in a PBX 

The main function of switching modules is to interconnect cir- 
cuits between tv;o or more parties such as in a conference call. 
A digital telephone unit, or a line card, contains a Coder/De- 
coder (CODEC) device. The CODEC device digitizes voice and 
transmits it into a Pulse Code Modulated (PCM) data stream on 
a common serial data highway at appropriate time intervals, 
which are assigned at the start of the call. 

Analog voice is sampled at 8 KHz and passes through an eight- 
bit analog-to-digital converter. The digitized sample (eight-bit 
binary word) is transmitted serially to produce a serial data 
stream of (8 x 8 = ) 64 Kbps. Thirty-two of these samples or 
CODECS are normally connected to a common highway operat- 
ing at (32 X 64 = ) 2.048 Mbps. Each CODEC is assigned "Serial 
Time Slots" of eight-bit duration, and sends its eight bits of data 
at the rate of 2.048 Mbps every 8 KHz. Another time slot is 
available on a separate highway for the receive data stream. 

Therefore, a 2.048 Mbps highway can support thirty-two chan- 
nels or voice connections in one direction. However, there are 
several ways to combine serial highways for additional voice 
connections. One way is to combine four highways into one 



high-speed highway of 8.192 Mbps. Further integration is possi- 
ble, but the aggregate data rate becomes high and leads to 
implementation problems including timing, signal reflection, 
and radiation, to name a few. 

Another way to integrate channels without increasing the clock 
rate is to convert eight serial data highways into a single eight-bit 
parallel bus. With this method, the serial clock rate and the 
parallel clock rate are the same, 2.048 Mbps. This means that 
data throughput on the eight serial highways of 16.384 Mbps is 
maintained in the eight-bit parallel bus operating at 2.048 Mbps. 
This works by careful alignment of the incoming parallel time 
slots to serial time slots on the serial highways. An eight-bit 
format converter scheme, sometimes called a "Corner Bender", 
is widely used in PBX or CO switching modules. Four of these 
parallel buses are combined (see Figure 2) to form a single 
parallel system bus that is thirty-two bits wide, yet still operates 
at 2.048 Mbps. The multiplexing architecture shown in Figure 2 
is used in the Harris 20-20™ Integrated Network Switch (see 
Reference 1). 

Space switching and/or time slot switching to interconnect dif- 
ferent parties is performed either in the serial highway format or 
in the parallel data stream format. 



SYSTEM PCM BUS 4MHz x 32 BITS 



FOURTH 
MUX 
LEVEL 



2048 

PORT TIME 

SWITCH 



9 



THIRD 
MUX 
LEVEL 



2MHzx8Bits 

(256 PORTS) 

CORNER BENDER 



■« TIMING - 



2MHzx8Bits 

(256 PORTS) 

CORNER BENDER 



PERIPHERAL GROUP BUS 
2MHZX1 BIT (32 PORTS) 



SECOND 
MUX 
LEVEL 



7 OTHER 
GROUPS 



3 OTHER CARDS 



A— 



ON-UNIT BUS 
y 2 MHz (8 PORTS) 



FIRST 
MUX 
LEVEL 



7 OTHER CODES 



TELEPHONY 
PERIPHERAL 
UNIT 



ANALOG VOICE 



* = TRI-STATE 



Figure 2. Multiplexing Levels 



il 



Monoliihio 



m 



Memories 



2-445 



LCA Device Implements an 8-Bit Format Converter in a PBX Switching i\lloduie 



Description of tlie 
"Corner Bender" Circuit 

The circuit consists of eight parallel-to-serial shift registers, and 
additional shift registers which provide proportional delays to 
synchronize the outputs. The principle of operation is shown in 
Figure 3, and the circuit is shown in Figure 4. Parallel data from 
the parallel bus is loaded with each clock into one of the eight 
sh ift registers in a rotating pattern. The data then shifts out of the 
shift register with each subsequent clock. After all eight registers 
have been loaded, the first register will be empty on the next 
clock. New data is loaded into this register at the same clock 
edge as the last bit shifts out to the next stage. Thus, a continu- 



ous flow of data through the registers is insured. The data out of 
the eight shift registers is skewed in relation to each other as 
shown in Figure 3b. The various space slots on the highways 
need to be aligned to keep synchronization. This is achieved by 
proportionally delaying the outputs. Channel 1 output is delayed 
by seven clocks (using a seven-bit serial shift register), channel 2 
is delayed by six clocks, etc. Channel 8 output does not go 
through any additional delay. The final outputs of the delay 
registers are aligned and data is clocked out in a synchronous 
fashion. See Figure 3c. 
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Figure 4 also shows the additional circuitry needed to control 
the shift registers. A preloadable three-bit counter keeps count 
of eight clock pulses. The parallel-to-serial bus conversion can 
be "programmed" to start in any register by setting the approp- 
riate binary value on the counter preload inputs and applying a 
pulse to the sync input. If the loading sequence produced by the 
counter is not required, it can be disabled by connecting the 
"clock" to "sync" input. At each positive clock edge, the register 
loaded will depend upon the data at the counter inputs on the 
previous positive clock edge. The 3-to-8 decode circuit produ- 



ces load pulses to latch parallel data into the shift registers. 
Figure 5 shovi^s the parallel-to-serial conversion data matrix. 

The description above shows the conversion of parallel data into 
eight streams of serial data. However, the same circuit also 
performs serial-to-parallel conversion. A serial eight-bit data 
stream on one of the eight inputs appears as an eight-bit parallel 
word on the eight outputs. Successive parallel words appearing 
at the eight outputs correspond to the serial data on each of the 
eight inputs in rotation. See Figure 5. 
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DATA 
INPUTS 



1 A0|A1|A2|A3| A4|A5|A6|A7 1 I I I I I | | | 




1 B0| B1|B2|B3|B4{B5|B6 |B7| I I I I I I I I 




1 C0| C1|C2|C3|C4{C5|C6|C7| | | | | | | | | 




|D0|D1|D2|O3|D4|D5|D6|D7| | | | | | | | | 
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PARALLEL TO SERIAL CONVERSION 

AO, BO, CO, DO, EO, FO, GO, HO = IN(l)0, IN(I)1, IN(I)2, IN(I)3, IN(I)4, IN(I)5, IN(I)6, IN(I)7 

A1, B1, C1, D1, E1, F1, G1, HI = IN(ll)0, IN(II)1, IN(II)2, IN(II)3, IN(II)4, IN(II)5, IN(II)6, IN(II)7 

SERIAL TO PARALLEL CONVERSION 

AO, A1,A2,A3,A4,A5,A6,A7 = IN(l)0, IN(I)1, IN(I)2, IN(I)3, IN(I)4, IN(I)5, IN(I)6, iN(l)7 

B0,B1, B2, B3,B4, B5, B6, B7 = IN(ll)0, IN(II)1, IN(II)2, IN(II)3, IN(II)4, IN(II)5, IN(II)6, IN(II)7 

* IN(I)X ARE THE INPUT 8-BIT WORDS 

Figure 5. Data Conversion 



DATA 
OUTPUTS ■ 



1 1 1 1 1 1 1 iHOlGOl F0|E0|DO|CO|B0|A0| 




1 1 1 1 1 1 1 |H1|G1| F1|E1|D1|C1 |B1|A1| 




1 1 1 1 1 1 1 |H2|G2| F2|E2|D2|C2|B2|A2| 




1 1 1 1 1 1 1 |H3|G3| F3|E3|D3|C3|B3|A3| 




1 1 1 1 1 1 1 1 H4|G4| F4|E4|D4|C4|B4|A4| 




1 1 1 1 1 1 1 |H5|GS| F5|E5|D5|C5|B5|A5| 




1 1 1 1 1 1 1 |H6|G6|F6 |E6 |D6|C6|B6|A6| 




1 1 1 1 1 1 1 |H7|G7|F7|E7 |D7|C7|B7|A7| 



Comparison of Implementation 
Techniques 

The "Corner Bender" circuit can be implemented in several 
ways. The efficiency of an LCA design can be compared to that 
of a design in SSI/MSI logic devices or Erasable Programmable 
Logic devices (EPLDs). Table 1 contains the comparison of 
package counts with the different alternatives. 

The circuit implemented in 74LSxx devices requires nineteen 
packages. Alternatively, sixteen simple PLD devices (such as a 
PAL20R8 or a PAL20R4) are required to implement the same 
circuit. Since the circuit is register intensive and not designed for 
high speed, PLDs are not chosen for this application. 

The functionally larger EPLD devices, EP1200and EP1800,are 
more register intensive and contain twenty-eight and fourty- 
eight register elements, respectively. Nevertheless, four EP1200 
devices, or two EP1 800 plus a smaller PLD device, is required to 
implement this design. 

The entire Corner Bender circuit fits neatly into a single LCA 
2018 device. It uses ninety-nine out of the one hundred available 
internal macrocells and demonstrates efficient implementation 
of shift registers and small counters. An LSI device from Plessey 
(Reference 2) is available to implement the same circuit. How- 
ever, it is an NMOS device and limited to speeds of under2 MHz. 



EQUiVALENT CIRCUIT IN SSI/MSi DEVICES 


PACKAGES 


Parallel in serial out shift Registers 74LS165 


8 


Serial in Parallel out shift Registers 74LS164 


7 


4-bit counter 74LS161 


1 


3 to 8 decoder 74LS138 


1 


Three-state buffers 74LS241 


1 


EQUIVALENT CIRCUIT IN EPLDs 


a) EP1200 28 macro cells each 


4 


b) EP1800 48 macro cells each 


2 


plus PAL device e.g., 16R4 


1 


EQUiVALENT CIRCUIT IN MIVII LCA DEVICE 


LCA 2018 device 100 macro cell 


1 



Table 1. Implementation Alternatives for the 8-Bit Format 
Converter Circuit "Corner Bender" 
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"Corner Bender" Design in an 
LCA Device 

A count of the register elements in the circuit shows a need of 
sixty-four elements for the eight-shift registers, twenty-eight 
elements for the delay registers, three elements for the counter, 
and four elements for the 3-to-8 decode circuit. Each configura- 
ble logic block (CLB) in the LCA device can produce two out- 
puts. Hence, only four CLBs are required for the 3-to-8 decode 



circuit. Since, each CLB contains one register element, the total 
count of CLBs is therefore ninety-nine. The LCA 2018 contains 
one hundred CLBs. Twenty-two input/output blocks (lOBs) 
were used. Figure 6 shows the layout of the design in the LCA 
device. The circuit fits into a 68-pin PLCC package. 



XACT 1.30, 10:50:36 MAY 5, 1987 
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Figure 6. Layout Diagram of "Corner Bender" Circuit on LCA Device of Size 10x10 
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The Monolithic IVIemories' XACF" Design Editor is used to 
create the design by implementing the appropriate logic func- 
tions in CLBs and lOBs. An example of a CLB for one bit of the 
eight-bit shift register in this design is shown in Figure 7. Input A 
is the decoder input and it is ANDed with input B, the data to be 
shifted. To implement the eight-bit shift registers from the one- 
bit shift register, eight of the one-bit shift registers were linked 
together so that the output of each register became an input to 
the next register. This is shown in Figure 8 where each CLB 
represents one bit of the eight-bit shift register. 

The three-bit counter and the 3-to-8 decoder in this design were 
implemented in CLBs as shown in Figures 9 and 10. The counter 
is a synchronous binary counter with ripple carry and parallel 
load. The decoder is a standard 3-to-8 decoder. The outputs of 
the counter become inputs to the decoder, whereas the outputs 
of the decoder are used to decode the eight 8-bit shift registers. 

With the XACT Development System, the designer can optimally 
arrange the logic blocks on the LCA device in order to minimize 
net delays between each block. With this in mind, the layout for 
the design is described below: 

Four of the eight-bit parallel-to-serial shift registers were placed 
starting from the top left-hand edge of the LCA device (see 
Figure 7). The three-bit counter {three CLBs) and the 3-to-8 
decoder (four CLBs) were then placed on the following row. The 
next four rows contained the last four parallel-to-serial shift 
registers. This allowed the shift register select lines to have 
minimal delay spread when accessing all eight shift registers. 
The seven serial-to-serial shift registers were placed in the 
remaining CLBs as uniformly as possible. 



The optimum placement and distribution of configured blocks 
in the array is influenced by the performance needs of the 
system. Blocks placed in close proximity can use local intercon- 
nection resources which incur short signal propagation delays, 
whereas blocks placed further apart must use either "long lines" 
or other interconnection resources. Manual optimization using 
the delay efficient "long lines" was performed for the most criti- 
cal net connections. After routing completion, the longest delay 
between two clock pulses was the delay for the counter to 
change state, the state which is decoded via the 3-to-8 decoder 
and selects the appropriate shift registerto load the parallel data 
(see Figure 7). This delay was 54 ns, 79 ns, and 106 ns, respec- 
tively for the 70-, 50-, and 33-MHz versions of the device. The 
delays were measured using the XACT simulation package by 
invoking the timing delay calculator. This translates to a maxi- 
mum circuit operating speed of about 18.5 MHz for the 70-MHz 
version of the device, or 9.43 MHz for the 33 MHz version. 

Although fabricated in a CMOS technology, the inputs to the 
LCA device can be made either TTL or CMOS compatible. For 
high fan-out CMOS or LS TTL-compatible loads, the output 
buffers of the LCA device are capable of driving 4 mA. Moreover, 
each output buffer can be put into a HIGH-Z state for bus-driving 
applications. This features was also used in the design of the 
eight-bit format converter. 

More information about entering a design with the XACT Devel- 
opment System is included in the LCA Design and Applications 
Handbook (Reference 3). Information about configuring the 
LCA device is described in the "Configuring the LCA Device" 
applications note. 



X 

Y 

Q 

SET 

RES 

CLK 



FQ 

FQ 

FF LATCH 

AF 

OF 

KCFNOT 



n 



BLK: AA 
SRAL8 



CLK 
Q 



A: DEC8_ 

B: INA_ 

C: 

D: 

K: CLKIN_ 

X: L8SRA_ 

Y: 



Figure 7. One Bit of an 8-Bit Serial - Parallel Shift Register 
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Figure 9a. Decode Outputs 1 and 2 of 3-to-8 Decoder 



Figure Sc. Decode Outputs 5 and 6 of 3-to-8 Decoder 
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Figure 9b. Decode Outputs 3 and 4 of 3-to-6 Decoder 



Figure gd. Decode Outputs 7 and 8 of 3-to-8 Decoder 



Figure 9. CLB Configuration of a 3-to-8 Decoder 
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Figure 10a. 1st Bit of a 3-Bit Counter 
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Figure 10b. 2nd Bit of a 3-Bit Counter 
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Figure 10c. 3rd Bit of a 3-Bit Counter 



Figure 10. CLB Configuration of a 3-Bit Binary Counter witli Ripple Carry 



Summary 

The LCA device has achieved a good solution to the principles 
used for multiplexing digitally-encoded voice channels in both 
serial and parallel hierarchies. These hierarchies can accom- 
modate thousands of voice circuits in a PBX switching module. 
The detailed design of the eight-bit (parallel-to-serial and serial- 
to-parallel) format converter circuit, or Corner Bender, deve- 
loped in Monolithic Memories' LCA M201 8 device was shown to 
be efficiently implemented. Despite the fast development of the 
system using the XACT Design Editor, the final design was 
programmed and bench tested to verify functional integrity. 

The Corner Bender design is available from Monolithic Memo- 
ries upon request. The bit pattern and .LCA file will be provided 
for programming the LCA device in an EPROM. Please ask for 
design XDES05.LCA 
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Functional Description 

This application was first developed by LIT, Conflans Ste. 
Honorine, France. Part of the design, reprinted with courtesy of 
LTT, is used to control a serial data link based upon a specialized 
LSI chip. 

Originally designed with six standard SSI/MSI circuits, this same 
function can now be implemented, not only into a single 
PAL20RA10, but with even more features and better perfor- 
mance. The function can be divided into three sub-functions: 

1. Address Decoding 

2. Control Flags 

3. Transmission Speed Selection 

Up to four address lines are allowed (eight were actually used), 
plus two extra lines which are special decoding controls 
(MEM/10 selection, Enable Control . . .). Two flip-flops load flag 
conditions, from the address bus (A1 and A2), providing hand- 
shake between the 6850 DART and the communication lines. 
They have a common clock which also serves as Chip Select 
(CSO) for the UART 

The UARTTransmit clock (TXCLK) can be directly connected to 
the Receive Clock (CK or RXCLK) or represents the Receive 
Clock value divided by sixteen. This function was performed by 
four "D" Flip-Flops connected as a 4-stage Asynchronous 
Divider. Since each basis cell in a R'\L20RA10 has four Product 
Terms available, this function could be implemented either 
asynchronously or synchronously. In the PAL Design Specifica- 
tion example, a 4-bit synchronous divider was used instead of 
the asynchronous circuit shown in the schematic. 



Pin Description 

1 . TEST Allows preload function for testing. 

2. SYSRESET Reset line from microprocessor. 

3. A2 Address line from address bus. 

4. A1 Address line from address bus. 

5. HDSHAKE Handshake line (CTS/RTS). 

6. CK External clock. 

7. E Enable line from microprocessor. 

8. AUXDECOD Extra decoding line 

(e.g. board level decoding). 

9. A3 Address line from address bus. 

10. A3 Address line from address bus. 

11. A3 Address line from address bus. 

12. GND Reference power supply ground. 

13. /OE Output enable line. 

14. A6 Address line from address bus. 

15. SPEEDSEL Speed selection line. 

16. DIV4 MSB 4-bit synchronous counter. 

17. DIV3 3rd stage synchronous counter. 

18. DIV2 2nd stage synchronous counter. 

19. DIV1 LSB 4-bit synchronous counter. 

20. SCO UART chip select line (CSO). 

21. BLOCREC Bloc receive line. 

22. DIR DIV Direct or divided clock. 

23. /TPH External use flag. 

24. VCC 5 V power supply. 
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SYSRESET- 
A2- 
A1- 



AUXDECOD- 
A3- 



A5- 
AO- 



R/W - 
IRQ- 
DO- 
D1- 
D2- 
D3- 
D4- 
D5- 
D6- 
D7- 



TEST E 

E 



cso 

CS1 
CS2 



R/W 

IRQ 

DO 

D1 

D2 T) 

D3 

D4 

05 6850 

D6 

D7 



R/S 
RTS 
CTS 
DCD 
RXD 
TXD 
RXCLK 



GND E 



PAL 
20RA10 



3 VCC 

13 

ID 

B 

E 

3 NC 
3NC 
3 NC 

3 

3 



'Ph 



3 

3 /OE 



J. 



-DCD 
-RXD 
-TXD 



Figure 2. 



2-454 



Monolithic 



m 



iyiemories 



Serial Data Link Controller 



PAL Design Specification 



Simulation Results 



Title 

Pattern 

Revision 

Author 

Company 

Date 



Serial Data Link Controller 

Link . pds 

A 

Jose Juntas / Kelvin Chow 

Monolithic Memories Inc., Santa Clara, Ca 

3/1/85 



CHIP SE_CH_CNTRL PAL20RA10 

TEST SYSRESET A2 Al HDSHAKE CX E AUXDECOD A3 A4 A5 GHD 
/OE A6 SPEEDSEL DIV4 DIV3 DIV2 DIVl CSO BLOCREC DIRDIV 
/TPH VCC 



;Load A2 as flag 
;CLK W/ ADDR. decode 
; global system reset 

;Load speed ratio 
;CLK W/ ADDR. decode 
;CLR by CTS/RTS line 

; Controlled by speed 

.•option and CTS/RTS 

;llna 
/A6 * A5 *A4 *A3 *AUXDECOD*E 

;UART address valid 
DIVl ; 4-bit synchronous 

; divider LSB 
CK ;CLK by CK( external) 

/DIRDIV ;CLR by speed option 



EQUATIONS 




/TPH 

/TPH.CLKF 

/TPH.SETF 


!- A2 

- CSO 

- SYSRESET 


DIRDIV 
DIRDIV. CLKF 
DIRDIV. SETF 


:- Al 

- CSO 

- /HDSHAKE 


/BLOCREC 


- /DIRDIV 
+ HDSHAKE 



CSO 
/DIVl 



/DIVl. CLKF 
/DIVl. SETF 



/DIV2.CLKF 
/DIV2.SETF 



/DIV3.CLKr 
/DIV3.SETF 



/DIV4 . CLKF 
/DIV4.SETF 

SPEEDSEL 
SPEEDSEL. CLKF 

SPEEDSEL. SETF 



:- /DIV1*/DIV2 
+ DIV1*DIV2 



:- /DIV2*/DIV3 
+ /DIV1*/DIV3 
+ DIV1*DIV2*DIV3 

- CK 

- /DIRDIV 

:- /DIV3*/DIV4 
+ /DIV2*/DIV4 
+ /DIV1*/DIV4 
+ DIV1*DIV2*DIV3*DIV4 

- CK 

- /DIRDIV 

:- /Al 

- CSO 



;2ND stage of 

.•divider 

;CLK by CK(external) 

;CLR by speed option 

;3RD stage of 
/divider 

;CLK by CK( external) 
.•CLR by speed option 

;4TH stage of 
;divider MSB 



;CLK by CK( external) 
;CLR by speed option 

;Iioad speed choice 
;CLK H/ ADDR. decode 
;CLR by CTS/RTS line 
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SIMULATION 

TRACE_ON A1,A2,A3,A4,A5,A6,E, .-Signals to be 

AUXDECOD. SYSRESET, /TPH, HDSHAKE, ; observed 
CSO, SPEEDSEL, DIRDIV, CK, 
DIVl , DIV2 , DIV3 , DIV4 

SETF SYSRESET, /HDSHAKE ;Reset all regs 

CHECK /SPEEDSEL, /DIRDIV, TPH 

SETF /SYSRESET, A1,A2, A3, A4,A5,/A6, HDSHAKE, ;Set decode 
E, AUXDECOD ; condition 



CHECK /SPEEDSEL, DIRDIV 

FOR I:-l TO 15 DO 
BEGIN 
SETF CK 

SETF /CK 
END 



; Check SPEEDSEL and 
; DIRDIV regs 

.•This portion 
.•simulates divide 
;by four counter 
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The ZPAL PALC20R8Z family consists of four devices: 
combinatorial PALC20L8Z, and registered PALC20R8Z, 
PALC20R6Z, and PALC20R4Z. These devices offer zero 
standby and very low active power dissipation, along with 
traditional architecture and functionality of standard PAL 
devices. These 24-pin devices offer high speed (TPD = 35 ns) 
and are ideal for applications in communications equipment 
and instrumentation, which are typically under severe power 
constraints. 

This note describes one such application in communications 
equipment for Quadrature Amplitude Modulation (QAM) 
encoding and provides a design based on a PALC20R8Z 
device. QAM encoding technique is used in standalone modem 
equipment which uses CMOS devices. Monolithic Memories' 
CMOS ZPAL devices became a natural choice for replacing 
glue logic or any complete function that is being implemented in 
discrete SSI components. 

QAM Encoder 

Data modems use encoders as a means of increasing data 
throughput within a given bandwidth of a channel. Quadrature 
Amplitude Modulation (QAM) gives an input data rate of four 
times the encoding rate, which is the output rate of the 
encoder. The encoding rate determines the channel bandwidth 
required by the output signal. Voice band data modems have 
become popular for use over the switched telephone network 
for interconnecting personal computers to distant computers. 
In these modems the data encoder function is usually 
integrated within the modem chip set. However, higher-speed 
modems for 20-MHz digital radio, satellite up/down links and 
other group band modems implement the encoder as a dis- 
crete hardware function due to the speed requirements. A 
PALC20R8Z device is ideally suited for implementing the data 
encoder function because of its ability to implement state 
machines efficiently and its low power consumption. 

The PAL Design Specification (QAM.PDS) file enclosed shows 
the QAM Encoder example implemented using the PALASM®2 
software state machine syntax. 



Modem Encoding Techniques: 

Figure 1 shows the encoder function within a typical modem 
transmitter. Incoming data may be scrambled (to randomize 
the data) and then passed through the encoder. The output 
streams (I, Q) are passed through pulse-shaping filters before 
being multiplied by the sine and cosine carrier frequencies. 

Let us consider the QAM encoding technique as used in the 
voice band data modems. The scheme is also widely used in 
other high-speed applications. Some of the popular modem 
standards for the switched telephone network are Bell 21 2A 
(for 1200-bps operation), CCITT\/.22 (for 1200-bps operation), 
CCITT V.22bis (for 2400-bps operation) and CCITT V.32 (for 
9600-bps operation). 

According to Bell 21 2A, CCITT V.22 and V.22bis recom- 
mendations, full duplex communication over a pair of tele- 
phone wires is achieved by separating the bandwidth occupied 
by the signals traveling in opposite directions, (see Figure 2). 
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Figure 2. Full Duplex Communication by Bandwidth Separation 

The transmit carrier frequency of one modem is 1200 Hz while 
the far-end modem uses 2400 Hz as its transmit carrier fre- 
quency. The encoding rate (or symbol rate) for each carrier is 
600 baud (equivalent to transitions per second) and the band- 
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Figure 1 . Data Encoder in a Typical Modem Transmitter 
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width occupied by the outgoing signal spectrum is about 800 
Hz, centered around the carrier. Since the two bands of trans- 
mission do not overlap, no interference takes place between 
the two channels of communications. 

The transmit data throughput in each direction is increased 
from the 600 baud encoding rate by means of encoding two or 
four incoming data bits at a time. This produces a multilevel 
signal which is then used to modulate the carriers. The 
transmit data bit rate is then 1200 bits per second (bps) and 
2400 bps respectively. 

CCITT V.22bis and V.32 recommendation modems employ 
Quadrature Amplitude Modulation (QAM) technique for 
modulating the carrier frequency. The 16-state QAM signal 
constellation can be observed at the I and Q outputs (Figure 1) 
of the encoder. The diagram of Figure 3 is observed by 
connecting the I and Q outputs to the A and B channels of a 
dual channel oscilloscope and turning the timebase to the X-Y 
setting. 

The data stream to be transmitted is divided into groups of four 
consecutive bits called quadbits. The first two bits are 
encoded as a phase quadrant change relative to the quadrant 
occupied by the preceding signal element. See Figure 3 and 
Table 1. The last two bits of each quadbit define the amplitude 
of the signal element in each phase quadrant. The left-hand 
bits in Table 1 are the first of each pair in the data stream as it 
enters the encoder after the data scrambler. 

Bell 21 2A and CCITT V.22 recommendation modems employ 
the Differential Quadrature Phase Shift Keying (DQPSK) 
technique for modulating the carrier. The data stream to be 
transmitted is divided into groups of two consecutive bits 
called dibits. The dibits are encoded as a phase quadrant 
change relative to the quadrant occupied by the preceding 
signal element. The signal element corresponding to XX01 (YO, 
Y1,X2, X3) in the V.22bis signal constellation (Figure 3) is 
transmitted irrespective of the quadrant occupied. This ensu- 
res compatibility with the V.22bis modems operating in a fall- 
back mode of 1200 bps. 
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Table 1. Relative Quadrant Change for Input (XO, X1) 
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Figure 4. PALC20R8Z Device QAIVl Encoder 
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Modems in geostationary satellite links use phase encoding 
schemes such as Binary Phase Shift Keying (BPSK) or Dif- 
ferential Phase Shift Keying (DPSK). QAtVI techniques which 
require both phase and amplitude modulation, are traditionally 
used for deep-space satellite links, and digital radio communi- 
cation applications. 

PALC20R8Z-based QAM Encoder 

Figure 4 shows a PALC20R8Z device implementing a high- 
speed QAM encoder. Four bits of transmit data stream are 
input to the device every symbol clock. For a V.22bis modem, 
the symbol clock rate is 600 baud and the bit rate is 2400 bps. 
However the encoder device can run at several megahertz for 
use in deep-space satellite communications. The design 
utilizes all of the available outputs on the PALC20R8Z. QO and 
10 signals are held in logic high state. The extra inputs on the 
device may be used to control the start-up phase of the 
encoder. 
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Karnaugh Map 
for Next /YO State 



Karnaugh Map 
for Next /Y1 Stale 



Figure 6. Karnaugh Maps for Next States of /YD, /Y1 




Figure 5. Phase Quadrant State Diagram 



It is assumed that a quadbit (XO, X1, X2, X3) of incoming 
transmit data is available every symbol clock. The first two 
bits, XO and X1 , are used to compute the phase quadrant of 
the next output symbol. The quadrants are assigned states 
(represented by YO and Y1).and the state transitions (or the 
differential quadrant change) is described using a state 
diagram shown in Figure 5 and Table 1. The state transition 
outputs are latched at the positive edge of the first symbol 
clock. Equations for YO and Y1 could be derived using 
Karnaugh maps (Figure 6) and the reduced equations 
implemented. Alternatively, PALASM 2 software has a state 
machine syntax which can be used to input the state diagram 
directly (see the design file). The Minimize software module of 
PALASM 2 software is then used to reduce the equations. 



The last two bits, X2 and X3, are latched at the same time as 
the new states are computed (see design file). The output data 
values, I and Q, are combinatorial functions of YO, Y1, X2, and 
X3. These outputs are latched at the positive edge of the next 
symbol clock. Thus the computational delay through the 
encoder is two symbol clocks. If the processing delay is 
critical to the design, it could be reduced to one symbol clock 
by using a PAL20RA10 device because this device allows 
independent clocking of registers. The phase quadrant state 
could then be latched on the positive edge of the symbol clock 
and the output could be available on the negative edge of the 
clock. 
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Figure 7. Photograph of QAM Pattern 
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The values +3, +1,-1, -3 of I and Q (Figure 3) are represented 
by the three-bit binary words, 111, 101, Oil, 001, (12 11 10 or 
Q2 Q1 QO), respectively. These three-bit values are output 
every symbol. This number scheme allows the use of simple 
digital-to-analog conversion as required in the satellite 
applications. For these high-speed applications the pulse- 
shaping filtering and carrier-modulation functions are 
performed in analog domain using operational amplifiers and 
active components. 

The PALC20R8Z device has only eight registered outputs, 
whereas this design needs ten registered outputs (two for the 
phase quadrant state, and eight for the I and Q outputs). 
Fortunately the 10 and QO are always "1" (as defined above) 
and hence can be tied to this state at the input to the next 
stage. In this case the eight registers in the PALC20R8Z 
device are sufficient to implement the full encoder function. 

The design file shows an example of using the state machine 
syntax of PALASM 2 software. The QAM encoder is a standard 



Mealy type of state machine. However, to keep the number of 
product terms per output reasonable (less than eight for the 
PALC20R8Z device), it is implemented as a Moore state 
machine followed by the output decode function (see 
appendix). There is a pipeline delay between the new state 
values and their corresponding outputs. The differential 
quadrant change is implemented in the State section of the 
QAM.PDS file whereas output decoding is done in the 
Equations section of the same file. 

In the laboratory verification, the I (10 to 12) and Q (QO to Q2) 
outputs were connected to two digital-to-analog converters. 
The analog outputs were then connected to the A and B 
channels of an oscilloscope and the X-Y pattern was observed 
(see Figure 7). For high data-rate operation (several MHz), the 
PALC20R8Z device encoder together with a data scrambler 
implemented in hardware facilitates a cost-effective solution 
for a modem transmitter. 
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TITLE QAM_ENCODER 

PATTERN QAM.PDS 

REVISION A 

AUTHOR RAJ PARIPATYADAR , M. GZOWSKI 

COMPANY MONOLITHIC MEMORIES INC., SANTA CLARA 

DATE 5 April 1987 

CHIP QAM PAL20R8 

CK X3 X2 XI XO NC NC NC NC NC NC GND 

/OE NC X3A X2A Yl YO 12 II Q2 Ql NC VCC 



STATE 



MOORE MACHINE 



STATE_0 
STATE_1 
STATE_2 
STATE 3 



= CSl ->STATE_1 + CS2 ->STATE_0 + CS3 ->STATE_3 +->STATE_2 

= CSl ->STATE_2 + CS2 ->STATE_1 + CS3 ->STATE_0 +->STATE_3 

= CSl ->STATE_3 + CS2 ->STATE_2 + CS3 ->STATE_1 +->STATE_0 

= CSl ->STATE + CS2 ->STATE 3 + CS3 ->STATE 2 +->STATE 1 



Equations for the next states represented by YO Yl 
These outputs are latched on the positive edge of the 
first symbol clock 

STATE_0 = /YO * /Yl 

STATE_1 = /YO * Yl 

STATE_2 = YO * Yl 

STATE_3 = YO * /Yl 



CONDITIONS 








CSl = 


/XO 


* 


/Xl 


CS2 = 


/XO 


* 


XI 


CS3 = 


XO 


■k 


XI 


CS4 = 


XO 


* 


/Xl 


EQUATIONS 









; latch values of X2 and X3 on the same symbol clock 
/X3A := /X3 
/X2A := /X2 



The following four outputs are latched on the positive edge 
of the next symbol clock. Therefore the ouput of the whole 
encoder is available after the second symbol clock. 
The value of 10 and QO are HIGH and therefore input to the next 
stage may be tied HIGH. 



/I2 := STATE_0 + STATE_1 
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/II := STATE_0* X3A 
+ STATE_1* X2A 
+ STATE_2*/X3A 
+ STATE_3*/X2A 

/Q2 := STATE + STATE 3 



/Ql := STATE_0* X2A 
+ STATE_1*/X3A 
+ STATE_2*/X2A 
+ STATE 3* X3A 



SIMULATION 
TRACE ON 



OE XO XI X2 
II 12 Ql Q2 



X3 



X2a X3a YO Yl CK 



SETF CK OE ; Preset polarity and enable output 

PRLDF X3A X2A /Yl /YO 12 II Q2 Ql 

/check the state transitions from 
SETF /XO /Xl /X2 /X3 /CK 

; state Y0=0 Y1=0 with input 
CLOCKF ;X0=0 and X1=0 

CLOCKF ;X2=0 and X3=0 sets the smallest 

CLOCKF ; vector amplitude 

CLOCKF /output available on the negative 

;edge of clock 



PRLDF X3A X2A /Yl /YO 12 II Q2 Ql 

/check the state transitions from 
SETF XO XI /state Y0=0 Y1=0 with inputs 

CLOCKF /X0=1 and Xl=l 

CLOCKF 
CLOCKF 
CLOCKF 

PRLDF X3A X2A /Yl /YO 12 II Q2 Ql 

/check the state transitions from 
SETF XO /Xl /state Y0=0 Y1=0 with inputs 

CLOCKF /X0=1 and X1=0 

CLOCKF 

PRLDF X3A X2A Yl /YO 12 II Q2 Ql 

/check the state transitions from 
CLOCKF /state Y0=0 Yl=l with inputs 

CLOCKF /X0=1 and X1=0 

PRLDF X3A X2A /Yl /YO 12 II Q2 Ql 

/check the state transition from 
SETF /XO XI /state Y0=0 Y1=0 with inputs 

CLOCKF /X0=0 and Xl=l 
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PRLDF X3A X2A Yl /YO 12 II Q2 Ql 

; check the state transitions from 
CLOCKF ; state Y0=0 Yl=l with inputs 

;X0=0 and Xl=l 

PRLDF X3A X2A /Yl YO 12 II Q2 Ql 

; check the state transitions from 
CLOCKF ; state Y0=1 Y1=0 with inputs 

;X0=0 and Xl=l 

PRLDF X3A X2A Yl YO 12 II Q2 Ql 

; check the state transitions from 
CLOCKF ; state Y0=1 Yl=l with inputs 

;X0=0 and Xl=l 



PRLDF X3A X2A /Yl /YO 12 II Q2 Ql 

In state Y0=0 Y1=0 

check the 4 vector amplitudes 



SETF 

CLOCKF 

CLOCKF 


/XO XI /X2 / 


SETF 

CLOCKF 

CLOCKF 


/X2 X3 


SETF 

CLOCKF 

CLOCKF 


X2 /X3 


SETF 

CLOCKF 

CLOCKF 


X2 X3 


SETF 

CLOCKF 

CLOCKF 


/X2 /X3 


TRACE OFF 
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The continuing drive for higher communication data rates is 
resulting in the development of data modems complying with 
CCITT specification V.32. This standard uses the Quadrature 
Amplitude Modulation (QAM) encoding scheme to achieve a 
data rate of 9600 bps while restricting the modulation rate to 
2400 Hz. However, this high rate of transmission is sensitive to 
random errors caused by Gaussian noise on the telephone 
lines. Forward error correction by means of convolution 
encoding and Viterbi decoding improves the bit-error rate 
performance in the presence of white Gaussian noise. The 
CCITT V.32 (see Reference 1) specification provides for the 
optional use of a convolution encoding scheme whenever the 
background noise is high. A PAL20X8A and a PAL20RS8 can 
implement this complex encoder function in a cost-effective 
manner, thereby providing a simple way of implementing the 
transmitter of a V.32 modem. This design, using PAL devices, 
provides a way of implementing a separate modem transmitter 
synchronized only to the transmit clock. The receiver of the 
modem can then be implemented on a separate digital signal 
processing chip synchronized to the receive clock. 

Bmplementation of the 
V.32 Convolution Encoder 

Figure 1 shows the position of the encoder function within the 
simplified V.32 modem transmitter. Incoming data is scrambled 
to randomize the data and then passed through the encoder. 
The In-phase and Qut-of-phase (I, Q) output data streams are 
passed through pulse-shaping filters before being multiplied with 
the sine and cosine carrier frequencies. If I and Q values are 
applied to the X and the Y axes of an oscilloscope with the time 



base turned to the X-Y setting, a signal constellation is 
observed. This signal pattern (photograph in Figure 8) indicates 
the modulation amplitude and the phase of the outgoing 2400 Hz 
carrier frequency. The standard signal constellation for the V.32 
is the QAM scheme and is shown in Figure 2. This scheme 
produces sixteen states or points on the signal constellation 
map. The diagram may be observed by connecting the I and Q 
outputs of the encoder (Figure 1) to the A,B channels of an 
oscilloscope and turning the time base to the X-Y setting. The 
optional convolutional encoding scheme specified in the V.32 
standard produces thirty-two points in the signal constellation 
(Figure 3). 

The points in Figure 3 are closer to each other than the points in 
the straight QAM scheme depicted in Figure 2, and it may not be 
obvious why the second scheme should give better performance 
in the presence of noise. The convolution encoder, with its 
implicit memory, prevents adjacent (neighbor) points appearing 
in succession. The resulting distance between two successive 
points after convolution encoding is therefore bigger than the 
distance between any two points in Figure 2. This translates to a 
more accurate decision of the received state and hence better 
performance under high Gaussian noise. 

We now consider the implementation of this modem encoder 
with convolution encoding. Figure 4 shows the block diagram of 
the full V.32 encoder with convolution encoding. Figure 5 shows 
the implementation of this circuit in two PAL devices, a 
PAL20X8A and a PAL20RS8. The quadrant-change state 
machine and the convolution encoder are implemented in the 
PAL20X8A and the coordinate mapping of the 32-point signal 
constellation is achieved through the PAL20RS8 device. 



TRANSMIT 
DATA - 
STREAM 



DATA 
SCRAMBLER 



CONNECT TO A, B CHANNELS OF 

OSCILLOSCOPE FOR 

CONSTELLATION PATTERN 



IN - PHASE 
DATA I 



V.32 

CONVOLUTION 

ENCODER 



QUADRATURE 
PHASE DATA (0) 



PULSE 

SHAPING 

FILTER 



PULSE 
SHAPING 
FILTER 
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! 



COSINE (Wet) 



Figure 1 . V.32 Convolution Encoder in a Typical Modem Transmitter 
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QUADRATURE PHASE (Q) 
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Figure 2. QAM Signal Pattern (Y1 ,Y2,X3,X4) 
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Figure 3. 32-Point Signal Pattern (Y0,Y1,Y2,X3,X4) 
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Figure 4. Full V.32 Convolution Encoder (9600 bps) 
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The Encoders in the 

PAL20X8A Device 

The scrambled data stream to be transmitted is divided into 
groups of four consecutive data bits. As shown in Figure 4, the 
first two bits arrive in time X1, X2 in each group are first 
differentially encoded into Y1 and Y2, according to Table 1 . 

These state transitions are mapped on Karnaugh maps (Figure 
6), and the reduced equations are derived and implemented in 
the PAL20X8A design. The equation for Y1 turns out to be a 
single Exclusive-OR (XOR) function of the previous state of Y1 
and XO. The Karnaugh map for Y2 (Figure 6b) appears to 
indicate a fairly complex function. However, after a number of 
steps of juggling with the Boolean equations, the final equation 
can be expressed as an XOR function of two product terms. 
The PAL20X8 device offers an efficient implementation of this 
equation. 

The two differentially encoded bits, Y1 and Y2, are then used 
as input to a systematic convolution encoder, which generates 
a redundant bit YO. The convolution encoder circuit of Figure 4 
contains five XOR functions and this requires the use of a PAL 
device containing XOR elements. The XORs feed into each 
other. Combinatorial values DO and D3 are produced (see 
equations for pin 14 and 23 of the PAL20X8A). Additionally, 
registered values D1 and D2 are generated as intermediate 
delay values. Note that some of the XOR elements had to be 
multiplied out, such that the equations for D2 and YO have only 
1 XOR term in the equation. However, product terms feed the 
two inputs of the XOR gate. This arrangement uses the full 
functionality of the PAL20X8A device. 



INPUTS 


PREVIOUS 
OUTPUTS 


OUTPUTS 


XO X1 


Y1n-1 Y2n.i 


Yin Y2n 














1 


1 





1 


1 





1 1 


1 1 


1 





1 


1 


1 





1 


1 


1 1 
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1 1 


1 


1 
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1 


1 


1 1 


1 


1 


1 


1 


1 1 





1 1 





1 1 


1 1 


1 


1 


1 1 


1 





1 1 


1 1 


1 



Table 1. Differential Encoding for V.32 Signal Pattern at 9600 bps 
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Figure 6a. Karnaugh Map for Y1(n) 



Figure 6b. Karnaugh Map for Y2(n) 
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Signal Mapping in PAL20RS8 Device 

The extra bit YD and the four information carrying bits Y1 , Y2, 
<2, and X3 are then mapped into the coordinates of the signal 
3lement to be transmitted according to the signal space 
diagram shown in Figure 3 and listed in Table 2. The coordinate 
/alues for I and Q are each represented as a 4-bit binary 
number. The values (13,12,11,10) 1100, 1011, 1010, 1001, 
1 000, 0111, 0110, 0101, 01 00 represent the magnitudes +4, 
f3, +2, +1, 0, -1, -2, -3, -4 respectively. The values for the Q 
bits have a similar representation. Karnaugh maps (Figure 7) 
are produced for each of the outputs 13 to 10 and Q3 to QO, and 
the reduced equations are implemented in the PAL device 
design. These eight output equations are dependent on only 
five outputs, YO, Y1, Y2, X2, and X3, but one of the outputs 
(12) requires nine product terms. A standard PAL device (e.g., 
PAL20R8) has only eight product terms. A PAL device (such 
as PAL20RS8) which has sixteen product terms shared 
between pairs of outputs would be fine. The PAL20RS8 is 
ideally suited to this application because it allows the nine 
product terms required for calculating 12 to be allocated to this 
output (on pin 17) and the other seven product terms to be 
allocated to the neighboring signal 13 (on pin 18). 



Results 

The output of the first PAL20X8A is available after the first half of 
the symbol clock and the final output is available from the 
second PAL20RS8 after the second half of the symbol clock. 
Thus a delay of only one symbol clock is introduced by the full 
encoder. The two PAL device designs have been verified and 
the 32-state constellation of Figure 3 was observed on an 
oscilloscope. The encoding rate was 2400 baud, but the PAL 
device design will work at 10 MHz for applications that may 
require this speed. 



Summary 

The full V.32 encoder with convolution encoding has been 
implemented in two PAL devices as shown in Figure 5. Both the 
differential quadrant encoder state machine and the 
convolution encoder fit into the PAL20X8A and the signal co- 
ordinate mapping is achieved through the PAL20RS8. This PAL 
device design for the encoder provides a superior way of 
implementing a V.32 modem transmitter synchronized only to 
the transmit clock. 
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Table 2. Signal State Mapping for V.32, 9600 bps 
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Karnaugh Map for 11 
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Karnaugh Map for 13 
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Karnaugh Map for Q1 
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Figure 7. Karnaugh maps for outputs 10-13 and Q0-Q3 
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V.32 trellis encoder PAL#1, 20X8 

This PAL device performs the quadrant change based on the first 

two bits and the the convolution encoding to produce the output yO, 



TITLE V32_1.PDS 
PATTERN V32_ENCODER 
REVISION A 

AUTHOR RAJ PARI PAT YADAR 
COMPANY MMI, SANTA CLARA 
DATE APRIL 3 1987 

CHIP TRELl PAL20X8 

;PINS 1 234 5 6 7 8 9 10 11 12 

CK X3 X2 XI XO NC NC NC NC NC NC GND 

;PINS 13 14 15 16 17 18 19 20 21 22 23 24 

/OE /DO /Dl /D2 NC /YOn /Yin /Y2n X2n X3n /D3 VCC 

EQUATIONS 

; These equations implement the differential quadrant encoder 
;on the bits XO and XI. The outputs are Yin and Y2n. 

Yin := Yin :+: XO 

Y2n := (XO *Yln) :+: (Xl*/Y2n + /Xl*Y2n) 

; These equations produce the extra output YOn. Intermediate values 
;D0,D1,D2 AND D3 need to be calculated first. Dl and D2 are 
; registered outputs. 

Dl := YOn 

D2 := (D0*/D1 + /D0*D1) 

:+: (Yln*/Y2n + /Yln*Y2n) 

YOn := Yin * YOn :+: D3 

DO = Y0n*D2*/Y2n + Y0n*/D2*Y2n 

D3 = D2*/Y2n + /D2*Y2n 



;The following equations are just registered storage of the bits 
;X2 and X3 

/X2n := /X2 ; internal node equation 

/X3n := /X3 ; internal node equation 
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SIMULATION 



TRACE ON XO XI Y2n Yin YOn CK DO Dl D2 D3 X2n X3n 



SETF 
PRLDF 

SETF 



CLOCKF 
CLOCKF 
CLOCKF 
CLOCKF 



OE ; Initialize Output-Enable 

Dl D2 YOn /Yin /Y2n X2n X3n 



XO /Xl /X2 /X3 



; state Yln=0 Y2n=0 with input 

; PRELOAD VALUES ARE OPPOSITE 

;T0 THOSE INTENDED, TO GET AROUND 

;A BUG IN PALASM2 V2 . 2 

; check the state transitions from 

;X0=1 and X1=0 

;X2=0 and X3=0 sets the smallest 

; vector amplitude 

; output available on the negative 

;edge of clock 



SETF 
PRLDF 

CLOCKF 
CLOCKF 
CLOCKF 
CLOCKF 



XO XI 

Dl D2 YOn /Yin /Y2n X2n X3n 

; Check the state transitions from 
; state Yln=0 Y2n=0 with inputs 
;X0=1 and Xl=l 



SETF /XO XI 

PRLDF Dl D2 YOn /Yin /Y2n X2n X3n 

; check the state transitions from 
CLOCKF ; state Yln=0 Y2n=0 with inputs 

CLOCKF ;X0=0 and Xl=l 



PRLDF Dl D2 YOn Yin /Y2n X2n X3n 

; check the state transitions from 
CLOCKF ; state Yln=l Y2n=0 with inputs 

CLOCKF ;X0=0 and Xl=l 



SETF /XO /Xl 

PRLDF Dl D2 YOn /Yin /Y2n X2n X3n 

; check the state transition from 
CLOCKF ; state Yln=0 Y2n=0 with inputs 

;X0=0 and X1=0 

PRLDF Dl D2 YOn /Yin Y2n X2n X3n 

; check the state transitions from 
CLOCKF ; state Yln=0 Y2n=l with inputs 

;X0=0 and X1=0 
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PRLDF Dl D2 YOn Yin /Y2n X2n X3n 

; check the state transitions from 
CLOCKF ; state Yln=l Y2n=0 with inputs 

;X0=0 and X1=0 

PRLDF Dl D2 YOn Yin Y2n X2n X3n 

/•check the state transitions from 
CLOCKF ; state Yln=l Y2n=l with inputs 

;X0=0 and X1=0 

TRACE OFF 
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;V.32 trellis encoder PAL#2, 20RS8. 

;This PAL performs the signal mapping onto the 32 

; state constellation according to CCITT V.32, 9600 bps 

/•specification. 



TITLE 

PATTERN 

REVISION 

AUTHOR 

COMPANY 

DATE 



V32_ENCODER 

V32_2.PDS 

A 

RAJ PARI PAT YADAR 

MMI, SANTA CLARA 

APRIL 1 1987 



CHIP TREL2 PAL20RS8 



;PINS 12 3 4 5 6 

CLK X3a X2a /Y2 /Yl /YO 

;PINS 13 14 15 16 17 18 

/OE INVSCK 10 II 12 13 



7 


8 


9 


10 


11 


12 


NC 


NC 


NC 


NC 


SCKIN 


GND 


19 


20 


21 


22 


23 


24 


QO 


Ql 


Q2 


Q3 


SCK 


VCC 



EQUATIONS 




13 


; = 


/YlVY2*X3a 




+ 


/Y0V'!^l*X2a*/X3a 




+ 


/Y0*Y2VX2a 




+ 


Yl*/Y2*X2a 




+ 


Y0*Yl*/Y2VX3a 




+ 


Y0*Yl*Y2*/X2a*X3a 




+ 


yO*/Yl*Y2*/X3a 


12 


: = 


/Y0*/Yl*/X2a*/X3a 




+ 


/Y0*/Yl*X2a*X3a 




+ 


/Y0*Yl*Y2*X2a 




+ 


/Y0*Y1*/Y2*/X2a 




+ 


Yl*/Y2*/X2a*X3a 




+ 


Y0*Yl*Y2VX3a 




+ 


Y0*Y2*X2a*X3a 




+ 


Y0*/Yl*Y2*X3a 




+ 


Y0*/Yl*/Y2*/X3a 


11 


• :s 


/Y0*Y1 




+ 


Y0*Y2 



;IN-PHASE OUTPUT 
;(I0— 13) 



10 



:= YO 



Q3 := /Y0*/Yl*/Y2*/X3a 

+ /Yl*/Y2*X2a 

+ /Y0*Y2*/X2a*X3a 

+ /Y0*Yl*Y2*X3a 

+ Yl*/Y2*/X3a 

+ Y0*Yl*/X2a*X3a 

+ Y0*Yl*Y2*X2a 



; QUADRATURE OUTPUT 
;(Q0-Q3) 
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+ Y0*/Y1*Y2*/X2a 

Q2 := /Yl*/Y2*/X2a*X3a 

+ /Y0*Y2*/X2a*/X3a 

+ /Y0*Y2*X2a*/X3a 

+ /Yl*Y2*X2a 

+ /Y0*Yl*/'^2*X3a 

+ Y0*Y1*/X2a*/X3a 

+ Y0*Yl*X2a*X3a 

+ Y0*/"^l*/'^2*/X2a 



Ql := /Y0*/"^1*Y2 

+ /Y0*Y1VY2 

+ YOV'i^l 

QO := /YO 



INVSCK = /SCKIN 

SCK = SCKIN 

SIMULATION 

TRACE ON YO Yl Y2 X2a X3a 



/•GENERATE /SYMBOL CLOCK 
/GENERATE SYMBOL CLOCK 





10 


11 


12 


13 QO Ql 


SETF OE /CLK 








SETF 


/YO 
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Applications of 4B3T Line Coding 

Line coding devices are required in line terminals and line re- 
peaters to convert binary data into a proper coded format on the 
transmission lines and vice versa (see Figure 1 ). 

4B3T line coding is used for the Integrated Services Digital Net- 
work (ISDN) in Germany. Also, 4B3T is used on T-carrier trans- 
mission lines for two 24-channel DS-1 signals (forty-eight chan- 
nels). Because 4B3T coding decreases transmission 
requirements by twenty-five percent, transmission rates for two 
24-channel DS-1 signals are only fifty percent more than one 24- 
channel DS-1 signals. The 4B3T is used on the T148 transmis- 
sion line developed by ITT Telecommunication. 

Bipolar Codes 

Bipolar coding is a popular coding scheme over long transmis- 
sion lines. Bipolar codes use three different voltages (+5 V, V, 
and -5 V) to represent logic one (1) and logic zero (0) (see Fig- 
ure 2a). Logic-one signals alternate between +5 V and -5 V to al- 
low the mean DC level to be integrated to zero volts. Therefore, 
the bipolar code can compensate for DC wander on the trans- 
mission lines. However, the bipolar code does not take advan- 
tage of using three logic levels. Thus, binary bipolar code is 
called a pseudo-ternary code. 

Ternary Codes 

Ternary code takes advantage of all three logic levels. Each bit 
of the ternary code represents a positive logic one (+), a logic 



zero (0), or a negative logic one (-) (see Figure 2b). These three 
logic levels are mapped to the three voltage levels. 

On the transmission lines, the power of three (3*^) codes are car- 
ried using the ternary code and the power of two (2") codes are 
carried using the binary code. For example, two bits of binary (0, 
1 ) codes only provide four combinations of data (00, 01,10, and 
11). But two bits of ternary (-, 0, +) codes provide nine combina- 
tions of data (00, 0-, -0, ~, -+, 0+, +0, +-, and ++). The ternary 
code is more efficient than the binary code because more infor- 
mation can be represented on the transmission lines. Three bi- 
polar bits are capable of representing eight codes. For ternary 
code, only two bits are required to represent eight codes as 
shown in table 1. 

Comparison of Binary Codes and 
Ternary Codes 

4B3T is a ternary code which maps four continuous binary bits 
into three ternary bits. To transmit twenty binary data bits, twenty 
bipolar bits are needed. But with ternary code only fifteen bits 
are needed. A twenty-five percent saving on bits is made using 
4B3T line coding on the transmission lines. Without increasing 
the transmission rate, ternary coding carries more information 
than bipolar coding. Four binary bits on the computer system will 
run at the same rate as three ternary bits on the transmission 
lines using the 4B3T line coding. For example, a group of four bi- 
nary bits in a 10-MHz computer system. Substituted to a group 
of three ternary bits at 7.5 I^Hz on a transmission line. 
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Figure 1 . Line Coding Devices in a Line Tenninal and a Line Repeater 
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Table 1. Eight Codes Represented by Three Bipolar Bits or 
Two Ternary Bits 

4B3T Line Coding 

Four binary bits represent sixteen combinations of data, but 
three ternary bits represent twenty-seven combinations of data. 
Except for all zeros code, thie remaining combinations of ternary 
codes can be categorized into thiree groups: zero, positive, and 
negative accumulated disparity. A zero-accumulated disparity is 
a set of ternary codes tfiat fiave the same number of positive 
codes and negative codes, e.g., the accumulated disparity of O-i- 
is zero (0). A positive accumulated disparity is a set of ternary 
codes that have more positive codes than negative codes, e.g., 
the accumulated disparity of the +-i- is positive (+1 ). A negative 
accumulated disparity is a set of ternary codes that have more 
negative codes than positive codes, e.g., the accumulated dis- 
parity of the 0" is negative (-2). The 4B3T line coding with accu- 
mulated disparity is shown in Table 2. Six sets of four binary bits 
have zero-accumulated disparity. The remaining ten sets of four 
binary bits map into words with accumulated disparity. 

Four binary bits can be converted into any one of four possible 
ternary codes. This is normally represented by four columns, as 
shown in Table 3. Figure 3 is the state diagram of the next col- 
umn generation. Four possible columns are generated for next 
ternary codes. The column number of the next word depends on 
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the previous column and the current accumulated disparity. If 
the current disparity is zero, the number of the next column is the 
same as the previous one. If the current disparity is positive, the 
column number for the next word is found by adding the current 
disparity. Likewise, if the current disparity is negative, the num- 
ber for the the next column is found by subtracting the current 
disparity. In column 1, the next word may only have positive or 
zero disparity since the maximum of -3 negative disparity has 
been reached. Similarly, in column 4, the next word will have 
negative or zero disparity. 
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Table 2. 4B3T Line Coding with Accumulated Disparity 

The 4B3T line coding is represented in the four different col- 
umns of Table 3 and shows that a 4-bit binary word can be con- 
verted to any of four different sets of a 3-bit ternary word. Each 
set of a 3-bit ternary word includes the substituted 3-bit ternary 
code and the column number for the next ternary word. 



Figure 3. 4B3T Next Column Generation State Diagram 
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Denotes next new column. 



Table 3. 4B3T Line Coding 



4B3T Line Encoder Implementation 

A 4B3T encoder converts binary data into two signals: a positive 
ternary signal and a negative ternary signal. These two signals 
generate a three-voltage level signal for the transmission line. 
The 4B3T encoder divides the incoming binary data stream into 
4-bit blocks. This 4-bit word is converted to a 3-bit ternary word. 
The 4-bit binary word and the number of the column generate a 
ternary word. 

Figure 4 is the block diagram of the 4B3T encoder. It consists of 
a 4-bit serial-in parallel-out shift register, a 2-bit counter, a 6-bit 
input and an 8-bit output look-up table converter, and two 3-bit 
parallel-in serial-out shift registers. Two Programmable Array 
Logic (PAL) devices and one registered PROM device imple- 
ment the 4B3T encoder (see Figure 5). The first PALC16R6Z 
performs a 4-bit serial-in parallel-out shift register and a 2-bit 
counter. A 4-bit shift register (SR3, SR2, SRI, and SRO) con- 
verts the serial data into a 4-bit parallel word. A 2-bit counter pro- 
vides the code conversion's clock rate which is one-quarter the 
speed of the serial-in parallel-out shift register's clock rate. The 
63RS481A implements the 6-input and 8-output look-up table 
converter. The 4B3T encoding conversion table is shown in Fig- 
ure 6. The 4-bit binary word (B3, B2, B1 , and BO) and 2-bit cur- 
rent column numbers (CI , CO) generate 8-bit output (PT2, PT1 , 
PTO, NT2, NT1, NTO, NCI, and NGO). NCI and NCO are the 
next column numbers. They are fed into the 63RS481A as cur- 
rent column numbers for the next operation. The remaining six 
outputs are divided into two groups. One group is the positive 
ternary signals (PT2, PT1, and PTO). The second group is the 



negative ternary signals (NT2, NT1, and NTO). The second 
PALC16R6Z provides two 3-bit parallel-in serial-out shift regis- 
ters at three-quarters external clock rate. PT2, PT1 , and PTO are 
serialized to generate the positive ternary signal (P4B3T). Like- 
wise, NT2, NT1 , and NTO are serialized to produce the negative 
ternary signal (N4B3T). 

4B3T Line Decoder implementation 

A 4B3T decoder detects three continuous pairs of positive ter- 
nary signals (P4B3T) and negative ternary signals (N4B3T) and 
converts them to 4-bit binary words. Figure 7 is the block dia- 
gram of the 4B3T decoder. It includes a 4B3T decoding state 
machine and an NRZdata generation. 

A 5-bit state machine generates four inputs for the parallel-in se- 
rial-out shift register and a load signal. A PALC22V10 is used for 
the 4B3T decoding state machine because it has enough prod- 
uct terms and flexible outputs. The PALC22V10's output has 
programmable register bypass and programmable output polari- 
ty features. 04 generates shift/load signal for the 4-bit parallel-in 
serial-out shift register (HCT 195). This shift/load signal will be 
assertive low only at every three clock cycles. The remaining 
four register outputs (03, 02, 01 , and OO) are parallel inputs for 
the 4-bit parallel-in serial-out shift register (see Figure 8). This 
shift register clocks at four-thirds external clock rate. The decod- 
ing state machine diagram is shown in Figure 9. An error flag is 
generated when positive ternary signals and negative ternary 
signals both are high. The error flag also detects three continu- 
ous pairs of zero ternary signals. 
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Figure 4. Block Diagram of 4B3T Line Encoding 
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ERROR 
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Figure 9. 4B3T Decoding State Machine 
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Title 4B3T ENCODER LOGICl 

Pattern EN4B3Tl.pds 

Revision A 

Author Cindy Lee 

Company Monolithic Memories 

Date 4/2/87 

CHIP EN4B3T1 PAL16R6 

;PIN1 PIN2 PINS PIN4 PINS PIN6 PIN7 PINS PIN9 PINIO 
CLK MR NRZIN NC NC NC NC NC NC GND 

;PIN11 PIN12 PIN13 PIN14 PIN15 PIN16 PIN17 PIN18 PIN19 PIN20 
/OE CK4_LD CQO CQl BO Bl B2 B3 NC VCC 

INPUT SIGNALS 

CLK : EXTENAL CLOCK 

MR : MASTER RESET 

NRZIN : NON-RETURN-ZERO INPUT SIGNAL 

/OE : ASSERTIVE LOW OUTPUT ENABLE 

OUTPUT SIGNALS 

CK4_LD : 1/4* EXTERNAL CLOCK AND LOAD SIGNAL 
CQO, CQl : 2-BIT COUNTER STATE VARIABLES 
BO, Bl, B2, B3 : FOUR PARALLEL BINARY DATA OUTPUTS 



EQUATIONS 

;THE 16R6 GENERATES: 

;(1) A 4-BIT SERIAL-IN PARALLEL-OUT SHIFT REGISTER 

/BO := /Bl ; LSB OF 4-BIT PARALLEL OUTPUTS 

+ MR ; MASTER RESET 

/Bl := /B2 

+ MR 

/B2 := /B3 

+ MR 

/B3 := /NRZIN ; MSB OF 4-BIT PARALLEL OUTPUTS 
+ MR 

;(2) A 1/4 CLOCK AND LOAD/SHIFT SIGNAL 

/CQO := /CQl ; LSB OF 2 -BIT COUNTER 

+ MR ; MASTER RESET 

/CQl := CQO ; MSB OF 2 -BIT COUNTER 
+ MR 

/CK4_LD = CQO ; 1/4 * EXTERNAL CLOCK 

+ CQl ; AND A LOAD/SHIFT SIGNAL 
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SIMULATION 

TRACE_ON MR CLK NRZIN 

SETF MR OE 
CLOCKF CLK 



BO Bl B2 B3 CQO CQl CK4_LD 
; RESET CONDITION 



SETF NRZIN 
CLOCKF CLK 



SETF /MR NRZIN 
CLOCKF CLK 

SETF /NRZIN 
CLOCKF CLK 



; DISABLE THE MASTER RESET 
; SHIFT A POSITIVE NRZ DATA 

; SHIFT TWO NEGATIVE NRZ DATA 



SETF /NRZIN 
CLOCKF CLK 

SETF NRZIN 
CLOCKF CLK 



; SHIFT TWO POSITIVE NRZ DATA 



SETF NRZIN 
CLOCKF CLK 

SETF /NRZIN 
CLOCKF CLK 

SETF NRZIN 
CLOCKF CLK 



; SHIFT A NEGATIVE NRZ DATA 



; SHIFT A POSITIVE NRZ DATA 



SETF /NRZIN 
CLOCKF CLK 

SETF /NRZIN 
CLOCKF CLK 



; SHIFT THREE NEGATIVE NRZ DATA 



SETF /NRZIN 
CLOCKF CLK 

SETF NRZIN 
CLOCKF CLK 

SETF /NRZIN 
CLOCKF CLK 

SETF NRZIN 
CLOCKF CLK 



; SHIFT A POSITIVE NRZ DATA 



; SHIFT A NEGATIVE NRZ DATA 



; SHIFT THREE POSITIVE NRZ DATA 



SETF NRZIN 
CLOCKF CLK 

SETF NRZIN 
CLOCKF CLK 

TRACE OFF 
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PLE9R8 ; PROM 63RS481A 
TABLE . PLE 

6 -BIT TO 8 -BIT TABLE CONVERTER 
MMI, SANTA CLARA, CA 



PLE DESIGN SPECIFICATION 
CINDY LEE 3/15/87 



PIN 8 7 6 5 4 3 

AO Al A2 A3 A4 A5 

ADD CO CI B3 B2 Bl BO 



PIN 9 10 11 13 14 15 16 17 
Ql Q2 Q3 Q4 Q5 Q6 Q7 Q8 
DAT PTO PTl PT2 NTO NTl NT2 NCI NCO 
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FUNCTION TABLE 

BO Bl B2 B3 CI CO PTO PTl PT2 NTO NTl NT2 NCI NCO 

; BINARY INPUT, COLUMN, OUTPUT NEW COLUMN 

;B0 Bl B2 B3 CI CO, PTO PTl PT2 NTO NTl NT2 NCI NCO 
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DESCRIPTION 



THIS REGISTERED PROM (63RS481A) IMPLEMENTS A 6-TO-8 TABLE 
CONVERTER. SIX INPUTS ARE BO, Bl, B2, B3 (BINARY INPUTS), 
AND CI, CO (CURRENT COLUMN NUMBER). BECAUSE THESE FOUR 
PARALLEL BINARY INPUTS ARE FROM THE 4-BIT SERIAL-IN 
PARALLEL-OUT SHIFT REGISTER, THIS PROM (63RS481A) MUST 
OPERATE AT A QUARTER OF THE SHIFT REGISTER'S RATE. EIGHT 
OUTPUTS ARE PTO, PTl, PT2, NTO, NTl, NT2 , NCI, AND NCO. THE 
PTO, PTl, AND PT2 ARE POSITIVE TERNARY SIGNALS WHICH FORMS 
THREE SERIAL P4B3T SIGNAL THROUGH A 3-BIT PARALLEL-IN 
SERIAL-OUT SHIFT REGISTER. LIKEWISE, NTO, NTl, AND NT2 ARE 
NEGATIVE TERNARY SIGNALS. NCI AND NCO ARE NEXT COLUMN 
NUMBERS . 
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Title 4B3T ENCODER L0GIC2 

Pattern EN4B3T2.pds 

Revision A 

Author Cindy Lee 

Company Monolithic Memories 

Date 3/5/87 

CHIP EN4B3T2 PAL16R6 

;PIN1 PIN2 PIN3 PIN4 PINS PIN6 PIN7 PINS PIN9 PINir 
CLK MR PTO PTl PT2 NTO NTl NT2 LOAD GND 

;PIN11 PIN12 PIN13 PIN14 PIN15 PIN16 PIN17 PIN18 PIN19 PIN20 
/OE NC NS2 NSl N4B3T PS2 PSl P4B3T NC VCC 

INPUT SIGNALS 

CLK : EXTERNAL CLOCK 

MR : MASTER RESET 

PTO, PTl, PT2 : THREE POSITIVE TERNARY SIGNALS 

NTO, NTl, NT2 : THREE NEGATIVE TERNARY SIGNALS 

LOAD : LOAD SIGNAL FOR TWO SHIFT REGISTERS 

/OE : ASSERTIVE LOW OUTPUT ENABLE 



OUTPUT SIGNALS 



NS2, NSl 
N4B3T 

PS2, PSl 
P4B3T 



NEGATIVE TERNARY SHIFT REGISTER VARIABLES 
OUTPUT SIGNAL OF NEGATIVE TERNARY SHIFT REGISTER 
NPOSITIVE TERNARY SHIFT REGISTER VARIABLES 
OUTPUT SIGNAL OF POSITIVE TERNARY SHIFT REGISTER 



EQUATIONS 



THE 16R6 GENERATES: 

(1) A 3 -BIT PARALLEL-IN SERIAL-OUT SHIFT REGISTER 
FOR POSITIVE TERNARY SIGNAL 



/P4B3T := /PTO * /PSl 
+ /LOAD * /PSl 
+ /PTO * LOAD 
+ MR 



; SHIFT REGISTER OUTPUT 



/PSl := /PTl * /PS2 
+ /LOAD * /PS2 
+ /PTl * LOAD 
+ MR 



/PS2 



;= /PT2 
+ /LOAD 
+ MR 



;(2) A 3-BIT PARALLEL-IN SERIAL-OUT SHIFT REGISTER 
FOR NEGATIVE TERNARY SIGNAL 



/N4B3T := /NTO * /NSl 
+ /LOAD * /NSl 



SHIFT REGISTER OUTPUT 
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+ /NTO * LOAD 
+ MR 

/NSl := /NTl * /NS2 
+ /LOAD * /NS2 
+ /NTl * LOAD 
+ MR 

/NS2 := /NT2 
+ /LOAD 
+ MR 



SIMULATION 

TRACE_ON MR CLK LOAD PT2 PTl PTO P4B3T NT2 NTl NTO N4B3T 



SETF /LOAD MR OE 
SETF /MR 

SETF LOAD PT2 PTl PTO NT2 NTl NTO 
CLOCKF 



; RESET CONDITION 
; DISABLE MASTER RESET 

; LOAD INPUT DATA 



SETF /LOAD /PT2 /PTl PTO /NT2 /NTl NTO ; SHIFT DATA 
CLOCKF 



SETF PT2 /PTl PTO NT2 /NTl NTO 
CLOCKF 



SHIFT DATA 



SETF LOAD PT2 /PTl PTO NT2 /NTl NTO 
CLOCKF 



LOAD INPUT DATA 



SETF /LOAD PT2 /PTl PTO NT2 /NTl NTO 
CLOCKF 



SHIFT DATA 



SETF /PT2 PTl /PTO NT2 NTl /NTO 
CLOCKF 

SETF LOAD PT2 /PTl PTO /NT2 NTl NTO 
CLOCKF 



SHIFT DATA 
LOAD INPUT DATA 



SETF /LOAD PT2 /PTl PTO /NT2 NTl NTO ; SHIFT DATA 
CLOCKF 

SETF PT2 /PTl PTO /NT2 NTl NTO ; SHIFT DATA 

CLOCKF 

SETF LOAD PT2 PTl /PTO /NT2 /NTl /NTO ; LOAD INPUT DATA 
CLOCKF 

SETF /LOAD PT2 /PTl /PTO NT2 /NTl /NTO ; SHIFT DATA 
CLOCKF 



SETF PT2 /PTl /PTO NT2 /NTl /NTO 
CLOCKF 



; SHIFT DATA 
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SETF LOAD PT2 /PTl /PTO NT2 /NTl /NTO ; LOAD INPUT DATA 
CLOCKF 

SETF /LOAD PT2 /PTl /PTO NT2 /NTl /NTO ; SHIFT DATA 
CLOCKF 

SETF PT2 /PTl /PTO NT2 /NTl /NTO ; SHIFT DATA 
CLOCKF 

TRACE OFF 
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Title 4B3T DECODER LOGIC 

Pattern DE4B3T.pds 

Revision A 

Author Cindy Lee 

Company Monolithic Memories 

Date 3/18/87 

CHIP DE4B3T PAL22V10 

;PIN1 PIN2 PIN3 PIN4 PINS PIN6 
/CLK NC MR P4B3T N4B3T NC 

;PIN7 PINS PIN9 PINIO PINll PIN12 
NC NC NC NC NC GND 



;PIN13 PIN14 PIN15 PIN16 PIN17 PIN18 
NC ERROR NC /Q4 Q3 Q2 

;PIN19 PIN20 PIN21 PIN22 PIN23 PIN24 
Ql QO NC NC NC VCC 
GLOBAL 



INPUT SIGNALS 

/CLK : ASSERTIVE LOW EXTERNAL CLOCK 
MR : MASTER RESET 
P4B3T : POSITIVE TERNARY SIGNAL 
N4B3T : NEGATIVE TERNARY SIGNAL 

OUTPUT SIGNALS 

ERROR : ERROR FLAG 

/Q4 : ASSERTIVE LOW Q4 SIGNAL WHICH IS A /LOAD SIGNAL 
FOR A SHIFT REGISTER 
Q3, Q2, Ql, QO : 4B3T DECODING STATE VARIABLES 



EQUATIONS 

;ERROR SIGNAL IS GENERATED WHEN P4B3T AND N4B3T SIGNALS BOTH 
;ARE HIGH, OR P4B3T AND N4B3T SIGNALS BOTH ARE LOW AT THE 
/TRANSITION FROM STATE4 TO STATED. 



ERROR = P4B3T * N4B3T ; P4B3T AND N4B3T ARE HIGH 

+ /Q4 * /Q3 * /Q2 * /Ql * /QO * /P4B3T * /N4B3T 

;/Q4 lA A ASSERTIVE LOW LOAD SIGNAL. THE FOUR PARALLEL DATA 
;ARE LOADED INTO 4-BIT PARALLEL-IN SERIAL-OUT SHIFT REGISTER 
;WHEN LAOD IS LOW, OTHERWISE, THE DATA IS SHIFTED OUT. 



Q4 



:= /N4B3T * P4B3T * /Ql * Q3 * /Q4 
+ /N4B3T * /QO * Q2 * /Q3 * /Q4 
+ /N4B3T * /QO * Ql * Q3 * /Q4 
+ /N4B3T * /Ql * Q2 * /Q3 * /Q4 
+ N4B3T * /P4B3T * /Ql * Q3 * /Q4 



; LOAD SIGNAL 
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+ /P4B3T 
+ /P4B3T 
+ /P4B3T 
+ /P4B3T 
+ /P4B3T 
GLOBAL. RSTF = MR 



/QO * Q2 * /Q3 * /Q4 
/QO * Ql * Q3 * /Q4 
/Ql * /Q2 * Q3 * /Q4 
/Ql * Q2 * /Q3 * /Q4 
QO * /Ql * Q3 * /Q4 



;Q3, Q2, Ql, AND QO ARE THE FOUR BINARY SIGNALS WHICH ARE 
; CONVERTED FROM THTREE TERNARY CODE WORD. 



Q3 



Q2 



Ql 



:= /N4B3T * P4B3T * /QO * /Ql * /Q2 


* /Q3 


+ /N4B3T * P4B3T * QO * /Q2 * Q3 


; MSB OF THE FOUR 


+ /N4B3T * QO * Ql * /Q2 * /Q4 


; BINARY SIGNALS 


+ /N4B3T * Ql * /Q2 * Q3 * /Q4 




+ /N4B3T * P4B3T * Q4 




+ /P4B3T * /QO * /Q2 * Q3 * /Q4 




+ N4B3T * /P4B3T * /QO * Q2 * /Q4 




+ /P4B3T * /QO * Ql * Q3 * /Q4 




+ N4B3T * /P4B3T * /Ql * Q2 * /Q4 




+ N4B3T * /P4B3T * /Ql * Q3 * /Q4 




+ /P4B3T * QO * Ql * /Q2 * /Q4 




:= /N4B3T * /QO * Ql * Q3 * /Q4 




+ /N4B3T * P4B3T * Q2 * /Q3 * /Q4 




+ /N4B3T * QO * Ql * /Q3 * /Q4 




+ N4B3T * /P4B3T * /QO * /Ql * /Q3 




+ /P4B3T * /QO * Q2 * /Q3 * /Q4 




+ N4B3T * /P4B3T * /QO * Q2 




+ /P4B3T * QO * Ql * /Q3 * /Q4 




+ N4B3T * /P4B3T * Q4 




:= /N4B3T * /QO * /Ql * /Q2 * /Q3 




+ /N4B3T * P4B3T * /QO * /Ql * Q2 




+ /N4B3T * QO * /Ql * Q2 * /Q3 




+ /N4B3T * P4B3T * QO * Ql * /Q2 




+ /N4B3T * P4B3T * Q2 * /Q3 




+ /N4B3T * Q4 




+ /P4B3T * /QO * /Ql * /Q2 * /Q3 




+ /P4B3T * /QO * Ql * /Q2 * Q3 




+ N4B3T * /P4B3T * /QO * Q3 




+ /P4B3T * QO * /Ql * /Q2 * Q3 




+ /P4B3T * Q4 





QO 



;= /N4B3T * /P4B3T * /QO * Ql ■ 
+ /N4B3T * /QO * /Ql * /Q3 
+ /N4B3T * /QO * Ql * Q2 * Q3 



+ /N4B3T * P4B3T 
+ /N4B3T * P4B3T 
+ /N4B3T * Q4 
+ N4B3T * /P4B3T 
+ /P4B3T * /Ql * 
+ N4B3T * /P4B3T 
+ N4B3T * /P4B3T 
+ /P4B3T * Q4 



QO 
/Ql 



* /Ql * 
* Q2 * 



Q3 



Q3 
Q3 



* /QO * /Ql 
Q2 * /Q3 

* QO * Ql * /Q2 

* QO * Q2 * /Q3 



LSB OF THE FOUR 
BINARY SIGNALS 
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SIMULATION 

TRACE_ON MR /CLK P4B3T N4B3T /Q4 Q3 Q2 Ql QO ERROR 

;T0 RESET WHOLE DEVICES 

SETF MR ; STATE # = Q4(L0AD) Q3 Q2 Ql QO & ERROR 

;TEST VECTOR #1 TO CHECK SO, SI, S4, S13, AND S16 



SETF /MR P4B3T N4B3T 
CLOCKF CLK 

SETF /P4B3T /N4B3T 
CLOCKF CLK 

SETF /P4B3T /N4B3T 
CLOCKF CLK 

SETF /P4B3T N4B3T 
CLOCKF CLK 

SETF /P4B3T /N4B3T 
CLOCKF CLK 

SETF /P4B3T /N4B3T 
CLOCKF CLK 

SETF P4B3T /N4B3T 
CLOCKF CLK 

SETF /P4B3T /N4B3T 
CLOCKF CLK 

SETF /P4B3T /N4B3f 
CLOCKF CLK 

SETF /P4B3T /N4B3T 
CLOCKF CLK 



;S0 = 0, ERROR 
/•INITIAL CONDITION 

;S1 = 1 1 

;S4 = 1 1 

;S13 =11111 

; SI = 1 1 

;S4 ==01100 

;S14 =10 11 

;S1 =00011 

;S4 = 1 1 

; so = 0, ERROR 



;TEST VECTOR #2 TO CHECK SI, S5, S15, S16, AND S17 

; SI = 1 1 



SETF /P4B3T /N4B3T 
CLOCKF CLK 

SETF /P4B3T N4B3T 
CLOCKF CLK 

SETF /P4B3T /N4B3T 
CLOCKF CLK 

SETF /P4B3T /N4B3T 
CLOCKF CLK 

SETF /P4B3T N4B3T 
CLOCKF CLK 



;S5 = 1 1 1 
;S15 =10 

; SI = 1 1 
;S5 = 1 1 1 
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SETF /P4B3T N4B3T ;S16 =110 

CLOCKF CLK 

SETF /P4B3T /N4B3T ;S1 = 1 1 

CLOCKF CLK 



SETF /P4B3T N4B3T ;S5 = 1 1 1 

CLOCKF CLK 

SETF P4B3T /N4B3T ;S17 =10 1 

CLOCKF CLK 

;TEST VECTOR #3 TO CHECK SI, S6, S18, S19, AND S20 

SETF /P4B3T /N4B3T ;S1 = 1 1 

CLOCKF CLK 

SETF P4B3T /N4B3T ;S6 = 1 1 1 

CLOCKF CLK 

SETF /P4B3T /N4B3T ;S18 =1110 1 

CLOCKF CLK 

SETF /P4B3T /N4B3T ;S1 = 1 1 

CLOCKF CLK 

SETF P4B3T /N4B3T 7S6 = 1 1 1 

CLOCKF CLK 

SETF /P4B3T N4B3T ;S19 =11110 

CLOCKF CLK 

SETF /P4B3T /N4B3T ;S1 = 1 1 

CLOCKF CLK 

SETF P4B3T /N4B3T ;S6 = 1 1 1 

CLOCKF CLK 

SETF P4B3T /N4B3T ;S20 =10 10 1 

CLOCKF CLK 

;TEST VECTOR #4 TO CHECK S2, S7, S20, S18, AND S21 

SETF /P4B3T N4B3T ;S2 = 1 1 1 

CLOCKF CLK 

SETF /P4B3T /N4B3T ;S7 = 1 

CLOCKF CLK 

SETF /P4B3T /N4B3T ;S20 =10 10 1 

CLOCKF CLK 

SETF /P4B3T N4B3T ;S2 = 1 1 1 

CLOCKF CLK 
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SETF /P4B3T /N4B3T 
CLOCKF CLK 

SETF /P4B3T N4B3T 
CLOCKF CLK 

SETF /P4B3T N4B3T 
CLOCKF CLK 

SETF /P4B3T /N4B3T 
CLOCKF CLK 

SETF P4B3T /N4B3T 
CLOCKF CLK 

;TEST VECTOR #5 TO CHECK SO 

SETF /P4B3T N4B3T 
CLOCKF CLK 

SETF /P4B3T /N4B3T 
CLOCKF CLK 

SETF P4B3T N4B3T 
CLOCKF CLK 

SETF /P4B3T N4B3T 
CLOCKF CLK 

SETF /P4B3T N4B3T 
CLOCKF CLK 

SETF /P4B3T /N4B3T 
CLOCKF CLK 

SETF /P4B3T N4B3T 
CLOCKF CLK 

SETF /P4B3T N4B3T 
CLOCKF CLK 

SETF /P4B3T N4B3T 
CLOCKF CLK 

SETF /P4B3T N4B3T 
CLOCKF CLK 

SETF /P4B3T N4B3T 
CLOCKF CLK 

SETF P4B3T /N4B3T 
CLOCKF CLK 

SETF /P4B3T N4B3T 
CLOCKF CLK 



S7 = 1 
S18 =1110 1 
S2 = 1 1 1 
S7 = 1 

521 =10 111 

S2, S7, S8, S22, AND S23 
S2 = 1 1 1 

57 = 1 

SO = 0, ERROR 
S2 = 1 1 1 

58 = 1 1 
S14 =10 11 
S2 = 1 1 1 
SB = 1 1 

522 =110 1 
S2 = 1 1 1 
SB = 1 1 

523 =10 110 
S2 = 1 1 1 
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SETF /P4B3T N4B3T 
CLOCKF CLK 

SETF P4B3T N4B3T 
CLOCKF CLK 

;TEST VECTOR #6 TO CHECK S2 

SETF /P4B3T N4B3T 
CLOCKF CLK 

SETF P4B3T /N4B3T 
CLOCKF CLK 

SETF /P4B3T /N4B3T 
CLOCKF CLK 

SETF /P4B3T N4B3T 
CLOCKF CLK 

SETF P4B3T /N4B3T 
CLOCKF CLK 

SETF /P4B3T N4B3T 
CLOCKF CLK 

SETF /P4B3T N4B3T 
CLOCKF CLK 

SETF P4B3T /N4B3T 
CLOCKF CLK 

SETF P4B3T /N4B3T 
CLOCKF CLK 

;TEST VECTOR #7 TO CHECK S3 

SETF P4B3T /N4B3T 
CLOCKF CLK 

SETF /P4B3T /N4B3T 
CLOCKF CLK 

SETF /P4B3T /N4B3T 
CLOCKF CLK 

SETF P4B3T /N4B3T 
CLOCKF CLK 

SETF /P4B3T /N4B3T 
CLOCKF CLK 

SETF /P4B3T N4B3T 
CLOCKF CLK 

SETF P4B3T /N4B3T 
CLOCKF CLK 



58 = 1 1 

SO = 0, ERROR 

S9, S24, S25, AND S23 
S2 = 1 1 1 

59 = 1 1 

524 =10 10 
S2 = 1 1 1 
S9 = 1 1 

525 =1110 

52 = 1 1 1 
S9 = 1 1 
S23 =10 110 

SIO, S16, S26, AND S15 

53 = 1 1 1 

SIO =01000 
S16 =110 
S3 = 1 1 1 
SIO =01011 

526 =110 11 
S3 = 1 1 1 
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SETF /P4B3T /N4B3T 
CLOCKF CLK 

SETF P4B3T /N4B3T 
CLOCKF CLK 

;TEST VECTOR #8 TO CHECK S3 

SETF P4B3T /N4B3T 
CLOCKF CLK 

SETF /P4B3T N4B3T 
CLOCKF CLK 

SETF /P4B3T /N4B3T 
CLOCKF CLK 

SETF P4B3T /N4B3T 
CLOCKF CLK 

SETF /P4B3T N4B3T 
CLOCKF CLK 

SETF /P4B3T N4B3T 
CLOCKF CLK 

SETF P4B3T /N4B3T 
CLOCKF CLK 

SETF /P4B3T N4B3T 
CLOCKF CLK 

SETF P4B3T /N4B3T 
CLOCKF CLK 

;TEST VECTOR #9 TO CHECK SO 

SETF P4B3T /N4B3T 
CLOCKF CLK 

SETF P4B3T /N4B3T 
CLOCKF CLK 

SETF /P4B3T /N4B3T 
CLOCKF CLK 

SETF P4B3T /N4B3T 
CLOCKF CLK 

SETF P4B3T /N4B3T 
CLOCKF CLK 

SETF /P4B3T N4B3T 
CLOCKF CLK 

SETF P4B3T /N4B3T 
CLOCKF CLK 



510 =01000 

S15 =10 

Sll, S27, S28, »ND S22 
S3 = 1 1 1 

511 =01001 

527 =10 10 
S3 = 1 1 1 
Sll =01001 

528 =110 10 
S3 = 1 1 1 

511 =01001 

S22 =110 1 

S3, S12, S13, S28, AND S25 
S3 = 1 1 1 

512 =01010 

513 =11111 
S3 = 1 1 1 
S12 =01010 
S28 =110 10 
S3 = 1 1 1 
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SETF P4B3T /N4B3T ;S12 =01010 

CLOCKF CLK 

SETF P4B3T /N4B3T ;S25 =1110 

CLOCKF CLK 

SETF P4B3T /N4B3T ; S3 = 1 1 1 

CLOCKF CLK 

SETF P4B3T N4B3T ; SO = 0, ERROR 

CLOCKF CLK 



TRACE OFF 
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IMancliester Encoding 

The PALC22V10 has been programmed to give a Manchester 
encoded output for eight digital TTL-level inputs or a bit-serial 
TTL level input. The reason for using such encoding is to 
remove the DC component from a digital waveform prior to 
transmission. If the data were transmitted without encoding, 
'DC wander' could result. 

A transmitted binary signal consists of a series of High and 
Low voltage pulses. If the signal consists of primarily High 
pulses, a mean DC component will exist. If the signal then 
becomes primarily Low pulses, the mean DC component will 
shift, or wander. This can cause transmission problems on 
capacitive lines. A Ivlanchester encoded signal cannot have a 
DC component. 

The principle of Manchester encoding is to introduce phase 
shifting in a carrier wave, which is continuously switching, 
such that logic input changes are represented by a phase shift 



of that carrier. If data at the input of an encoder switches from 
a logic High to a logic Low, then a phase shift of 180 degrees 
(equivalent to a polarity shift) is performed. When a Low-to- 
High transition occurs at the input, the carrier phase is 
switched back by 180 degrees to the original phase. 

For example, a logic High could be encoded as a zero phase 
shift, and a logic Low as a 180-degree phase shift from a 
reference. The result in the encoded signal is that two 
successive Highs or Lows represent either a positive or 
negative data transition (Figure 1). When the data in the figure 
toggles Low to High the encoded output gives two successive 
High levels as the phase of the carrier shifts by 180 degrees. 
The transition of data from High to Low will generate two 
successive Low outputs in the encoded signal, returning the 
signal to the original phase. 



1_ 



MANCHESTER 1 
ENCODED 
SIGNAL n 



^r^mruM^Lnnji 



Figure 1. A Manchester Encoded Signal from a Digital TTL Input 



Applications 

Manchester encoding is applied to local area networks, such 
as Ethernet, and can be used in storage of data on magnetic 
media, or in other transmissions including infrared. Both data 
and clock are imbedded in the Manchester encoded signal. 
Clock information may easily be extracted from the encoded 
waveform by using phase-locked loop techniques, and the 
clock can be used to recover the encoded data. Thus, the data 
can be resynchronized on reception. 



Circuit 

Figure 2 shows a schematic diagram of the Manchester 
encoder circuit. The design has been made to be compatible 
with either serial or parallel data output from standard UART 
(Universal Asynchronous Receiver/Transmitter) circuits. The 
clock input to the system will come from the clock input to the 
UART circuit, which is typically sixteen times the data rate. 
The divide-by-sixteen circuit will generate a square wave 
output at R3, the original data rate as applied to the UART. This 
signal is available as an output from the system. On a PCB, the 
R3 signal can be used to decode the data. 
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Figure 2. Manchester Encoder Circuit 



A further counter synchronously clocked is a divide-by-eight 
counter with outputs RA, RB and RC. These outputs will select, 
in the case of a parallel data input, one of the individual data 
lines D0-D7. The selected data line is routed through the 
multiplexer and applied to the input of an eXciusive-OR (XOR) 
gate. One of the inputs to the XOR gate is the data clock rate, 
available at the R3 flip-flop output. The function of the XOR 
gate has been configured in the PALC22V10 from a 'sum-of- 
products' architecture. 

The phase encoded signal has been derived at the output of 
flip-flop RX. Inverting the data gives a 180-degree phase shift 
from the reference established at the R3 output. The selected 
data input line determines whether or not phase (polarity) 
inversion should take place. If the data input is Low, no 
inversion takes place; if High, the reference signal is inverted 
to give the required phase shift. 

Equations 

The general output equation is: 

RX := R3*DATA + RTDATA (= R3 :+: DATA). (1) 

The data input is selected by a one-of-eight condition of the 
state machine RA, RB, and RC. RA is the clock reference 
divided by two, RB is the clock divided by four and RC by eight. 



For a parallel input, data DO is selected to control the output 
polarity at RX when RA*RB*RC is True, D1 when RA*RB*RC is 
True, D2 when RA*RB*RC is True, etc. This procedure 
serializes the parallel data. The Manchester encoded signal is 
created by this dynamic control of the inverting input to the 
XOR configuration through the selected data input. 

There is a serial select jnput to the multiplexer circuit such that 
when driven active High, serial data applied to the DO input will 
encode the carrier signal on a bit-by-bit basis. Input lines 01- 
D7 are ignored when the encoder is used in this mode. The SER 
mode should be hard-wired Low for parallel data input. 

To enable external handshake to be accomplished in either 
serial or parallel mode, a RDY output is available. Whe'n a 
complete byte or bit is encoded and transmitted to the output 
RX, the RDY pin will go High to request more data. Note that the 
Manchester encoder does not latch the data; it must remain 
valid for the duration of the encoding. The logic circuit required 
for the RDY pin is shown as a sum of product terms as applied 
to the 'D' input of the RDY flip-flop. 

The data that is applied to the input is available in a serial form 
at the DTST output and can be used for test purposes. DTST is 
simply the serialized output of the data input D0-D7. 
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PAL Device Design Considerations 

The PALC22V10 was chosen for this application because of 
the large number of product terms required to encode the XOR 
function. Up to sixteen product terms can be used to create an 
XOR function on eight data inputs. The PALC22V10 also offers 
low power with TTL compatibility, reducing power supply 
requirements while increasing reliability. 

The flip-flops RO, R1, R2, R3, RA, RB, and RC form a long 
binary count chain. The output from R3 forms the reference 
phase output which is the clock input divided by sixteen. As 
discussed, RA, RB, and RC form a binary state machine to 
select D0-D7 data inputs individually. 
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4a. State Assignment Map 



Design Procedure 

The following figures show the design procedure used to derive 
equations for the binary divide-by-sixteen counter R0-R3. The 
Boolean equations have been derived using the PALASM® 2 
software operators. 

Figure 3 shows the counter states in hexadecimal and binary 
formats. Figure 4 shows two Karnaugh maps, the state 
assignment and state transition; Figure 4a shows the 
hexadecimal value of the current state of the counter, and 
Figure 4b shows the next state. For example, the location in 
the state assignment map of hex 'A' would be represented by 
the next state hex 'B' in the transition map. 
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R3 
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V R1R0 
R3R2\ 10 11 01 00 
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11 

01 
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4b. State Transition Map 



Figure 4. The State Assignment Map (4a) Shows the Cun^nt 
Hexadecimal Value in the Appropriate Position in the 
Map. The State Transition Map (4b) Shows Next 
State. For Example, State A in the Assignment Map 
is Shown as B in the Transition Map Because that is 
the Next Count in the Sequence 



Figure 5 shows the binary weighting of the hexadecimal values 
of the flip-flops RO, R1 , R2, and R3 of the transition map. From 
this map, individual Karnaugh maps are drawn for each 
individual flip-flop, and equations derived by performing 
minimization for each flip-flop. 

V R1R0 
R3R2V 10 

10 



1101 


0011 


0101 


1001 


1111 


0000 


0111 


1011 


1110 


0001 


0110 


1010 


1100 


0010 


0100 


1000 



Figure 3. Binary Counter Truth Table. The Conditions of the 
Flip-flops RO, R1, R2, and R3 Are Shown in the Truth 
Table with the Hexadecimal Value of the Binary 
Weighting 



Figure 5. State Transition Map with the Binary Weighting of 
the Hexadecimal Entries. The Register Significance 
of the Entry is from Left to ffight. For Example, 1 101 
Represents the Binary Weighting of RO R1 /R2 R3 
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Figure 6 shows the Karnaugh maps and the equations for either 
true or complement outputs. For true outputs, minimization is 
performed on the logic One entries in the map. Where an 
inverting buffer is used between the internal flip-flop and the 
outside world, minimization is performed on the logic Zero 
entries In the map (shown bolder). This is because the internal 
flip-flop's output is set High, but the state of the output pin is a 
logic Low due to the inverting action of the buffer. With the 
output configuration of the PALC22V10, it is possible to 
choose either inverted or non-inverted outputs, so either 
equation will be suitable. 



The equations derived for a general counter of length N 
become: 



QN 



:= QN*QN-1*QN-2*...Q2*Q1*Q0 
+ QN'QTvM 



QN*QN-2 



+ QN*Q1 
+ QN*QO 
using the inverting option, or: 



QN 



R1 no V R1 RO 

R3R2X 10 11 01 00 R3R^. 10 11 01 00 

10 





^ 
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/RO:=RO (1) 
RO:=/RO (2) 



6a. Map for Flip-flop RO. 



1 Oil 



hA'A^M 



/R1 := RO'RI + /R0*/R1 (3) 
R1 :=/R0*R1 + R0*/R1 (4) 

6b. Map for Flip-flop R1. 



qn*qn-1*qn-2*...q2*q1*q0 
qn*qFm 

QN'QFT^ 



QN*Q1 
QN*QO 



for the non-inverting option. 

This generalized equation requires N + 1 product terms for a 
general flip-flop, QN. For example, Q7, in a binary count 
sequence, will require a summation of eight product terms. The 
complete design specification is shown in the PAL device 
Design Specification of Pattern 04. 



y R1 RO 

R3R2\. 10, 
10 



[J)KiM 



01 , 00 
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y R1R0 

R3R2 



/R2 := R2*R1*R0 + /R2VR1 + /R2VR0 (5) 
R2 := /R2*R1*R0 + R2*/R1 + R2*/R0 (6) 



6c. Map for Flip-flop R2. 



2\ 


10 


11 


01 


00 


10 


^ 


1 


1 


^ 


11 


^J 


© 


1 


vj 


01 


o\ 


© 





?T 


00 


^L 





u 


y 



/R3 := R3*R2*RrR0 + /R3*/R2 
+ /R3*/R1 + /R3*/R0 (7) 

R3 := /R3*R2*R1*R0 + R37R2 
+ R3'/R1 ♦ R3*/R0 (8) 

6d. Map for Flip-flop R3. 



Rgure 6. The Karnaugh Map of Rgure 5. Broken Down for 
Individual Flip-flops RO, R1 , R2, and R3. 
Minimization is Perfonned for Each Individual Rip- 
fiop 



The PALASM 2 software operators describing the equation 
are: 

* logical AND 

+ logical OR 

/ logical inversion 

:= registered output equation. 



Design Results 

Figure 7 is an oscilloscope photograph of the outputs of the 
programmed device. The top trace is the RDY signal. It goes 
High to indicate that a complete data byte has been 
transmitted and that the next data byte may be pr esented to 
the data inputs D0-D7. The second trace is DTST, the serial 
data stream D0-D7 of the parallel data input D0-D7. The bottom 
trace is the Manchester encoded waveform of the parallel data 
input, or RX. The markers show one complete frame of 
information. 




Figure 7. Oscilloscope Trace of Programmed Device. The 
Manchester Encoded Output is RX 
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TITLE MANCHESTER ENCODER. 

PATTERN 04. 

REVISION 01. 

AUTHOR CHRIS JAY. 

COMPANY MMI SANTA CLARA, CA. 

DATE 17 JULY 1986. 

THE PALC22V10 HAS BEEN DESIGNED TO FUNCTION AS A MANCHESTER 
ENCODER CIRCUIT. THE BINARY DATA INPUT WILL DETERMINE WHETHER 
THE TRANSMISSION SIGNAL SHOULD BE IN PHASE WITH A CLOCK 
REFERENCE OR SHIFTED BY 180 DEGREES FROM THAT REFERENCE. 
THE CHARACTERISTIC OF A MANCHESTER ENCODED SIGNAL IS A 
CHANGE OF PHASE FOR EACH TRANSITION OF THE DATA INPUT. 
ASSUMING THAT 'X' IS THE PHASE OF THE CARRIER THAT ENCODES 
A LOGIC LOW INPUT, WHEN THE INPUT DATA TRANSITIONS 
TO A LOGIC HIGH THE PHASE OF THE CARRIER WILL BE SHIFTED 
TO 'X + 180'. IT WILL CHANGE BACK TO 'X' ONLY WHEN THE 
DATA INPUT TOGGLES FROM A LOGIC HIGH TO A LOGIC LOW. 
THE KEY FEATURES OF THE ENCODED SIGNAL ARE: 

1. DATA AND CLOCK FREQUENCY INFORMATION IS 
CONTAINED IN THE ENCODED SIGNAL. CLOCK 
INFORMATION MAY BE EXTRACTED BY USING A 
PHASE LOCKED LOOP IN THE RECEIVER. 

2. NO DC COMPONENT IS CONTAINED IN THE 
PHASE ENCODED SIGNAL. 

THE APPLICATIONS OF PHASE ENCODED SIGNALS CAN BE IN SHORT 
COMMUNICATIONS LINKS, SUCH AS ETHERNET OR INFRARED TRANS- 
MISSIONS, OR IN STORAGE ON SOME MAGNETIC MEDIA. 

THERE ARE EIGHT DATA INPUTS, DO TO D7 . DATA PRESENT ON 
THESE PINS WILL BE ENCODED INTO A BI-PHASE SIGNAL AT THE 
OUTPUT QX, WHEN THE SER (SERIAL) INPUT DATA PIN IS CONFIGURED 
TO A LOGIC ZERO. TO CONVERT A SERIAL DATA STREAM OF LOGIC 
ONES AND ZEROS INTO A PHASE ENCODED SIGNAL THE SER INPUT 
IS TIED HIGH, AND THE DATA IS APPLIED TO THE DO INPUT 
OF THE PAL DEVICE. A CLOCK INPUT OF 16 TIMES THE DATA RATE 
IS REQUIRED FOR THE PARALLEL OR SERIAL ENCODING OF THAT 
DATA, SO THE DESIGN MAY BE USED WITH MANY POPULAR UART LSI 
CIRCUITS. THERE IS ONE HANDSHAKE OUTPUT FROM THE DEVICE, 
RDY. THIS SIGNAL GOES HIGH WHEN THE NEXT DATA BYTE OR BIT 
IS REQUIRED TO BE SET UP AT THE INPUT. 
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CHIP MANENC PAL2 2V10 






;PIN 


1 




2 


3 


4 




CLK 


/RST 


DO 


Dl 


;PIN 


7 




8 


9 


10 




D4 




D5 


D6 


D7 


;PIN 


13 




14 


15 


16 




SER 


RDY 


/DTST 


RC 


;PIN 


19 




20 


21 


22 




RA 




R3 


R2 


Rl 


GLOBAL 












STRING 


SO 


/RC* 


/RB*/RA' 






STRING 


SI 


/RC* 


/RB* RA' 






STRING 


S2 


/RC* 


RB*/RA' 






STRING 


S3 


/RC* 


RB* RA' 






STRING 


S4 


RC* 


/RB*/RA' 






STRING 


S5 


RC* 


/RB* RA' 






STRING 


S6 


RC* 


RB*/RA' 






STRING 


S7 


RC* 


RB* RA' 







EQUATIONS 




GLOBAL . 


SETF 


= RST 


/RO : = 


RO 




/Rl : = 

+ 


Rl* 
/Rl* 


RO 
/RO 



/R2 := R2* Rl* RO 
+ /R2*/R1 
+ /R2* /RO 

/R3 := R3* R2* Rl* RO 
+ /R3*/R2 
+ /R3* /Rl 
+ /R3* /RO 

RDY := RC*RB*RA*R3*R2*R1*R0*/SER 
+ R3*R2*R1*R0* SER 



5 


6 




D2 D3 




11 12 




NC GND 




17 18 




RB RX 




23 24 




RO VCC 




; GLOBAL TERM 






ENABLES RESET. 




STRING DECLARATIONS 




ENCODE STATE 







ENCODE STATE 


1 




ENCODE STATE 


2 




ENCODE STATE 


3 




ENCODE STATE 


4 




ENCODE STATE 


5 




ENCODE STATE 


6 




ENCODE STATE 


7 



FOR PARALLEL INPUT 

50 SELECTS DO INPUT, 

51 SELECTS Dl INPUT, 
ETC. 



SET INITIAL 
CONDITIONS, RESET. 
RO DIVIDES THE CLOCK 
INPUT BY 2. 
Rl DIVIDES THE CLOCK 
BY 4. 

R2 DIVIDES THE CLOCK 
BY 8. 



R3 DIVIDES THE CLOCK 
BY 16. 



RDY OUTPUT PERFORMS 
A HANDSHAKE OPERATION, 
WHEN HIGH, NEW DATA 
MAY BE APPLIED. 
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/RA := RA* R3* R2* Rl* RO*/SER 
+ /RA*/R3* /SER 

+ /RA* /R2* /SER 
+ /RA* /Rl* /SER 
+ /RA* /RO*/SER 

+ SER 

/RB := RB* RA* R3* R2* Rl* RO*/SER 
+ /RB* /R3* /SER 

+ /RB* /R2* /SER 
+ /RB* /Rl* /SER 

+ /RB* /RO*/SER 

+ /RB*/RA* /SER 

+ SER 

/RC := RC* RB* RA* R3* R2* Rl* RO*/SER 



+ 


/RC* /R3^ 


k 


/SER 


+ 


/RC* 


/R2* /SER 


+ 


/RC* 




/Rl* /SER 


+ 


/RC* 




/RO*/SER 


+ 


/RC* /RA* 




/SER 


+ 


/RC*/RB* 




/SER 


+ 






SER 


/DTST:= 


SO*DO 






+ 


S1*D1*/SER 






+ 


S2*D2*/SER 






+ 


S3*D3*/SER 






+ 


S4*D4*/SER 






+ 


S5*D5*/SER 






+ 


S6*D6*/SER 






+ 


S7*D7*/SER 






/RX : = 


S0*/R3*D0 


+ 


S0*R3*/D0 


+ 


S1*/R3*D1*/SER 


+ 


S1*R3*/D1*/SER 


+ 


S2*/R3*D2*/SER 


+ 


S2*R3*/D2*/SER 


+ 


S3*/R3*D3*/SER 


+ 


S3*R3*/D3*/SER 


+ 


S4*/R3*D4*/SER 


+ 


S4*R3*/D4*/SER 


+ 


S5*/R3*D5*/SER 


+ 


S5*R3*/D5*/SER 


+ 


S6*/R3*D6*/SER 


+ 


S6*R3*/D6*/SER 


+ 


S7*/R3*D7*/SER 


+ 


S7*R3*/D7*/SER 



COUNTER [RA, RB, RC ] 
COUNTS THE NUMBER 
OF BITS TRANSMITTED 
WHEN A BYTE IS 
APPLIED TO THE 
DO - D7 DATA INPUTS. 
THE LOGIC CONDITION 
OF THE SELECTED 
INPUT IS MULTIPLEXED 
TO THE RX REGISTERED 
OUTPUT BY THE 'ONE OF 
EIGHT' STATE SELECTOR, 
IN THE DIVIDE BY EIGHT 
COUNTER RA, RB, RC. FOR 
SERIAL MODE /RA*/RB*/RC 
SELECTS DATA INPUT DO, 
RA*/RB*/RC SELECTS Dl , 
ETC., WHILE /SER SELECTS 
PARALLEL OPERATION. IF 
SER IS ACTIVE, SERIAL 
ENCODING IS SELECTED. 
THE SERIAL DATA APPLIED 
TO THE DO INPUT IS 
ENCODED. Dl - D7 
INPUTS ARE DESELECTED. 
DO - D7 IS SERIALIZED 
FOR PARALLEL MODE. 
FOR SERIAL MODE, DO ONLY 
IS CLOCKED THROUGH TO 
THE DTST OUTPUT. 
/DTST OUTPUT IS THE 
SERIALIZED ENCODING 
OF THE DATA INPUT 
DO - D7, OR DO. 
THE RX OUTPUT PROVIDES 
THE MANCHESTER ENCODED 
OUTPUT OF THE REFERENCE 
FREQUENCY, WHICH OCCURS 
AT THE R3 OUTPUT. THE 
PHASE AT R3 IS SHIFTED 
AT THE RX OUTPUT IF THE 
SELECTED DATA INPUT PO- 
LARITY CHANGES. IF /SER 
IS INACTIVE, DO - D7 
INPUTS ARE SELECTED. IF 
/SER IS ACTIVE, ONLY THE 
SERIAL INPUT APPLIED TO 
DO IS SELECTED. 



Monolithic 



m 



Memories 



Z\ 



2-505 



PALC22V10 Creates Manchester Encoder Circuit 



SIMULATION 

TRACE_ON CLK RO Rl R2 R3 RA RB RC RX 

/DTST RDY DO Dl D2 D3 D4 D5 

D6 D7 SER /RST 
SETF /CLK RST DO Dl /D2 D3 D4 /D5 

/D6 /D7 /SER 
SETF RST 
CLOCKF CLK 
SETF /RST 
FOR I := 1 TO 128 DO 
BEGIN CLOCKF CLK 
END 

SETF SER DO 
FOR I := 1 TO 32 DO 
BEGIN CLOCKF CLK 
END 
TRACE OFF 



TRACE ALL SIGNALS. 

SET INITIAL CONDITIONS 
FOR DATA & CONTROL IN- 
PUTS. PARALLEL MODE SET. 
RESET ACTIVE TO 
INITIALIZE THE 
REGISTERS 
CLOCK 8 DATA BITS 
THROUGH TO R3 & RX 
OUTPUTS . 

SET SERIAL MODE VIA 
SERIAL CONTROL PIN. 
CLOCK SYSTEM TO SELECT 
SERIAL OPERATION. 
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Peripherals are integral to any computer system. They range 
from basic keyboards and printers to mass storage disksAapes. 
In this section we will focus on mass storage systems such as disk 
drives and tape drives. 

A typical mass storage subsystem can be thought of in terms of 
two major sections: the controller and the storage device. The 
controller provides an interface between the host and the storage 
device. It may control one or several storage devices at a time. 
Common storage devices are disk drives (Winchester and floppy) 
and tape drives (reel and cartridge). 

The discussion that follows will be oriented toward the storage 
device being a disk drive although it could just as easily have been 
done using tape drives. 

Figure 1 shows two configurationsof disk storage subsystems. In 
Figure 1 A the storage device is connected to the host via a string 
controller. Communication between the storage device and the 
controller is accomplished through standardized interface proto- 
cols. Examples of common disk drive interfaces in use today are 
SMD (Storage Module Device), ESMD (Enhanced SMD), ESDI 
(Enhanced Small Disk Interface), ST506, and SA465. An appli- 
cation example of an ESDI interface controller is shown on page 
2-509. An intelligent storage device is created when the string 
controller function is embedded into the storage device (Figure 
1 B). Intelligent storage devices typically communicate to the host 
through a host adapter using either SCSI (Small Computer 
System Interface) or IPI (Intelligent Peripheral Interface) interface 
protocols. 
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STRING 
CONTROLLER 



ESMD 
SMD 
ESDI 
ST506 
SA465 



STORAGE 
DEVICE 



a. String Controller With a Storage Device 



C 



HOST 
ADAPTER 



c 



IPI 

SCSI 



INTELLIGENT 
STORAGE DEVICE 



b. Host Adapter With an Intelligent Storage Device 



Figure 1. 



Controller 

The controller function can be conceptually divided into several 
sections: the Host Bus Interface, the Data Buffer, the Kernel and 
the Device Interface. 



C 



HOST BUS 
INTERFACE 



DATA 
BUFFER 



DEVICE 
INTERFACE 



ESMD 
SMD 
ESDI 
ST506 
^ SA465 , 



Figure 2. 

The Host Bus Interface provides the connections to the host CPU 
and main memory. It can serve as either a slave or a master on 
the host bus. When the controller is a bus slave, the host CPU 
must be able to access the controller, pass commands, and 
obtain subsystem status using established host bus protocol. 
The controller behaves as a bus master while transferring data in 
either direction, or posting status. A bus master must be able to 
arbitrate for the host bus and execute the necessary handshake 
protocols. 

The Data Buffer stores the data being read from or written to the 
disk and checks for data integrity. The physical buffer can serve 
several purposes: data transfer rate adaptor between the host 
and the disk, a temporary storage for data going to the host until 
it can be verified error free, and a cache. Depending on the 
complexity of the data buffer it may be as small as one byte or as 
large as a complete track of data. 

Data integrity can be monitored fordatatransfersfrom the host as 
well as from the disk drive. The sophistication used to monitor 
data integrity varies significantly from simple single bit error 
detection (parity) to large multiple bit error detection and correc- 
tion. The host may elect to transfer data with a parity bit for each 
byte. Data error detection and correction schemes are employed 
between the controller and the disk drive. As storage densities 
have increased, so have the size of the errors. The demand for 
more capable error detection and correction codes that can more 
accurately detect and correct larger number of bits in error has 
increased accordingly. 
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Such error detection codes as CRC code are satisfactory if the 
error occurs in the data transmission channel and the data can be 
recovered by retransmission. Data errors caused by media 
defects require the capability of both error detection and correc- 
tion. As the nature of media errors has changed, so have the error 
detection and correction codes being used. Fire Codes and 
computer generated codes have been the most popular codes. 
Symbol oriented codes, such as Reed-Solomon codes, are 
emerging as the solutions for the future. 

The Kernel is the brain of the controller and the manager of the 
subsystem. It translates commandsfrom the host into commands 
for the storage device. It receives requests from the operating 
system through the host interface and notifies the host when the 
request is completed. To execute the host's request for data, the 
kernel may have to access the storage device or it may only have 
to access a cache. When accessing the storage device the kernel 
sends commands according to the device interface protocols. 

The Device Interface communicates directly with the storage 
device. Using the particular interface protocols, the device inter- 
face sends commands, retrieves status, and transfers data 
between the controller and the storage device. The device inter- 
faces shown in Figure 3 are predominantly disk and would have 
to be changed to accommodate tape drives. An application 
example of a tape drive interface controller is shown on 
page 2-519. 

Storage Device 

The storage device can also be divided into several sections: the 
Interface & Control, the Servo and the Read/Write. 
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Figure 3. 



The Interface & Control is the brain of the storage device. 
It interprets the commands from the controller. It supervises the 
servo and the read/write and maintains the device status. 



The Servo is responsible for the movement of the media and the 
positioning of the read/write heads. For disk drives D.C. brush- 
less motors are predominantly used to drive the spindle. Head 
positioning is done either with a stepper motor or a voice coil 
actuator. In the case of tape drives, the Servo is only responsible 
for capstan motor control because the position of the read/write 
heads is fixed. 

The Read/Write is responsible for storing and retrieving data from 
the media. The data encoding method must balance data density 
and ease of data recovery. Several Run-Length-Limited (RLL) 
coding techniques are in use today.. Winchester disk drives have 
used frequency modulation (FfVI), modified FM (IVIFM), modified 
MFM (f^lMFM), and are currently using 2,7 to obtain present day 
densities. Tape drives employ different RLL codes than disk 
drives. The two common codes in use for tape storage are phase 
encoding (PE) and group-coded recording (GCR). The applica- 
tion note on page 2-541 shows an example of a GCR encoder im- 
plemented with PAL devices. 

When the storage device is configured with an embedded control- 
ler (Figure 4) the Device Interface secWon of the controller and the 
Interface & Control section of the storage device are not required. 
The Kernel will directly control the storage device. 
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Figure 4. 

PLDs have been used in designs throughout mass storage 
subsystems. Applications range from consolidation of random 
logic to address decoding, bus arbitration logic to protocol state 
machines, and complex encoding and decoding. Application 
notes and technical articles are included here to aid designers in 
understanding the benefits of implementing different designs with 
programmable logic. 



2-508 



Monolithic 



fdnEI 



Memories 



lK 



an ESDI Translator 
the EVi2064 Logic Cell Array 



AN-179 



The ESDI Translator 

ESDI is a low-cost, high-performance interface standard suita- 
ble for the smaller, high-performance Winchester disk drives 
currently being produced. The ESDI interface consists of a 
control cable and a data cable. The control cable allows for a 
daisy chain connection of up to seven devices (disk or optical 
drives) with only the last device being terminated. In our de- 
sign, we assumed that the device is a disk drive. The data 
cable is attached in a radial configuration (See Figure 1). 
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Figure 1. Connection Between thie Controiler and Multiple 
Drives 



The ESDI Translator handshakes serial commands from a disk 
controller, deserializes the commands and passes the com- 
mands to a microcontroller. The command data word structure 
is shown in Figure 2. 

The Command Function bits define functions to be executed by 
the disk drive. These functions are seek, recalibrate, request 
status, request configuration, select head group, control, data 
strobe offset, track offset, initiate diagnostics, set bytes per 
sector and set configuration. Some of these functions, such 
as the control function, have modifiers for more detailed func- 
tional description. Other commands have parameters that con- 
tain numbers. For the seek command, the parameter specifies 
the cylinder number that the drive will seek to. The request 
status and request configuration commands require data from 
the disk drive to be transferred back to the disk controller. In 
our current design, internal registers A and B in the LCA device 
represent the upper and lower bytes of the command respec- 
tively. 

Figure 3 illustrates the relationships between the disk control- 
ler, the ESDI Translator, and the microcontroller. The PROM is 
used to store t he configuration data for the LCA device. The 
Done/Program (D/P) output is driven LOW when the device is 
being configured. Configuration data is read from the PROM 
device during configuration. After configuration is complete, 
the LCA device drives the D/P pin HIGH to deselect the PROM. 
Drive selection, write protection, command completion and 
fault detection are also handled by the ESDI Translator. 
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Figure 3. An ESDI Translator Implemented on the LCA Device 



Why Use an LCA Device in an ESDI 
Translator 

The ESDI interface standard requires more logic functions to 
be built into the disk drive than some other interface stan- 
dards, such as the ST506 standard. However, the external di- 
mensions of a disk drive usually have to conform to an industri- 
al standard form factor. Thus, the use of high-density semicus- 
tom chips is the logical solution to increase functionality with- 
out increasing the external dimensions. 

The LCA device is a high-density CMOS integrated circuit 
available from Monolithic Memories. Its high gate density al- 
lows the implementation of an ESDI Translator in a single chip. 
Fifteen standard SSI and MSI chips would be necessary for the 
same application. If PLDs are used to implement the ESDI 
Translator, more than one would be necessary because a large 
amount of logic is required, thus occupying more board space. 

A major advantage in using the LCA device is that it can speed 
up the design cycle, enabling the manufacturer to have a 
shorter time-to-market. Also, many peripheral products are 
produced in relatively small quantities aiming at very special- 
ized markets. The LCA device, which has no NRE cost, makes 
the production of small quantities more economical than the 
gate array. Another advantage of the LCA device over other 
semicustom chips, such as the gate array, is its reprogramma- 
bility feature. The LCA device is RAM-based which can easily 
be reprogrammed by the user in the final system. This feature 
is especially important in the peripheral products market, 
where many products have short life spans. 



Design Implementation 

The ESDI Translator is responsible for all control interfaces 
between the disk controller and the disk drive. An internal 
block diagram of the ESDI Translator is shown in Figure 4. It 
consists of five major logic building blocks: 

• Drive selection 

• Read gate/write gate 

• Counter/controller 

• Shift register and parity generator/checker 

• Internal register address decoder 

Drive selection on ESDI-compatible drives involves three 
signals from the disk controller. Drive Select 0-2. These three 
drive select lines are encoded so that up to seven drives may 
be connected to the same ESDI port, as shown in Table 1 . 

On the LCA device, the drive number is selected by connecting 
the drive switch pins to either VCC or GND. When the code on 
the drive select lines, DSX, equals the code on the drive switch 
pins, DSWX, where X may be 0, 1, or 2, the drive is selected 
and the Drive Selected signal, DSELD, is asserted. Once the 
drive has been selected, serial commands output by the disk 
controller will be read by the LCA device. The actual implemen- 
tation is shown in Figure 5, using two CLBs and seven lOBs. 
BDSO and BDS1 are the names of the CLBs in the current de- 
sign. In our design, names that begin with the letter B or P are 
used to designate a CLE or an lOB respectively. SCLK is the 
system clock. 
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Figure 4. ESDI Translator Internal Block Diagram 
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Table 1. Drive Selection 



Figure 5. Configuration of the BDSO and BDS1 CLBs to 
Provide the Drive Selected Signal 
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Figure 6 shows the configurations of the two CLBs as dis- 
played on the computer screen by the XACT development soft- 
ware. In Figure 6a, the CLB is configured as one function of 
four variables. The D flip-flop is not used. The logic representa- 
tion, truth table, Karnaugh map, signal names, block name and 
Boolean equation are shown. 

In Figure 6b, the CLB is configured as one function of three 
variables, the output of which is connected to the D flip-flop. 
The Q output of the D flip-flop becomes the output of this CLB. 
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Figure 6a. Drive Selection Logic Implemented in a 
CLB (BDSO) 



X 

Y 

Q 

SET 

RES 

CLK 


Q 
FF 

K 


d[J ^^^ 
K 


Blk:HD 
BDSI 


DBA 


F 


A 


CLK 
Q 


A:DS2 

B: DSELDA 

0: 

D: DSW2 

K:SCLK 

X: 

Y: DSELD 


HHH 
L H L 


H 
H 


1 1 1 1 Jl 


M I i 1}d 

B 


F = -(D@A)*B 




WRGOUT 




Figure 7. Read Gate/Write Gate Logic Implementation 



The Counter/Controller handshakes commands from the disk 
controller. It also handshakes status/configuration data to the 
disk controller. It is also responsible for generating the Inter- 
rupt, Attention and Command Complete signals. Seventeen bit 
commands (one bit is parity) are transferred from the disk con- 
troller to the LCA device via the Command Data line. The serial 
bit transfer is performed using a pair of handshaking signals. 
Transfer Request (TREQ) and Transfer Acknowledge (TACK). 
TREQ is asserted by the disk controller when a bit is valid on 
the Command Data line, and TACK is asserted by the LCA de- 
vice when the command bit has been read. The handshaking 
action is shown in Figure 8. 
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Figure 6b. Drive Selection Logic Implemented in a 
CLB (BDS1) 
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The LCA device also performs logic functions for the Read 
Gate and Write Gate logic blocks. The Read Gate signal allows 
data to be read from the disk, and the Write Gate signal allows 
data to be written on the disk. These signals from the disk 
controller are input to the LCA device as Read Gate In and 
Write Gate In. Under normal operating conditions, Read Gate 
Out is asserted when Read Gate In is asserted and Write Gate 
Out is asserted when Write Gate In is asserted. When both 
Read Gate In and Write Gate In are asserted, a write error 
condition results and the Attention line is asserted, signalling 
the disk controller that an error has occurred. Also, the LCA 
device may be used to provide write protection to a disk drive. 
When the Write Protect signal is asserted, the Write Gate Out 
signal will not be asserted when the Write Gate In signal is 
asserted, preventing the write circuitry from being activated. 
These logic functions are implemented in two CLBs, BRWGO 
and BRWG1 , as shown in Figure 7. 



Figure 8. Command Data Transfer 



A 17-state counter counts the number of command bits shifted 
in or shifted out of the data registers. Its implementation is 
shown in Figure 9. The SHIFT signal is asserted, thus 
incrementing the counter, whenever there is a transfer request 
and the LCA device is selected. CNT_QO is the lowest bit of the 
shift register counter. This bit is inverted whenever SHIFT is 
asserted unless sixteen bits have already been shifted into the 
LCA device (CNT16 asserted, or CKCMD asserted when all 
seventeen bits have been shifted in). The DATAOUT signal is 
asserted after a Request Status command or a Request 
Configuration command has been transferred and the internal 
data registers have been loaded with data to be serially shifted 
out. Hence, when DATAOUT is asserted, CKCMD is negated 
and the counter is enabled. 
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Figure 9. 17-Statc Counter Implemented in six CLBs 

The logic generation of the Attention signal, ATTEN, is shown 
in Figure 10. Whenever there is a write fault, WRFLT, a parity 
error, PARERR, an interface fault, INTFLT, or an external error, 
CSTS(Change of Status), the ATTEN signal is asserted. When 
both WRGIN and RDGIN signals are asserted and the LCA 
device is selected, the WRFLT signal is asserted. The WRFLT 
signal is negated when the command transferred to the LCA 
device Register A is the Reset command defined by the ESDI 
standard, which has a Command Function of 0101 (Control) 
and a Command Modifier of 0000 (Reset Attention and 
Standard Status). The Command Function and Command 
Modifier formats are shown in Figure 2. 

Three CLBs, BPGO, BPG1, and BPG2, are responsible for 
generating the PARERR signal. BPGO is a multiplexer that 
selects the source of the input to the parity generator/checker 
(BPG1). If data is being shifted into the LCA device (parity 
checker mode), the CMDBITA input is chosen. If data is being 
shifted out of the LCA device (parity generator mode), the 
RA_Q7 output is chosen. These two signals are also shown in 



Figure 11. BPG1 is an odd parity generator/checker. In the 
parity checker mode, whenever an odd number of ones are 
passed through this CLB, the output is one. This output signal 
is connected to BPG2, which inverts the signal and asserts or 
negates the PARERR signal accordingly. If parity is correct, an 
interrupt is asserted to the microcontroller informing the 
microcontroller that a command has been received and is 
ready to be read. In the parity generator mode, the output of 
BPG1 is the parity bit. BPG1 is clocked by the SHIFT input, 
which is asserted whenever there is a transfer request and the 
LCA device is selected. BPG1 is reset by the Reset Parity 
Generator input, RSTPGEN. This signal is asserted when ei- 
ther INT is asserted or an interface fault is detected. 

The Interface Fault signal is asserted when the LCA device is 
selected and CNTR is negated before seventeen bits have 
been transferred. CNTR is asserted after the first command bit 
is shifted in and is negated after the seventeenth bit is shifted 
out. BINTFLTO is clocked by the system clock, SCLK, and 
reset by the RSTCOS signal. The lOB PCSTS is configured as 
a buffered input to signal external error conditions. 

The microcontroller is able to address four register locations in 
the LCA: two data registers, BRGA and BRGB, one error 
register, BDMX, and one command complete register, BRC7. 
Only three bits in the error register are used. They are bit for 
parity error, bit 4 for interface fault and bit 7 for write fault. Only 
one bit in the command complete register is used. This is bit 0, 
which has a value of zero when the command is completed. 
The addresses and contents of the registers are shown in 
Table 2. 



A1 AO 


REGISTER 


BIT NUMBER 
7 6 5 4 3 2 10 





REGISTER A 


















1 


REGISTER B 


















1 


ERROR 
REGISTER 


















1 1 


COMMAND 
COMPLETE REGISTER 



















Table 2. Register Addresses and Contents 

The LCA device implementation of the registers and multiplexer 
is shown in Figure 11. When the interrupt signal is asserted, 
the microcontroller reads the two data registers by setting the 
A1 and AO address lines appropriately and asserting the RD 
signal to the LCA device. These data registers contain the 
command that is transferred from the disk controller through 
the CMDBITA input. If the command is a request data 
command, configuration or status data is written to these two 
data registers by the microcontroller. These two bytes, plus a 
parity bit that is generated by the parity generator in the LCA 
device, are serially transferred to the disk controller over the 
Config/Status Data line through RA_Q7. After all seventeen 
bits have been transferred, the Command Complete signal is 
asserted. If the command is not a request data command, the 
microcontroller executes the command and upon completion, 
writes a byte of zeros to the command complete register of the 
LCA. When the command complete register is written with all 
zeros, the Command Complete signal is asserted by the LCA 
device.The command complete register may only be written, 
not read. The status register contains error bits that are set 
when errors are detected. This register may only be read and 
not written. 
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Figure 10. Generation of the Attention Signal by the Four Possible Error Conditions 
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Figure 11. Registers A, B, Error Register, and Multiplexer 
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Figure 11. Registers A, B, Error Register, and IVIuitiplexer (Continued) 
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Design Considerations 

The circuit diagram of the programmed LCA device is shown in 
Figure 12. This design implementation uses sixty-three of the 
sixty-four CLBs within the LCA. This translates into a 98% 
usage. The rightmost column of CLBs contains the multiplexer, 
which selects between registers A, B, or the error register to 
be placed on the data bus. Registers A and B are placed on the 
third and second column from the right end, respectively. The 
registers and multiplexer are placed physically close to each 
other to simplify routing. The CLBs which execute a particular 
function are placed physically next to each other. These 
functions include the parity generator/checker, 17-state 
counter, read gate/write gate logic, drive selection logic and 
error detection logic. The read gate/write gate logic and the 
drive selection logic, which require much I/O activity, are 
placed in CLBs near the edges of the device. 

Another consideration in implementing an LCA design is 
routing. Long lines are available for signals that have to travel 
a long distance within the LCA device. These lines can also be 
used for signals that must have minimal skew between differ- 
ent destinations. An automatic routing program is available in 
the XACT software package. 

Routing requires careful consideration when a design uses a 
high percentage of the available CLBs (above 95%). Although 
automatic routing is available, designs with high CLB usage 
may require point-to-point routing (EDITNET command in 
XACT). Some of the techniques that can be used are swapping 
input pins, swapping CLBs, and implementing buffers in 
unused sections of the CLBs. When swapping input pins, the 
designer must be careful because certain functions, such as 
the clock input to the flip-flop, may only be input on certain 
pins. Swapping CLBs is very simple because the XACT 
software provides a command for swapping CLBs, but 



sometimes not all of the signals can be successfully rerouted. 
Passing a signal through a CLB presents a routing channel that 
is not otherwise available. However, a delay is added to the 
signal. Usually, a combination of these techniques can be 
used to successfully route a high-density design. 

The high number of lOBs in the LCA device gives the designer 
much flexibility in designing the pinout of the device. In this 
design, thirty-five of the lOBs are used. The lOBs that are not 
being used for the actual design are not left unused, but are 
configured as outputs and are connected to various signals 
within the LCA device to provide test points for the LCA device. 
This greatly increases the testability of the design once it is 
placed on the board. 

XACT is used by the designer to define the CLBs and lOBs, 
and to perform EDITNET. Alternately, the design may be input 
using the schematic capture software offered by Daisy and 
Futurenet. In these methods, an automatic place and route 
software package divides the design into blocks that may be 
implemented in CLBs and lOBs, and routes the CLBs and 
lOBs automatically. Once the design has been completed, it may 
be simulated using the software package P-SILOS. 



Conclusion 

The LCA device provides many advantages to the user. Its 
high gate count and I/O capability could potentially replace 
several PLDs in many applications, hence reducing board 
space. The LCA device is preferred by many customers over 
gate arrays because it is reprogrammable 'on-the-fly' and there 
are no long design cycles and initial NRE cost of a gate array. 
The current design file, XDES15.LCA, is available upon 
request. The bit pattern and the .LCA file will be provided for 
programming the LCA device in an EPROM. 
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Figure 12. ESDI Translator LCA Design 
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Writing a Tape Drive 
Controller in PROSE 



Introduction 

The increasing complexity of today's computers is requiring in- 
creased use of distributed control design techniques. Such a 
system might include a 32-bit microprocessor as the Central 
Processing Unit (CPU), a 1 6- or 8-bit microcontroller as a periph- 
eral controller, and a complex sequencer as a peripheral drive 
controller. The new PROSE {PROgrammable SEquencer) de- 
vice from Monolithic Memories offers a simple, fast, and effective 
method for implementation of peripheral drive controllers. 

This article will describe the design of a QIC-02 compatible tape 
drive sequencer in the PROSE device. The design requirements 
will be outlined, and then the design process will be described in 
detail for a section of the design. 

QIC-02 Tape Drive 

The peripheral to be controlled by our circuit is a 1/4" streaming 
tape drive that operates according to the QIC-02 interface stan- 
dard. This standard specifies the command set and resulting tape 
drive control signals, as shown in Figure 1 . 



COMMAND 


QIC-02 COMMAND* 


D7 D6 D5 D4 D3 D2 D1 DO 


Invalid Command 


xxxxxxxx 


Read Status 


111111 


Beginning of Tape 


110 11110 


Erase Entire Tape 


110 1110 1 


Initialize Tape 


110 110 11 


Write 


10 111111 


Read 


1111111 


Read File Mark 


10 11111 


Write File Mark 


10 11111 



*The 8 bit commands shown in this table are inverted from the 
QIC-02 specification. This is because the PAL device has 
been programmed to accept active low signal inputs as 
defined by the QIC-02 specification. 

Figure 1. QIC-02 Commands 



Each command requires a certain sequence of events for execu- 
tion. For example, the Erase Tape command requires rewinding 
the tape, asserting the Erase and Initialize signals, moving the 
tape forward to the end, removing Erase and Initialize, and then 
rewinding the tape again. This type of function requires feedback 
of current state information, and additional inputs, for next state 
selection. This is the basic definition of a sequencer, or state 
machine (Figure 2). 

Sequencers are a common design element; they include any 
digital device which traverses through a sequence of states in an 
orderly fashion. A state is a set of values (a count) measured at 
different parts of the circuit. State machines are often used for 
timing delays, control signal generation, arbitration, event moni- 
toring, and multiple condition testing. 

Our tape drive sequencer must provide the proper control signals 
for the tape drive and then hold them until executed, monitor the 
status of the tape drive, and watch for new commands from the 
microcontroller. The block diagram of the system is shown in 
Figure 3. 

The peripheral microcontroller generates the 8-bit QIC-02 com- 
mands in response to the central 32-bit microprocessor. The 
sequencer decodes the command, determining which of the 
commands has been sent, and then begins execution of the 
command. 

Handshaking is performed by a Request signal from the micro- 
controller, signifying that a valid command is available, and a 
Ready signal from the sequencer, signifying that the command 
has been decoded or executed and a new command can be sent 
(Figure 4). Upon reset, the PROSE device asserts READY, 
informing the peripheral controller that the tape drive is ready to 
accept a command. The peripheral controller asserts REQUEST 
after issuing a command on the eight-bit data bus. When the 
PROSE device detects REQUEST asserted, READY is negated 
and the command is decoded. After the command is decoded, 
READY is asserted by the PROSE device, at which time RE- 
QUEST is negated by the peripheral controller. When REQUEST 
is negated, READY is negated by the PROSE device. 
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Figure 2. General Stato Machine. 
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Afterthe PROSE device decodes the command, the command is 
executed by outputting the proper drive codes and asserting the 
propersignals. Afterthe command has been completed, READY 
is asserted by the PROSE device, informing the peripheral 
controller that the next command may be issued. 

Command Execution 

The PROSE device must be reset after power up before any 
command execution takes place. This is accomplished by 
pulsing the Preset pin low. This places the PROSE device into the 
Initialize state. The first command that is issued by the peripheral 
controller must be a Select Drive command. If the correct drive 
is selected, the PROSE device will enter a Drive Selected state. 
A Select Drive command that selects an incorrect drive or any 
other command will be ignored. Once in the Drive Selected state, 
any command issued will be executed. The drive is unselected 
and returned to the Initialize state by issuing a Select Drive 
command for a different drive. 

The following text describes the execution of each of the QIC-02 
commands (see QIC-02 specification for timing diagrams): 

Beginning of Tape Command 

The Beginning of Tape command code directs the PROSE device 
to output the Rewind drive code. This causes the tape drive to 
rewind the tape. When BOT (Beginning Of Tape) is asserted by 
the tape drive, the Rewind drive code is negated and READY is 
asserted. BOT is asserted when the Beginning of Tape hole is 
detected by the tape drive. 



Erase Command 

The Erase command code directs the PROSE device to output 
the Rewind drive code until BOT is asserted, then assert ERASE 
and INITIALIZE. When ERASE is asserted, the tape drive 
activates the erase head and moves the tape forward until EOT 
(End Of Tape) is asserted by the tape drive. EOTis asserted when 
the End of Tape hole is detected by the tape drive. When EOT is 
asserted, ERASE and INITIALIZE are negated and the Rewind 
drive code is output until BOT is asserted. When BOTis asserted, 
the Rewind drive code is negated and READY is asserted. 

Initialize Command 

The Initialize command code directs the PROSE device to assert 
INITIALIZE and to output the Rewind drive code. When BOT is 
asserted, the Rewind drive code is negated and the Fast Forward 
drive code is output. When EOT is asserted, the Fast Forward 
drive code is negated and the Rewind drive code is output. 
Finally, when BOT is asserted, the Rewind drive code and 
INITIALIZE are negated and READY is asserted. 

Write File IVIark Command 

The Write File Mark command directs the PROSE device to 
output the Write File Mark drive code. When FM is asserted, the 
Write File Mark drive code is negated and READY is asserted. 
FM is asserted after the file mark has been written. 




Figure 3. Peripheral Controller System Including Tape Drive Sequencer. 
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Read Command 

Priorto issuing the Read command, ONLINE must be asserted by 
the peripheral controller. If ONLINE is not asserted when the 
Read command code is decoded, the Invalid Command drive 
code will be output and the PROSE device will wait for the next 
command. When the Read command is issued and ONLINE is 
asserted, the PROSE device outputs the Read drive code. Once 
the Read drive code has been output, the Read command may be 
terminated by either the tape drive or the peripheral controller. 
The tape drive terminates the Read command by asserting FM. 
FM is asserted when a file mark is detected. When this occurs, 
the Read command code is negated and READY is asserted. The 
peripheral controller may terminate the Read command by either 
negating ONLINE or by issuing a Read File Mark command. 
When ONLINE is negated, the Read drive code is negated and 
the Rewind drive code is output until BOT is detected, at which 
time READY is asserted. When the Read File Mark command is 
issued, the Read drive code is negated and the Read File Mark 
command is executed. 

Read File Mark Command 

The Read File Mark command directs the PROSE device to 
output the Read File Mark drive code. The Read File Mark code 
causes the tape drive to move to the next file mark. When FM is 
asserted by the tape drive, the Read File Mark drive code is 
negated and READY is asserted. FM is asserted when af ile mark 
is detected. 

Read Status Command 

The Read Status command directs the PROSE device to transfer 
six bytes of status data to the peripheral controller. After the 
command is decoded, the PROSE enables the first byte by 
outputting the Status Byte 1 drive code and asserts READY. The 
peripheral controller responds by reading the status byte and 
asserting REQUEST When the PROSE device delects RE- 
QUEST asserted, READY is negated and the next status byte is 
enabled. READY is not asserted until after the peripheral 
controller negates REQUEST. The remaining five status bytes 
are transferred in a similar manner. After all six bytes are 
transferred, READY is asserted. 

Write Command 

We will look at the Write command in detail as an example of the 
design process. The Write command allows the peripheral 
controller to write data onto the tape. Data is transferred via the 
same eight-bit bus that provides the command code. 

Priorto issuing the Write command. Online must be asserted by 
the peripheral controller. If Online is not asserted when the Write 
command code is decoded, the sequencer will assert the Invalid 
Command signal and wait for the next command. 

If Online is asserted when the Write command is decoded, the 
sequencer will assert the Write signal for the tape drive. Data is 
sent from the peripheral controller to the tape drive on the eight- 
bit bus, and the data is written onto the tape. 



The Write command may be terminated by either the tape drive 
or the peripheral controller. The tape drive may cause the Write 
command to be terminated if the end of the tape is reached. When 
the early warning hole is detected by the tape drive, it asserts 
EWH. When this happens, the sequencer must negate the Write 
signal and then assert the End of Media signal. It then waits to 
decode the next command. 

The peripheral controller may terminate the Write command 
when all of the data has been written. This may be noted by 
negating the Online signal, or by asserting the Request signal. 
When Online is negated, the sequencer must negate the Write 
signal. It then outputs the Write File Mark signal until the file mark 
is activated by the tape drive. Once the file mark is activated, the 
sequencer must rewind the tape. 

When the Request signal is asserted by the peripheral controller, 
the sequencer negates the Write signal. It then decodes the new 
command, which usually would be a Write File Mark command. 

State Diagram Representation 

All of this complex interaction can be described in a simple state 
diagram. State diagrams are the universal language for describ- 
ing state machines. The diagrams depict the two major opera- 
tions in a state machine: control sequencing and output genera- 
tion. Control sequencing is changing from present state to next 
state. It can be direct, based only upon present state, or 
conditional, based upon present state and input conditions. 
Similarly, output generation can be direct, based upon present 
state only, or conditional, based upon present state and input 
conditions. The former is called a Moore state machine, while the 
latter is a Mealy state machine. 

Transitions for our tape drive sequencer depend on the state and 
conditional inputs, as with the Request and Online inputs during 
execution of the Write command. Outputs, however, will depend 
only on the present state; thus, our sequencer will be a Moore 
state machine. 

The state diagram for a Moore machine is simple to understand. 
Each state is represented by a bubble, and each transition is 
represented by an arrow. Conditions required for transitions, if 
any, are listed next to the arrows. Outputs associated with each 
state are listed with the state name in the bubble. 

Write Command State Diagram 

The state diagram for the Write command is shown below. 
Thirteen states are shown. The diagram is entered from the top, 
from the part of the state diagram that performs command 
decoding. WRCMD is the state entered when the Write command 
is decoded. In this state the sequencer asserts the Ready signal, 
telling the peripheral controllerthat it has decoded the signal. The 
sequencer then waits for the peripheral controller to remove the 
Request signal before continuing to state WRCMDA, or A for 
short. 

In state A the sequencer immediately checks to make sure that 
Online is asserted before beginning the Write sequence. If it is 
not, the Invalid Command signal is output and the sequencer 
enters another part of the state diagram (CMDERRA). 
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If Online is asserted, the sequencer moves to state B, asserting 
the Write signal to the tape drive to allow writing to begin. The 
sequencer moves from states B, E, and F and back to B, until the 
Write process is interrupted. The tape drive responds by moving 
forward and writing the information provided on the eight-bit 
data bus. 



(COMMAND DECODING) 




(DRVSELD) 



(BOTCMDA) 



Figure 5. Write Command State Diagram. 



As described, the Write command can be terminated by either the 
tape drive or the peripheral controller. State B checks for 
assertion of the EWH (Early Warning Hole) signal from the tape 
drive. If asserted, the sequencer immediately negates the Write 
signal in state C, and then asserts the EOM (End of Media) signal 
to the peripheral drive controller in state D. The sequencer then 
goes to the Drive Selected state (DRVSELD) and begins the 
command decoding procedure in another part of the state 
diagram. 

State E checks for assertion of the Request signal, signalling the 
end of the data to be written. In this case, the sequencer would 
immediately negate the Write signal and go to the Drive Selected 
state. 

State F checks for removal of the Online signal. If Online is true, 
the sequencer moves backto state B and continues writing. If not, 
it will move to state G, removing the Write signal, and then state 
H, outputting the Write File Mark signal, it then waits in state H 
until the File Mark signal is activated by the tape drive, indicating 
that the file mark has been written. When File Mark is activated, 
the Write File Mark signal is removed in state I, and the sequencer 
moves to the Beginning of Tape command sequence to rewind 
the tape. 

Device Selection 

So far, the design has been described and defined without regard 
to the implementation methodology. The complete sequencer 
can be implemented in several medium-size discrete logic de- 
vices, or one large custom circuit. 

The best choice is often a programmable logic device. Program- 
mable logic provides the integration of custom logic with the quick 
time-to-market of standard products. Especially important in this 
application is the ability to adapt to design changes. The QIC-02 
specification can be modified to enhance performance or add 
features. Of course, any design errors can also be quickly fixed, 
even after layout of the PC board. 

Programmable logic devices offer the complexity to implement 
large designs in few devices. At the same time, software support 
tools allow very simple design entry, even of complex designs. 
This design will require almost no additional work to describe for 
PLD software. Device programmers provide an instant custom 
device, and can even test the device immediately after 
programming. 

Selection of PROSE Device 

The PROSE device is selected as the heart of this application. 
The primary reason is the number of states required for implem- 
entation of the sequencer. We sawthat the Write command alone 
had thirteen states, although some were transitions to other parts 
of the state diagram. There are eight other commands required, 
totaling over 100 states. Only the PROSE device provides 
enough states in a single programmable chip. 

Another requirement in many applications is high speed. The 
PROSE device offers a maximum frequency of 30 MHz internally. 
At the same time, the PROSE device has enough logic on board 
to require very few surrounding logic chips. 
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PROSE Device Description 

The PROSE device has the architecture shown in Figure 6. It is 
based on a 128x21 PROM array, providing 128 states of 21 bits 
each. The 128 states will include the states WRCMD and its 
following states from A through I. Eight of the twenty-one bits in 
each state are the outputs of the device, providing signals such 
as the Write and Ready signals. 

The other thirteen bits in each state feed back internally for next 
state selection. Six of the feedback signals feed back into a PAL 
array, which is also fed by eight external inputs. These external 
inputs are the condition inputs, such as the Request and Online 
signals, which determine the next state. The feedback signals 
determine which of these eight inputs will determine the next 
branch. 

The outputs of the PAL array are two address lines to the PROM 
array. Before reaching the PROM array, however, these signals 
go through two Exclusive-OR (XOR) gates controlled by feed- 
back signals. These gates are used by the software to optimize 
the fit of the design within the device, and will be described later. 
The other five address lines to the PROM array are fed back 
directly from the previous state. Thus, two of the lines can be 
changed by input conditions, providing four-way branching capa- 
bility from any state. 

Expanding Inputs for the PROSE 
Device 

The PROSE device has a balanced architecture of eight inputs 
and eight outputs. However, we already know that we need more 
than eight inputs in the design. The eight bits of the command 
code from the peripheral controller need to be decoded, along 




Q(7-0) 



Figure 6. PMS14R21 Architecture 



with the Request and Online signals, and the sequencer must 
monitor the tape drive signals, including the Early Warning Hole 
and File Mark signals already mentioned. In addition to these 
signals, the sequencer must monitor the End of Tape and Begin- 
ning of Tape signals from the tape drive, making atotal of fourteen 
inputs. 

A simple solution is encoding the eight different eight-bit com- 
mands into a three-bit command code before reaching the 
PROSE device. Thiscan be easily done in almost any 20-pin PAL 
device. With a PALdevice as the solution, wecan easily add even 
more logic surrounding the sequencer, including drive selection. 
Drive selection logic requires that more commands be added to 
the original eight, forcing the use of four bits for the encoded 
commands. Three of the independent status signals from the 
tape drive can also be combined in the PAL device (Beginning of 
Tape, End of Tape, and File Mark), providing one Status signal. 
(Early Warning Hole remains a separate input.) 

The resulting logic block fits easily into a simple PAL20L8 combi- 
natorial 24-pin PAL device (Figure 7). Several versions of this 
device are available to choose from, including a CMOS device 
offering zero standby power, and high-speed devices at 15 
nanoseconds or less propagation delay. 

The encoding of the command codes, including drive selection, 
is shown in Figure 8. Now we must write the equations for 
PALASM 2 software for implementation in a PAL20L8. Writing the 
equationsfor each output is a simple matter; active-low equations 
are written where each product term is one of the commands in 
which that particular command code bit is zero. For example, the 
equation for CC3 is simply the sum of the products defining the 
Write, Read, Read File Mark, and Write File Mark commands. 
The equation would be: 

/CC3 = D7*/D6* D5* D4* D3* D2* Dl* DO (Write) 

+ /Dl* D6* D5* D4* D3* D2* Dl* DO (Read) 

+ /D7* D6*/D5* D4* D3* D2* Dl* DO (Read 
File Mark) 

+ D7*/D6*/D5* D4* D3* D2* Dl* DO (Write 
File Mark) 

where / means inversion, * means AND, and -i- means OR. The 
equations can be minimized either by observation or by running 
them through the program MINIMIZE in the PALASM 2 software 
suite. The full PALASM 2 file can be simulated for design 
verification, and then processed to create a JEDEC file for 
programming. 



D0-D7- 
DS0-DS1 ■ 

FM- 
BOT- 
EOT- 



PAL20L8 



-> CCO-^CS 



-> STATUS 



Figure 7. PAL Encoding Logic Block. 
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Expanding Outputs for the PROSE 
Device 

The section of the design that we lool<ed at in detail required five 
output signals: Ready, Invalid Command, Write, End of Media, 
and Write File Mark. Other required outputs include Read, Read 
File Mark, Fast Forward, Rewind, Erase, Initialize, and Drive 
Selected. These twelve outputs already extend us beyond the 
eight outputs available in the PROSE device, in addition, six 
Enable Status bits are required, making a total of eighteen 
outputs required. 

Asimple solution can again be provided by a programmable logic 
device, to decode outputs from the PROSE device into the 
required output signals. The requirement for many outputs is met 
by two PAL devices, the PAL6L1 6 and PAL8L1 4. However, even 
faster solutions are provided by two PLE (PROM as Logic 
Element) devices, the PLE5P16 and PLE6P16. These devices 
are specialized PROMs, both with sixteen outputs. The PLE5P1 6 
offers the highest speed at 1 8 ns propagation delay, and provides 
an output enable, which can be useful when testing the tape drive 
independent of the sequencer logic. 



In the PLE5P1 6 we want to decode four outputs from the PROSE 
device into fourteen signals. The four signals to be left as direct 
outputs from the PROSE device can be the signals used more 
often, or those that require higher speed. In this case, Ready, 
Drive Selected, Erase, and Initialize are left direct, while the other 
signals are decoded through the PLE device. The equations for 
the decoding are arbitrary, as long as every signal in asserted by 
its own code. (None of these signals are to be asserted at the 
same time.) The decoding is shown below. 

The generation of the equations is similar to that shown previ- 
ously. For example, 

STSO = /CD3*/CD2*/CD1* CDO 

is the decoding for asserting the Enable Status byte signal. 

However, this time PLEASM software is used for design entry. 
The design method is equivalent to that for PAL devices in 
PALASM software, butthe software creates a programming file in 
the PROM format. 



COMMAND 


QiC-02 COMMAND 


PAL COMMAND CODE 


D7 D6 D5 D4 D3 D2 D1 DO 


CC3 CC2 CC1 ceo 


Invalid Command** 


xxxxxxxx 


1111 


Select Drive 1* 


11111110 


1110 


Select Drive 2* 


1111110 1 


1110 


Select Drive 3* 


111110 11 


1110 


Select Drive 4* 


11110 111 


1110 


Read Status 


111111 


10 11 


Beginning of Tape 


110 11110 


10 10 


Erase Entire Tape 


110 1110 1 


10 1 


Initialize Tape 


110 110 11 


10 


Write 


10 111111 


11 


Read 


1111111 


10 


Read File Mark 


10 11111 


1 


Write File Mark 


10 11111 






If the drive is the incorrect drive number, the four bit command code is 1101 
* Any command code not shown above will result in an illegal command code. 



Figure 8. PAL Command Codes Table. 
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CD3 


CD2 


CD1 


CDO 


SIGNAL ASSERTED 














No Command 











1 


STSO (EnablQ Status byte 0) 








1 





STS1 (Enable Status byte 1 ) 








1 


1 


STS2 (Enable Status byte 2) 





1 








STS3 (Enable Status byte 3) 





1 





1 


STS4 (Enable Status byte 4) 





1 


1 





STS5 (Enable Status byte 5) 





1 


1 


1 


EOM (End of Media) 













RFM (Read File Mark) 










1 


WFM (Write File Mark) 







1 





READ (Read Enable) 


10 11 


WRITE (Write Enable) 




1 








FFWD (Fast Forward) 


110 1 


RWD (Rewind) 




1 


1 





ICMD (Invalid Command) 


1111 


No Command 



Figure 9. PLE Devico Decoding 

PROSE Design Specification 

We are now ready to discuss the details of the design specifica- 
tion for the PROSE device. The software used is a part of the 
PALASM 2 software suite called PROASM for PROSE Assem- 
bler, it requires almost no additional work on the part of the 
designer; it simply requires a description of the state diagram. 
PROASM has the following file format: 

• Declaration Section 

• State Section 

• Conditions Section 

Declaration Section 

The Declaration Section is the location of the general information 
about the design, including the design engineer's name and 
company. It provides for device selection; in this case, we are 
using the PROSE device, which has the part number PMS1 4R21 . 
It also allows the definition of all of the inputs and output signal 
names. Remember, four of the inputs are PAL device outputs 
(CC0-CC3), and four of the outputs feed inputs to the PLE device 
(CD0-CD3). 



QIC-02 COMMAND DECODER 

1 

B 

WON 

MMI 

JANUARY 19, 1987 

QIC_02_CMD_DEC PMS14R21 

12 3 4 

CLK DCLK /ONLINE /REQUEST 

5 6 7 8 9 10 
/ceo /CCl /CC2 /CC3 STATUS EWH 
11 12 

SDI GND 

13 14 15 16 17 18 
PRESET SDO CD3 CD2 CDl CDO 

19 20 21 22 
INITIALIZE ERASE DRSELD READY 
23 24 
MODE VCC 



TITLE 

PATTERN 

REVISION 

AUTHOR KEN 

COMPANY 

DATE 

CHIP 

; INPUT PIN 

; INPUT PIN 
; INPUT PIN 

; OUTPUT PIN 

; OUTPUT PIN 
/OUTPUT PIN 

State Section 



The next part of the design file is the State section. This section 
begins with the keyword STATE. The first item required is 
selection of either a Moore or a Mealy type state machine. As 
discussed previously, a Moore type state machine is required. 

STATE 
MOORE_MACHINE 

Next, several general or default values are specified. First is 
selection of the programmable Preset/Enable function pin. Either 
can be used: in this case. Preset is desired, so the keyword 
MASTER_RESET is entered. 

MASTER_RESET 

Next are global default output values. Global default output 
values are the values to be assigned to the outputs when the 
design specification does not specify a particular value. This is 
not used, so no default output function is entered. However, the 
design file can be simplified by specifying a default value for the 
outputs (either High or Low), and then only specifying the output 
value when it differs from the default value. 

Next are global default transitions or branches. Global default 
branches are the branches to be taken when no branch is 
specified in the design file. This is a method of not only simplifying 
tlie design file, but also guaranteeing that there is atransition from 
every state for every input condition. In this design, the default is 
chosen to be HOLD_STATE; this means that the sequencer will 
remain in the same state if no other transition is enabled. 

DEFAULT_BRANCH HOLD_STATE 

Next is the specification of the state to enter after power-up. The 
PROSE device has power-up preset, whereby all twenty-one flip- 
flops power up into the logic One state. This state is called 
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POWER_UP. The transition from ttiis state (on the first clock 
edge) shows how transition equations are written. The format is 



STATE X 



CONDITION A -> STATE Y 



meaning that when in State X, if condition A is true the machine 
will transition to state Y. In this design, we want to transition to 
state INITon any condition, so the equation is written 

POWER_UP := VCC -> INIT 

where VCC means unconditionally. 

The first transition in the Write command state diagram requires 
two branches. The local default branch is indicated by the + sign, 
which is the "otherwise" operator. Thus, from state WRCMD, 
we have 

WRCMD := REQ -> WRCMD 
+-> WRCMDA 

which means "if REQ (Request) is true, go to state WRCMD (stay 
in the same state), otherwise (if REQ is not true) go to slate 
WRCMDA." 

The remaining transition equations are written similarly. In effect, 
the arrows in the state diagram are directly transcribed into 
arrows in the equations. 

WRCMDA := ONLN -> WRCMDB 
+-> CMDERRA 

WRCMDB := EW -> WRCMDC 
+-> WRCMDE 

WRCMDC : = VCC -> WRCMDD 

WRCMDD := VCC -> DRVSELD 

WRCMDE := REQ -> DRVSELD 
+-> WRCMDF 

WRCMDF := ONLN -> WRCMDB 
+-> WRCMDG 

WRCMDG := VCC -> WRCMDH 

WRCMDH := STS -> WRCMDI 
+-> WRCMDH 

WRCMDI := VCC -> BOTCMDA 

Note that the equation for WRCMDH does not need the second 
line; the global default transition is to stay in the same state, so it 
does not need to be specified here. 

Output Definition 

The next task is to define the eight outputs for each of these 
states. Outputs are defined in the form of an equation. For 
example, 

STATE_X.OUTF := 0UT_1 * /0UT_2 

means that in state X, output 1 will be High and output 2 will be 
Low. If other outputs exist, they will be assigned their default 
values. 



Output equations for the Write command section are written 
according to the state diagram. In all of the Write states. Drive 
Selected (DRVSELD) remains active. Thus, in state WRCMD 
only Drive Selected and the Ready signal are active, telling the 
peripheral controller that the Write command has been decoded 
and is ready to be executed. The resulting output equation for 
state WRCMD is 

WRCMD. OUTF := READY*/CD3*/CD2*/CD1*/CD0 
*/ERASE*/INITIALIZE* DRSELD 

The other states have similar output equations, translating the 
required output signal to the appropriate encoding for the PLE 
device to decode. 

WRCMDA. OUTF := /READY*/CD3*/CD2*/CD1*/CD0 
*/ERASE*/INITIALIZE* DRSELD 

WRCMDB. OUTF := /READY* CD3*/CD2* CDl* CDO 
*/ERASE*/INITIALIZE* DRSELD 

WRCMDC. OUTF := /READY*/CD3*/CD2*/CD1*/CD0 
*/ERASE*/INITIALIZE* DRSELD 

WRCMDD. OUTF := /READY*/CD3* CD2* CDl* CDO 
*/ERASE*/INITIALIZE* DRSELD 

WRCMDE. OUTF := /READY* CD3*/CD2* CDl* CDO 
*/ERASE*/INITIALIZE* DRSELD 

WRCMDF. OUTF := /READY* CD3*/CD2* CDl* CDO 
*/ERASE*/INITIALIZE* DRSELD 

WRCMDG. OUTF := /READY*/CD3*/CD2*/CD1*/CD0 
*/ERASE*/INITIALIZE* DRSELD 

WRCMDH. OUTF := /READY* CD3*/CD2*/CD1* CDO 
*/ERASE*/INITIALIZE* DRSELD 

WRCMDI. OUTF := /BEADY*/CD3*/CD2*/CD1*/CD0 
*/ERASE*/INITIALIZE* DRSELD 

Again, if default output values were used, these equations could 
be extremely simplified. 

Conditions Section 

The final section is the Conditions section, designated by the 
keyword Conditions. This section defines the conditions used in 
the state equations. In this design, the conditions are simply the 
inputs to the PROSE device, and are straightforward: 

CONDITIONS 

ONLN = ONLINE 
STS = STATUS 
EW = EWH 
REQ = REQUEST 

Other conditions used in the design file are also placed in this 
Conditions section. 
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Automatic State Assignment 

The other sections of the design file are completed in a similar 
manner, and the whole design is assembled by PROASM soft- 
ware. A key feature of PROASM software, in addition to design 
verification, is automatic state assignment. State locations in the 
PROM and conditional product terms in the PAL array are used 
to the greatest possible efficiency, to provide the best fit of the 
logic within the device. 

The automatic fitting is easily seen in Figure 10. Each individual 
state can branch to one of four locations: these four locations form 
a "primary location." For example, the hex addresses 1 A, 3A, 5A, 
and 7A form a primary location, which can be designated by the 
smallest address, 1A. 

The software optimizes use of these primary locations to elimi- 
nate excessive duplication of states in different primary locations. 
For example, the state WRCMD is duplicated in primary location 
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1A (hex address 3A) and 19 (hex address 79). This duplication 
is necessary because of the branch from WRCMD to either 
WRCMD itself or WRCMDA (or A in the shortened form). The 
branch cannot be made to the primary location where WRCMD 
already exists, because the other three locations have already 
been used. Thus, WRCMD must be duplicated in anotherprimary 
location which has an open location for state A. 

The software then puts the two-way branch from state A in primary 
location 5. It then makes use of the other half of the primary 
location for the two-way branch from state B. However, when it 
then assigns the two-way branch from state F to states B or G, it 
must duplicate B in primary location 4 since there is no room left 
in its primary location (5) for G. 

Note that the software avoids duplication of state DRVSELD, 
however. The two-way branch from state E to states DRVSELD 
or F is placed in the other half of primary location 4. When 
DRVSELD is branched to again from state D, the same location 
can be used. 

This duplication avoidance is where the XOR gates are used. To 
place a branch in the lower order half of a primary location, the 
branch addresses must be 00 or 01. This is accomplished by 
making the XOR feedback 00 and allowing a PAL product term to 
determine whether the second address bit should be a 1 . To use 
the higherorderhalf of a primary location (branch addresses 10 
or 11), the XOR feedback would be made 10 and again a PAL 
product term would be used to determine the other bit. XOR 
feedback is also used to determine "right or left side" of a primary 
location, or to use a product term that already exists on one side 
of the PAL array. 

The PROASM software provides detailed implementation re- 
sults, showing how the states and product terms were assigned 
for every state and transition. 

PROSIM Simulation 

The entire sequencer can be simulated by PROSIM software, to 
verify the design. This can be done before any parts are 
programmed. Simulation results can be added to the program- 
ming file to provide automatic testing after programming. 

Completed Design 

Thecomplete design required less than a weekto implement from 
scratch. Two days were required to define the application. Only 
one day was required to write the PROSE design specification. 
Simulation and design checking took an additional day. Only half 
a day each were required for the PAL and PLE device designs. 
This included revisions to the PAL and PROSE devices to fix 
minor design errors. The complete design is shown in Figure 11 , 
and the design files are shown on the following pages. 



Figure 10. State Assignments for the Six Primary PROM 

Locations Used for the Write Command Sequence. 

Arrow Indicate Potential State Transitions, to a Set of 

Circled Next States. All Next States Must be In the Same 

Primary Location. 
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Conclusion 

Programmable logic devices were once the glue logic that tied 
together the intelligence of a digital system. Today, advanced 
chips such as the PROSE device are capable of implementing 
distributed control functions and make up the heart of the control 
of the system. With complementary advanced design tools 
available with PALASI^ 2 software, PLDs are an effective ap- 
proach to even the most complex tasks. 



The PROSE device has been designed to implement a complex 
tape drive sequencer. Design implementation in the PROSE 
device is made easy for several reasons, including: 

1. Design file requires merely transcription of the state diagram 

2. Software automatically optimizes fit 

3. Programmability of the device allows design errors to be fixed 
quickly 

As a result, the complete tape drive sequencer required only four 
days to design and implement. 
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THIS PAL DEVICE IS PART OF THE QIC-02 COMMAND SEQUENCER 
DESIGN. THE PRIMARY PURPOSE OF THIS PAL DEVICE IS TO ENCODE 
8 BIT COMMANDS INTO 4 BIT COMMAND CODES. IT IS ALSO USED TO 
ENCODE TAPE DRIVE STATUS SIGNALS AND SELECT THE DRIVE NUMBER. 



TITLE 

PATTERN 

REVISION 

AUTHOR 

COMPANY 

DATE 



QIC-02 COMMAND DECODER PAL 



KEN WON 

MONOLITHIC MEMORIES 

JANUARY 26, 1987 



CHIP QIC_0 2_CMD_PAL PAL20L8 

D7 D6 D5 D4 03 D2 Dl DO DSO DSl FM GND 

BOT EOT NC /DSELD /CC2A /STATUS CC3 CC2 CCl CCO NC VCC 



EQUATIONS 

STATUS = /EOT*/BOT*/FM 

/CCO = D7*D6*/D5*D4*D3*D2*D1*/D0 
+ D7*D6*/D5*D4*D3*/D2*D1*D0 
+ D7*/D6*/D5*D4*D3*D2*D1*D0 
+ /D7*D6*D5*D4*D3*D2*D1*D0 
+ D7*D6*D5*D4*DSELD 



;ASSERT STATUS TO PMS14R21 

BOT COMMAND 

INITIALIZE 

WRITE FILE MARK 

READ 

SELECT DRIVE 



/CCl = D7*D6*/D5*D4*D3*D2*/D1*D0 
+ D7*D6*/D5*D4*D3*/D2*D1*D0 
+ D7*/D6*/D5*D4*D3*D2*D1*D0 
+ /D7*D6*/D5*D4*D3*D2*D1*D0 
+ D7*D6*D5*D4*/DSELD 

/CC2 = D7*D6*/D5*D4*D3*D2*D1*/D0 
+ D7*D6*/D5*D4*D3*D2*/D1*D0 
+ D7*D6*/D5*D4*D3*/D2*D1*D0 
+ D7*/D6*D5*D4*D3*D2*D1*D0 
+ D7*/D6*/D5*D4*D3*D2*D1*D0 
+ /D7*D6*/D5*D4*D3*D2*D1*D0 
+ /D7*D6*D5*D4*D3*D2*D1*D0 
+ /D7*/D6*D5*D4*D3*D2*D1*D0 



/CC3 = D7*/D6*D5*D4*D3*D2*D1*D0 
+ D7*/D6*/D5*D4*D3*D2*D1*D0 
+ /D7*D6*D5*D4*D3*D2*D1*D0 
+ /D7*D6*/D5*D4*D3*D2*D1*D0 

DSELD = /D0*/DS1*/DS0*D1*D2*D3 
+ /D1*/DS1*DS0*D0*D2*D3 
+ /D2*DS1*/DS0*D0*D1*D3 
+ /D3*DS1*DS0*D0*D1*D2 



ERASE 

INITIALIZE 
WRITE FILE MARK 
READ FILE MARK 
UNSELECT DRIVE 

BOT COMMAND 

ERASE 

INITIALIZE 

WRITE 

WRITE FILE MARK 

READ FILE MARK 

READ 

READ STATUS 

(MINIMIZATION REDUCES 

PRODUCT TERMS) 

WRITE 

WRITE FILE MARK 

READ 

READ FILE MARK 

DRIVE 1 SELECT 

DRIVE 2 SELECT 

DRIVE 3 SELECT 

DRIVE 4 SELECT 



SIMULATION 

; THIS SIMULATION FILE TESTS THE OUTPUT FOR ALL VALID AND ONE INVALID 
; COMMAND INPUT 



TRACE_ON D7 D6 D5 D4 D3 D2 Dl DO 
SETF D7 D6 /D5 D4 D3 D2 Dl /DO 
SETF D7 D6 /D5 D4 D3 D2 /Dl DO 
SETF D7 D6 /D5 D4 D3 /D2 Dl DO 
SETF D7 /D6 D5 D4 D3 D2 Dl DO 
SETF D7 /D6 /D5 D4 D3 D2 Dl DO 
SETF /D7 D6 D5 D4 D3 D2 Dl DO 
SETF /D7 D6 /D5 D4 D3 D2 Dl DO 
SETF /D7 /D6 D5 D4 D3 D2 Dl DO 
SETF /D7 D6 D5 D4 /D3 /D2 Dl DO 
SETF EOT 
SETF /EOT 
SETF BOT 
SETF /BOT 
SETF FM 
SETF /FM 
SETF DSl /DSO 

SETF D7 D6 D5 D4 D3 /D2 Dl DO 
SETF 07 06 05 04 03 02 /Dl DO 
TRACE OFF 



FM EOT BOT CCO CCl CC2 CC3 /STATUS /DSELD 
: BEGINNING OF TAPE 
; ERASE 
INITIALIZE 
;WRITE 

;WRITE FILE MARK 
iREAD 

iREAD FILE MARK 
;READ STATUS 
: INVALID COMMAND 
; ASSERT EOT 
I NEGATE EOT 
; ASSERT BOT 
[NEGATE BOT 
; ASSERT FM 
; NEGATE FM 

I SET DRIVE NUMBER TO 3 
; SELECT DRIVE 3 
; UNSELECT DRIVE 3 
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THIS IS A SECOND REVISION OF AN APPLICATION FOR THE PROSE DEVICE. 
THE ORIGINAL APPLICATION IS CALLED 14R21.APP. THIS APPLICATION 
IMPLEMENTS A QIC-02 COMMAND DECODER USING A PROSE DEVICE, A PAL 
DEVICE, AND A PLE DEVICE. THE PROSE DEVICE FILE IS CALLED 
CMD PAL.APP AND THE PLE DEVICE FILE IS CALLED CMD PLE.APP. 



TITLE QIC-02 COMMAND DECODER 

PATTERN 

REVISION B 

AUTHOR KEN WON 

COMPANY MMI 

DATE JANUARY 19, 1987 



CHIP 



QIC02 CMD DEC PMS14R21 



CLK DCLK /ONLINE /REQUEST /CCO /CCl /CC2 /CC3 STATUS EWH SDI GND 

PRESET SDO CD3 CD2 CDl CDO INITIALIZE ERASE DRSELD READY MODE VCC 

STATE 

MOORE_MACHINE 

MASTER_RESET 

DEFAULT_BRANCH HOLD_STATE 

POWER_UP :=VCC->INIT 

;@@@e@@ EQUATIONS eeeeee 



.A***** DRIVE SELECTION ****** 



INIT 


:= REQ 


-> 


DRVSELDA 






+-> 


INIT 


DRVSELDA 


:= CONDO 


-> 


DRVSELDB 






+-> 


INIT 


DRVSELDB 


:= CONDI 


-> 


INIT 






+-> 


DRVSELDC 


DRVSELDC 


:= C0ND2 


-> 


INIT 






+-> 


DRVSELDD 


DRVSELDD 


:= C0ND3 


-> 


INIT 






+-> 


DRVSELDE 


DRVSELDE 


:= REQ 


-> 


DRVSELDE 






+-> 


DRVSELDF 


DRVSELDF 


:= VCC 


-> 


DRVSELD 



[IF /REQUEST IS TRUE, CHECK 
; FOR DRIVE SELECTION 
[OTHERWISE GOTO INIT 



[DRIVE IS SELECTED 
[WAIT FOR REQ DEACTIVATED 
[THEN GOTO DRVSELDF 
[DRIVE SELECTION COMPLETE 

INIT.OUTF:= /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*/DRSELD 

DRVSELDA. OUTF 

DRVSELDB. OUTF 

DRVSELDC. OUTF 

DRVSELDD. OUTF 

DRVSELDE. OUTF 

DRVSELDF. OUTF 



= READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*/DRSELD 
= READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*/DRSELD 
= READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*/DRSELD 
= READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*/DRSELD 
= /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*/DRSELD 
= READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*/DRSELD 



DRVSELD. OUTF := /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 



DRVSELD 



RDWRCMD 



RDWRCMDA 



FMCMD 



CMDDECB 



CMDDECC 



CMDDECD 



CMDDECE 



COMMAND DECODING 
:= REQ 



C0ND3 



:= CONDO 



CONDO 



:= C0ND2 



CONDO 



AAA 


AA 






-> 


CMDDECA 




+ -> 


DRVSELD 




-> 


RDWRCMD 




+ -> 


CMDDECB 




-> 


RDWRCMDA 




+ -> 


CMDERR 




-> 


FMCMD 




+ -> 


RDWRCMDB 




-> 


RDCMD 




+ -> 


WRCMD 




-> 


WFMCMD 




+ -> 


RFMCMD 




-> 


CMDDECC 




+ -> 


DESELDA 




-> 


CMDDECD 




+ -> 


CMDDECE 




-> 


INITCMD 




+ -> 


ERASECMD 




-> 


BOTCMD 




+ -> 


RDSTSCMD 



IF /REQUEST IS TRUE, 

DECODE COMMAND 
OTHERWISE GOTO DRVSELD 
IF CC3 IS TRUE, GOTO 

READ/WRITE DECODING 
OTHERWISE CONTINUE 

COMMAND DECODING 
IF CC2 IS TRUE, GOTO 

READ/WRITE DECODING 
OTHERWISE GOTO CMDERR 
IF CCl IS TRUE, COMMAND 

IS A FILE MARK COMMAND 
OTHERWISE READ OR WRITE 
IF CCO TRUE, COMMAND READ 
OTHERWISE COMMAND IS WRITE 
IF CCO IS TRUE, COMMAND IS 

WRITE FILE MARK 
OTHERWISE COMMAND IS READ 

FILE MARK 
IF CC2 IS TRUE, CONTINUE 

COMMAND DECODING 
OTHERWISE CHECK FOR 

DESELECTING DRIVE 
CONTINUE DECODING COMMAND 
CONTINUE DECODING COMMAND 
IF CCO TRUE, COMMAND INIT 
OTHERWISE COMMAND IS ERASE 
IF CCO TRUE, COMMAND BOT 
OTHERWISE COMMAND IS READ 

STATUS 
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CMDDECA.OUTF:= READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
CMDDECB.OUTF:= READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
CMDDECC.OUTF:= READY */CD3*/CD2*/CDl*/CD0*/ERASE*/INITIALIZE*DRSELD 
CMDDECD.OUTF:= READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
CMDDECE.OUTF:= READy*/CD3*/CD2*/CDl*/CD0*/ERASE*/INITIALIZE*DRSELD 
RDWRCMD.OUTF:= READY */CD3*/CD2*/CDl*/CD0*/ERASE*/INITIALIZE*DRSELD 
RDWRCMDA.OUTF:= READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
RDWRCMDB.OUTF:= READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
FMCMD.OUTF:= READy*/CD3*/CD2*/CDl*/CD0*/ERASE*/INITIALIZE*DRSELD 



;****** DESELECT DRIVE ****** 



DESELDA 


: = 


CONDI 


-> 


DESELDB 


DESELDB 


: = 


CONDO 


+-> 

-> 


DESELDC 
CMDERR 


DESELDC 


: = 


CONDO 


+ -> 
-> 


DESELDD 
DRVSELDE 


DESELDD 
DESELDE 


: = 


REQ 
VCC 


+ -> 
-> 

+-> 
-> 


CMDERR 
DESELDD 
DESELDE 
INIT 



CONTINUE CHECKING FOR 

DESELECT COMMAND 
OTHERWISE GOTO DESELDA 
IF ceo IS TRUE, COMMAND 

IS INVALID 
OTHERWISE DESELECT DRIVE 
IF ceo IS TRUE, GOTO DRIVE 

SELECTION 
OTHERWISE COMMAND INVALID 
WAIT FOR REQ DEACTIVATED 
THEN GOTO DESELDE 
GOTO DRIVE SELECTION 



DESELDA. OUTF 
DESELDB. OUTF 
DESELDC. OUTF 
DESELDD. OUTF 
DESELDE. OUTF 



= /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 

= /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 

= /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 

= /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 

= READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 



.AA**** INVALID COMMAND ****** 
CMDERR := REQ -> CMDERR 

+-> CMDERRA 



CMDERRA 
CMDERRB 
CMDERRC 



VCC 


-> CMDERRB 


VCC 


-> CMDERRC 


VCC 


-> DRVSELD 



WAIT FOR REQ DEACTIVATED 
THEN OUTPUT INVALID 

COMMAND CODE 
INVALID COMMAND CODE 
NEGATE CODE 
GOTO COMMAND DECODING 



CMDERR. OUTF: 
CMDERRA. OUTF 
CMDERRB. OUTF 
CMDERRC. OUTF 



/READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
= READY*CD3*CD2*CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
= READY*CD3*CD2*CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
= READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 



.A AAA A A BOT COMMAND ****** 
BOTCMD := REQ 



BOTCMDA 
BOTCMP 



:= STS 
:= VCC 



-> BOTCMD ;WAIT FOR REQ DEACTIVATED 
^-> BOTCMDA ;THEN REWIND COMMAND CODE 
-> BOTCMP ;WHEN STS IS TRUE, NEGATE 

; OUTPUT CODE 
-> DRVSELD ;GOTO DRVSELD 



BOTCMD. OUTF := /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
BOTCMDA. OUTF := READY*CD3*CD2*/CD1*CD0*/ERASE*/INITIALIZE*DRSELD 
BOTCMP. OUTF: = READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 



jAAAAAA INITIALIZE COMMAND ****** 



INITCMD 


• ~ 


REQ 


-> 

+ -> 


INITCMD 
INITCMDA 


INITCMDA 


: = 


STS 


-> 


INITCMDB 


INITCMDB 


; = 


VCC 


-> 


INITCMDC 


INITCMDC 


: = 


STS 


-> 


INITCMDD 


INITCMDD 


: = 


VCC 


-> 


INITCMDE 


INITCMDE 


: = 


STS 


-> 


INITCMP 


INITCMP 


: = 


VCC 


-> 


DRVSELD 



WAIT FOR REQ DEACTIVATED 
THEN OUTPUT REWIND COMMAND 

CODE & ASSERT INITIALIZE 
WHEN STS IS TRUE, NEGATE 

COMMAND CODE 
ASSERT FFWD COMMAND CODE 
WHEN STS IS TRUE, NEGATE 

COMMAND CODE 
ASSERT REWIND COMMAND CODE 
WHEN STS IS TRUE, NEGATE 

COMMAND CODE & INITIALIZE 



INITCMD. OUTF:= 
INITCMDA. OUTF 
INITCMDB. OUTF 
INITCMDC. OUTF 
INITCMDD. OUTF 
INITCMDE. OUTF 
INITCMP. OUTF:= 



/READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
= READY*CD3*CD2*/CD1*CD0*/ERASE*INITIALIZE*DRSELD 
= READY*/CD3*/CD2*/CD1*/CD0*/ERASE*INITIALIZE*DRSELD 
= READY*CD3*CD2*/CD1*/CD0*/ERASE*INITIALIZE*DRSELD 
= READY*/CD3*/CD2*/CD1*/CD0*/ERASE*INITIALIZE*DRSELD 
= READY*CD3*CD2*/CD1*CD0*/ERASE*INITIALIZE*DRSELD 

READY*/CD3*/CD2*/CDI*/CD0*/ERASE*/INITIALIZE*DRSELD 
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.****** ERASE COMMAND ****** 



ERASECMD.OUTF:= 
ERASECMDA.OUTF: 
ERASECMDB.OUTF: 
ERASECMDC.OUTF: 
ERASECMDD.OUTF: 
ERASECMDE.OUTF: 
ERASECMDF.OUTF: 



WAIT FOR /REQ DEACTIVATED 
THEN REWIND COMMAND CODE 
WHEN STS IS TRUE, NEGATE 

COMMAND CODE 
ASSERT ERASE 6 INITIALIZE 
WHEN STS IS TRUE, NEGATE 

ERASE AND INITIALIZE 
OUTPUT REWIND COMMAND CODE 
WHEN STS IS TRUE, NEGATE 

COMMAND CODE 
GOTO DRVSELD 



/READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
= READy*CD3*CD2*/CDl*CD0*/ERASE*/INITIALIZE*DRSELD 
= READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
= READY*/CD3*/CD2*/CD1*/CD0*ERASE*INITIALIZE*DRSELD 
= READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
= READY*CD3*CD2*/CD1*CD0*/ERASE*/INITIALIZE*DRSELD 
= READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 



ERASECMD 
ERASECMDA 


:= REQ 
:= STS 


-> ERASECMD 

+-> ERASECMDA 

-> ERASECMDB 


ERASECMDB 
ERASECMDC 


:= VCC 
:= STS 


-> ERASECMDC 
-> ERASECMDD 


ERASECMDD 
ERASECMDE 


:= VCC 
:= STS 


-> ERASECMDE 
-> ERASECMDF 


ERASECMDF 


:= VCC 


-> DRVSELD 



.****** READ STATUS COMMAND ****** 



RDSTSCMD 

RDSTSAA 

RDSTSA 

RDSTSB 



RDSTSD 
RDSTSE 



RDSTSF 



RDSTSG 
RDSTSH 



RDSTSJ 
RDSTSK 



RDSTSL 



RDSTSM 
RDSTSN 



RDSTSO 



RDSTSP 
RDSTSQ 



RDSTSCMD. OUTF 
RDSTSAA. OUTF: 
RDSTSA. OUTF 
RDSTSB. OUTF 
RDSTSC.OUTF 
RDSTSD. OUTF 
RDSTSE. OUTF 
RDSTSF. OUTF 
RDSTSG. OUTF 
RDSTSH. OUTF 
RDSTSI.OUTF 
RDSTSJ. OUTF 
RDSTSK. OUTF 
RDSTSL. OUTF 
RDSTSM. OUTF 
RDSTSN. OUTF 
RDSTSO. OUTF 
RDSTSP. OUTF 
RDSTSQ. OUTF 
RDSTSR.OUTF 



REQ 

:= VCC 
:= VCC 
:= REQ 

:= REQ 



VCC 
REQ 



REQ 



VCC 
REQ 



REQ 



VCC 
REQ 



= REQ 



VCC 
REQ 



REQ 



VCC 
REQ 



= REQ 



-> 


RDSTSCMD 


+-> 


RDSTSAA 


-> 


RDSTSA 


-> 


RDSTSB 


-> 


RDSTSC 


-> 


RDSTSC 


+-> 


RDSTSD 


-> 


RDSTSE 


-> 


RDSTSF 


-> 


RDSTSF 


+-> 


RDSTSG 


-> 


RDSTSH 


-> 


RDSTSI 


-> 


RDSTSI 


+-> 


RDSTSJ 


-> 


RDSTSK 


-> 


RDSTSL 


-> 


RDSTSL 


+-> 


RDSTSM 


-> 


RDSTSN 


-> 


RDSTSO 


-> 


RDSTSO 


+-> 


RDSTSP 


-> 


RDSTSQ 


-> 


RDSTSR 


-> 


RDSTSR 


+-> 


DRVSELD 



WAIT FOR REQ DEACTIVATED 

THEN NEGATE READY 

ENABLE STATUS BYTE 

ASSERT READY 

WHEN REQUEST IS ACTIVATED, 

NEGATE READY 
WAIT FOR REQUEST TO BE 

DEACTIVATED 
THEN ENABLE STATUS BYTE 1 
ASSERT READY 
WHEN REQUEST IS ACTIVATED, 

NEGATE READY 
WAIT FOR REQ DEACTIVATED 
THEN ENABLE STATUS BYTE 2 
ASSERT READY 
WHEN REQUEST IS ACTIVATED, 

NEGATE READY 
WAIT FOR REQ DEACTIVATED 
THEN ENABLE STATUS BYTE 3 
ASSERT READY 
WHEN REQUEST IS ACTIVATED, 

NEGATE READY 
WAIT FOR REQ DEACTIVATED 
THEN ENABLE STATUS BYTE 4 
ASSERT READY 
WHEN REQUEST IS ACTIVATED, 

NEGATE READY 
WAIT FOR REQ DEACTIVATED 
THEN ENABLE STATUS BYTE 5 
ASSERT READY 
WHEN REQUEST ACTIVATED, 
NEGATE READY & COMMAND 
WAIT FOR REQ DEACTIVATED 
THEN GOTO DRVSELD 



! = /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
= READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
READV*/CD3*/CD2*/CD1*CD0*/ERASE*/INITIALIZE*DRSELD 
/READY*/CD3*/CD2*/CD1*CD0*/ERASE*/INITIALIZE*DRSELD 
READY*/CD3*/CD2*/CD1*CD0*/ERASE*/INITIALIZE*DRSELD 
READY*/CD3*/CD2*CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
/READY*/CD3*/CD2*CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
READY*/CD3*/CD2*CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
READY*/CD3*/CD2*CD1*CD0*/ERASE*/INITIALIZE*DRSELD 
/READY*/CD3*/CD2*CD1*CD0*/ERASE*/INITIALIZE*DRSELD 
READY*/CD3*/CD2*CD1*CD0*/ERASE*/INITIALIZE*DRSELD 
READY*/CD3*CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
/READY*/CD3*CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
READY*/CD3*CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
READY*/CD3*CD2*/CD1*CD0*/ERASE*/INITIALIZE*DRSELD 
/READY*/CD3*CD2*/CD1*CD0*/ERASE*/INITIALIZE*DRSELD 
READY*/CD3*CD2*/CD1*CD0*/ERASE*/INITIALIZE*DRSELD 
READY*/CD3*CD2*CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
/READY*/CD3*CD2*CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
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.****** WRITE FILE MARK COMMAND ****** 
WFMCMD := REQ -> WFMCMD 



WFMCMDA 
WFMCMDB 

WFMCMDC 

WFMCMD. OUTF: 
WFMCMDA. OUTF 
WFMCMDB. OUTF 
WFMCMDC. OUTF 



:= VCC 

1= STS 



VCC 



+-> WFMCMDA 
-> WFMCMDB 
-> WFMCMDC 

-> DRVSELD 



WHEN REQUEST DEACTIVATED, 

NEGATE READY & COMMAND 
OTHERWISE GOTO WFMCMDA 
OUTPUT WFM COMMAND CODE 
WHEN STS IS ACTIVATED, 

NEGATE COMMAND CODE 
GOTO DRVSELD 



/READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
= READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
= READY*CD3*/CD2*/CD1*CD0*/ERASE*/INITIALIZE*DRSELD 
= READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 



.****** READ FILE MARK COMMAND ****** 
RFMCMD := REQ -> RFMCMD 



RFMCMDA 
RFMCMDB 



RFMCMDC 



VCC 
STS 



VCC 



h-> RFMCMDA 
-> RFMCMDB 
-> RFMCMDC 

-> DRVSELD 



WHEN REQUEST DEACTIVATED, 

NEGATE READY & COMMAND 
OTHERWISE GOTO RFMCMDA 
OUTPUT RFM COMMAND CODE 
WHEN STS IS TRUE, NEGATE 

COMMAND CODE 
GOTO DRVSELD 



RFMCMD. OUTF := /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
RFMCMDA, OUTF := READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
RFMCMDB. OUTF := READY*CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
RFMCMDC. OUTF := READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 



.****** READ COMMAND ****** 
RDCMD := REQ 

:= ONLN 



!= STS 



RDCMDB 




RDCMDD 




RDCMDF 




RDCMDG 




RDCMDE 




RDCMDC 




RDCMD. OUTF: 


RDCMDA 


OUTF 


RDCMDB 


OUTF 


RDCMDC 


OUTF 


RDCMDD 


OUTF 


RDCMDE 


OUTF 


RDCMDF 


OUTF 


RDCMDG 


OUTF 



= VCC 
= STS 
= REQ 



VCC 



-> RDCMD 

+-> RDCMDA 
-> RDCMDB 

+-> CMDERRA 

-> RDCMDC 

+-> RDCMDD 
-> RDCMDE 

+-> RDCMDF 

-> RDCMDG 
-> RDCMDC 
-> DRVSELD 

+-> RDCMDB 
-> DRVSELD 



WHEN REQUEST DEACTIVATED, 

NEGATE READY 
OTHERWISE GOTO RDCMD 
IF ONLINEB IS TRUE, OUTPUT 

READ COMMAND CODE 
IF ONLINEB FALSE, OUTPUT 

INVALID COMMAND CODE 
IF FILE MARK IS FOUND, 

NEGATE COMMAND CODE 
OTHERWISE GOTO RDCMDD 
IF ONLINE IS TRUE, GOTO 

RDCMDE 
OTHERWISE NEGATE COMMAND 

CODE 
OUTPUT REWIND COMMAND CODE 
NEGATE COMMAND COPF. 
IF REQUEST IS TRUE, GOTO 

DRVSELD 
OTHERWISE GOTO RDCMDB 
GOTO DRVSELD 



= READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
= /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
= /READY*CD3*/CD2*CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
;= /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
;= /READY*CD3*/CD2*CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
;= /READY*CD3*/CD2*CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
;= /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
;= /READY*CD3*CD2*/CD1*CD0*/ERASE*/INITIALIZE*DRSELD 



.****** WRITE COMMAND ****** 



WRCMD 



WRCMDC 



WRCMDD 
WRCMDE 



REQ 



:= EW 



:= VCC 



VCC 
REQ 



+-> WRCMDA 
-> WRCMDB 



+-> CMDERRA 
-> WRCMDC 



+-> WRCMDE 
-> WRCMDD 



-> DRVSELD 
-> DRVSELD 



+-> WRCMDF 
-> WRCMDB 



WHEN REQUEST DEACTIVATED, 

NEGATE READY 
OTHERWISE GOTO WRCMD 
IF ONLINE IS TRUE, OUTPUT 

WRITE COMMAND CODE 
IF ONLINE FALSE, OUTPUT 

INVALID COMMAND CODE 
IF EWH IS DETECTED, 

NEGATE COMMAND CODE 
OTHERWISE GOTO WRCMDE 
OUTPUT END OF MEDIA 

COMMAND CODE 
GOTO DRVSELD 
IF REQUEST IS TRUE, GOTO 

DRVSELD 
OTHERWISE GOTO WRCMDF 
IF ONLINEB IS TRUE, GOTO 

WRCMDB 
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WRCMDG 
WRCMDH 



WRCMDI 





+ -> 


WRCMDG 


NEGATE COMMAND CODE 


vcc 


-> 


WRCMDH 


OUTPUT WFM COMMAND CODE 


STS 


-> 


WRCMDI 


IF FM IS ACTIVATED, GOTO 
WRCMDI 




+ -> 


WRCMDH 


OTHERWISE GOTO WRCMDH 


VCC 


-> 


BOTCMDA 


REWIND TAPE 



WRCMD.OUTF:= READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 

WRCMDA.OUTF:= /READV*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 

WRCMDB.OUTF:= /READY*CD3*/CD2*CD1*CD0*/ERASE*/INITIALIZE*DRSELD 

WRCMDC.OUTF:= /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 

WRCMDD.OUTF:= /READY*/CD3*CD2*CD1*CD0*/ERASE*/INITIALIZE*DRSELD 

WRCMDE.OUTF:= /READY*CD3*/CD2*CD1*CD0*/ERASE*/INITIALIZE*DRSELD 

WRCMDF.OUTF:= /READY*CD3*/CD2*CD1*CD0*/ERASE*/INITIALIZE*DRSELD 

WRCMDG. OUTF:= /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 

WRCMDH. OUTF:= /READY*CD3*/CD2*/CD1*CD0*/ERASE*/INITIALIZE*DRSELD 

WRCMDI. OUTF:= /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 



;@@@@@@ CONDITIONS @@@@@@ 



CONDITIONS 


CONDO 


= CCO 


CONDI 


= CCl 


C0ND2 


= CC2 


C0ND3 


= CC3 


ONLN 


= ONLINE 


STS 


= STATUS 


EW 


= EWH 


REQ 


= REQUEST 



;@@@@@@ SIMULATION INPUT @@@@@@ 

SIMULATION 

TRACE_ON /CCO /CCl /CC2 /CC3 /REQUEST /ONLINE CD3 CD2 CDl CDO STATUS ERASE 

INITIALIZE READY DRSELD 
SETF /CC3 /CC2 /CCl /CCO /REQUEST /ONLINE /STATUS /EWH /PRESET /CLK 
SETF PRESET ; RESET DEVICE 

CLOCKF CLK ; SELECT DRIVE 

SETF /CC3 /CC2 /CCl CCO 
SETF REQUEST 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
SETF /REQUEST 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 

SETF /CC3 /CC2 /CCl /CCO ; INVALID COMMAND 
SETF REQUEST 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
SETF /REQUEST 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 

SETF /CC3 CC2 /CCl CCO ; BOT COMMAND 

SETF REQUEST 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
SETF /REQUEST 
CLOCKF CLK 
CLOCKF CLK 

SETF STATUS ; ASSERT STATUS, BOT 

CLOCKF CLK 
SETF /STATUS 
CLOCKF CLK 
CLOCKF CLK 
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•jtrr /ccj CC2 cci /ceo 

SETF REQUEST 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF /REQUEST 

CLOCKF CLK 

CLOCKF CLK 

SETF STATUS 

CLOCKF CLK 

SETF /STATUS 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF STATUS 

CLOCKF CLK 

SETF /STATUS 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF STATUS 

CLOCKF CLK 

SETF /STATUS 

CLOCKF CLK 

CLOCKF CLK 

SETF /CC3 CC2 CCI CCO 

SETF REQUEST 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF /REQUEST 

CLOCKF CLK 

CLOCKF CLK 

SETF STATUS 

CLOCKF CLK 

SETF /STATUS 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF STATUS 

CLOCKF CLK 

SETF /STATUS 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF STATUS 

CLOCKF CLK 

SETF /STATUS 

CLOCKF CLK 

CLOCKF CLK 

SETF /CC3 CC2 /CCI /CCO 

SETF REQUEST 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
SETF /REQUEST 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
SETF REQUEST 
CLOCKF CLK 
CLOCKF CLK 



; ERASE COMMAND 



ASSERT STATUS, BOT 



; ASSERT STATUS, EOT 



; ASSERT STATUS, BOT 



: INITIALIZE COMMAND 



; ASSERT STATUS, BOT 



; ASSERT STATUS, EOT 



; ASSERT STATUS, BOT 



;READ STATUS 
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SETF /REQUEST ;READ STATUS BYTE 1 

CLOCKF CLK 
CLOCKF CLK 
SETF REQUEST 
CLOCKF CLK 
CLOCKF CLK 

SETF /REQUEST ;READ STATUS BYTE 1 

CLOCKF CLK 
CLOCKF CLK 
SETF REQUEST 
CLOCKF CLK 
CLOCKF CLK 

SETF /REQUEST ;READ STATUS BYTE 2 

CLOCKF CLK 
CLOCKF CLK 
SETF REQUEST 
CLOCKF CLK 
CLOCKF CLK 

SETF /REQUEST ;READ STATUS BYTE 3 

CLOCKF CLK 
CLOCKF CLK 
SETF REQUEST 
CLOCKF CLK 
CLOCKF CLK 

SETF /REQUEST ;READ STATUS BYTE 4 

CLOCKF CLK 
CLOCKF CLK 
SETF REQUEST 
CLOCKF CLK 
CLOCKF CLK 

SETF /REQUEST ;READ STATUS BYTE 5 

CLOCKF CLK 
CLOCKF CLK 

SETF CC3 CC2 /CCl CCO ;READ COMMAND 

SETF REQUEST 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
SETF /REQUEST 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 

SETF /ONLINE ; NEGATE ONLINEB 

CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
SETF STATUS 
CLOCKF CLK 
SETF /STATUS 
CLOCKF CLK 
CLOCKF CLK 

SETF CCS CC2 /CCl CCO ;READ COMMAND 

SETF ONLINE 
SETF REQUEST 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
SETF /REQUEST 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
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SETF STATUS ;FILE MARK FOUND 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF /STATUS 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF CC3 CC2 /CCl CCO ;READ COMMAND 

SETF REQUEST 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF /REQUEST 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF CC3 CC2 CCl /CCO ;READ FILE MARK 

SETF REQUEST 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF /REQUEST 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF STATUS 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF /STATUS 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF CC3 CC2 /CCl CCO ;READ COMMAND 

SETF /ONLINE 

SETF REQUEST 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF /REQUEST 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF CCS CC2 /CCl /CCO ; WRITE COMMAND 

SETF ONLINE 

CLOCKF CLK 

SETF REQUEST 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF /REQUEST 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF /ONLINE ; NEGATE ONLINEB 

CLOCKF CLK 

CLOCKF CLK 
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CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF STATUS 

CLOCKF CLK 

CLOCKF CLK 

SETF /STATUS 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF STATUS ; ASSERT STATUS, BOT 

CLOCKF CLK 

SETF /STATUS 

CLOCKF CLK 

CLOCKF CLK 

TRACE OFF 
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PLE5P16 

12345 

COMMAND CODE DECODER 

MMI, SANTA CLARA, CALIFORNIA 



PLE DESIGN SPECIFICATION 
KEN WON 1/27/87 



STSO 

STSl 

STS2 

STS3 

STS4 

STS5 

EOM 

RFM 

WFM 

READ 

WRITE 

FFWD 

RWD 

ICMD 

FUNCTION TABLE 



= /CD3*/CD2*/CD1*CD0 
= /CD3*/CD2*CD1*/CD0 
= /CD3*/CD2*CD1*CD0 
= /CD3*CD2*/CD1*/CD0 
= /CD3*CD2*/CD1*CD0 
= /CD3*CD2*CD1*/CD0 
= /CD3*CD2*CD1*CD0 
= CD3*/CD2*/CD1*/CD0 
= CD3*/CD2*/CD1*CD0 
= CD3*/CD2*CD1*/CD0 
= CD3*/CD2*CD1*CD0 
= CD3*CD2*/CD1*/CD0 
= CD3*CD2*/CD1*CD0 
= CD3*CD2*CD1*/CD0 



ENABLE STATUS BYTE 

ENABLE STATUS BYTE 1 

ENABLE STATUS BYTE 2 

ENABLE STATUS BYTE 3 

ENABLE STATUS BYTE 4 

ENABLE STATUS BYTE 5 

END OF MEDIA 

READ FILE MARK 

WRITE FILE MARK 

READ 

WRITE 

FAST FORWARD 

REWIND 

INVALID COMMAND 



NC CD3 CD2 CDl CDO EOM ICMD RWD FFWD WRITE READ WFM RFM 
STS5 STS4 STS3 STS2 STSl STSO 



, INPUT / OUTPUT. 



LLLLL 
LLLLH 
LLLHL 
LLLHH 
LLHLL 
LLHLH 
LLHHL 
LLHHH 
LHLLL 
LHLLH 
LHLHL 
LHLHH 
LHHLL 
LHHLH 
LHHHL 



NO COMMAND 

ENABLE STATUS BYTE 

ENABLE STATUS BYTE 1 

ENABLE STATUS BYTE 2 

ENABLE STATUS BYTE 3 

ENABLE STATUS BYTE 4 

ENABLE STATUS BYTE 5 

END OF MEDIA 

READ FILE MARK 

WRITE FILE MARK 

READ 

WRITE 

FAST FORWARD 

REWIND 

INVALID COMMAND 



DESCRIPTION 

THIS PLE DEVICE IS ONE OF THREE DEVICES USED TO IMPLEMENT A QIC-02 
COMMAND SEQUENCER. THIS DEVICE DECODES THE OUTPUT CODE FROM THE 
PROSE DEVICE TO GENERATE CONTROL SIGNALS. THE OTHER TWO DEVICES ARE 
A 20L8 PAL DEVICE AND A 14R21 PROSE DEVICE. 
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One of the more common logic functions performed on serial 
data is the data encode/decode function. Usually it is desir- 
able to map (encode) the logical bit stream to a physical bit 
stream, adjusting for the peculiarities of the particular trans- 
mission or storage media. 

Noise, bandwidth, and reliability considerations may mean 
that a different data format would be desirable when data is 
sent along to or stored on a given media. For example, group- 
coded recording (GCR) formats take a given number of data 
bits and encode them with a larger number of bits. A 4B-5B 



GCR code would take 4 data bits and encode them into 16 
states with 5 new bits. A particular 4B-5B code is shown in 
Figure 1. 

This mapping allows at most two zeros to occur in succes- 
sion. Also note that data combinations with more than one 
zero at the beginning and end of the word are excluded. This 
is necessary to Insure that when data words are serialized, no 
more than two zeros occur in succession at any point in the 
bit stream. Finally, the data combination 11111 is reserved as 
a synchronization mark. In tape systems, this results in in- 
creased bit density and eases clock syncfironization. 



4B-5B Code 



4-Blt Data 


1 
10 
11 
10 
10 1 
110 
111 
10 
10 1 
10 10 
10 11 
110 
110 1 
1110 

1111 



5-Bit Data 

110 1 
110 11 
10 10 
10 11 
1110 1 
10 10 1 
10 110 
10 111 
110 10 
10 1 
10 10 
10 11 
11110 
110 1 
1110 
1111 



Figure 1. 4B-5B Code 
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The system diagram in Figure 2 shows how the GCR Encod- 
er/Decoder (GCR E/D) interfaces to a tape drive and tape con- 
troller. Parallel input data is given to the GCR E/D, converted to 
the 5-bit format, serialized, and written to the tape. On a read, 
the serial data from the tape is parallelized, converted back to 
the 4-bit format and output to the output data bus. Additionally, 
during a read, two status signals are developed. The first signal, 
INV, indicates the presence of an invalid input J. e., too many 
zeros in succession. The second status signal, H, indicates the 
detection of the synchronization mark (11111). 

The operation modes for the GCR E/D are shown in the Data- 
Flow Diagrams of Figure 3. The control signal and operation 
functions are indicated for each operation mode. In particular, the 
data flow between each bit of the output register is indicated 
schematically. 

The first mo de of op eration of the GCR E/D is the HOLD 
mode. When ENABLE is HIGH, all data operations on the out- 
put register are disabled, independent of the two mode con- 
trols, Ml and Mq. The output data is simply fed-back to the 
register inputs. Thus the register content is retained afterthe 
clock transition. 



When the ENABLE input is LOW, the operations indicated by 
the Mi and Mq mode bits areexecuted on the clocktransition. 
When Ml and Mo are both LOW, the SERIAL SHIFT IN mode is 
selected. In this mode the output register is configured as a 
serial shift register. The serial input is consecutively shifted 
into the register until all 5 bits from the tape have been stored, 
MSB at Y3 and LSB at SERIAL OUT 



The CONVE RT SERIA L INPUT AND LOAD operation is se- 
lected when ENABLE is LOW, M, is HIGH and Mq is LOW. 
After the 5 bits of data have been serialized by the SERIAL 
SHIFT IN instruction, the 5B code must be converted to a 4B 
code. This is accomplished by taking the outputs of the 5 
register bits and converting them to 4 bits with combinatorial 
logic. On the clock transition, the result is loaded into the Y 
register. On the same clock transition that loads the con- 
verted data into the Y register, the serial input is loaded into 
the serial output register. Because the serial data is being 
read continuously, one data bit per clock transition, the con- 
version must be done without missing a serial data bit. 

The CONVER T PARALL EL INPUT AND LOAD operation is 
selected when ENABLE is LOW, M^ is HIGH and Mq is HIGH. 
This mode takes the 4 input data bits and converts them to 
the 5 bit representation. The result is loaded into the output 
register on the clock transition. The LSB of the 5B representa- 
tion is loaded into the Y3 bit of the output register and the 
MSB is loaded into the serial output bit. This configuration, in 
conjunction with the next Instruction, allows the serial data 
to be written to the tape drive one bit per clock transition. 

The fina l operation, SERIAL SHIPT OUT, is selected when 
ENABLE is LOW, Mi is LOW and Mq is HIGH. After the CON- 
VERT PARALLEL INPUT AND LOAD operation is executed, 
the SERIAL SHIFT OUT operation outputs the converted data 
to the tape drive. A series of one convert operation followed 
by 4 shift operations will transfer a sequence of 5-bits to the 
tape drive, one bit per clock cycle. 



< 
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Figure 2. Typical Tape Storage System 
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ENABLE Ml Mo OPERATION 



DATA-FLOW DIAGRAM 



X X 
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^ 
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SERIAL 
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INPUT 
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Figure 3. GCR E/D Mode Definitions 
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Design Approach (AmPAL16R6) 

The PAL device implementation of the GCR Encoder/Decoder 
takes advantage of the multiplex er-lil<e stru cture of the AND-OR 
array. Each valid combination of ENABLE, M^ and Mq selects a 
different set of AND terms. In some cases, only one term is 
selected (in data steering operations for example). In other 
cases, multiple AND terms are selected to implement a com- 
binatorial logic function (the 5B-to-4B conversion for example). 
This concept, using the control inputs to enable one or more 
AND terms, allows the direct implementation of the PAL device 
design from the mode Data-Flow Diagrams (with a little Kar- 
naugh map help). The K-Maps for the 5B-to-4B conversion logic 
and the 4B-to-5B conversion logic for the Y3 output are shown in 
Figures 4 and 5. Given these maps and the flow diagrams in 
Figure 3, the Boolean equations can be constructed for the Y3 
output. The resulting equation, in PALASM software format, is 
shown in Figure 6. 

It is important to note that t he eq uation in Figure 6 is written for 
the inverse of the Y3 output (Y3). This is necessary if true data is 
desired on the output pin because of the inverting nature of the 
output buffer on the PAL device. The inverted form of the equa- 
tion is easily implemented by selecting the negative version of 
the data (Y3 in the hold operation for example) or by grouping 
zeros in a combinatorial logic function (see Figures 4 and 5). 
Notice that the multiplexer strategy works equally well for active- 
LOW or active-HIGH logic functions. 

Once the transformation of the Data-Flow Diagrams and 
K-Maps to Boolean equations is understood, the interested 



reader should be able to construct K-Maps for the other Y out- 
puts and, in conjunction with the Data-Flow Diagrams of 

Figure 3, write the PALASM software equations for the resulting 
logic functions. This exercise will help the reader to fully appreci- 
ate the advantages of the Data-Flow Diagram/Multiplexer 
method of PAL design. Consult the full PALASM software listing 
(Figure 10) for the complete solutions. 

It is important to note that the diagrams and equations in Figures 
7, 8, and 9 specify the true output for invalid signal (INV), rather 
than INV which appears in the system diagram of Figure 2. This 
is necessary if the correct data is desired on the output pin and 
is due to the inverting nature of the output buffer on the PAL 
device. 

The INV signal is registered and held until the clear INV flag 
input (CIF) is brought LOW, deactivating the flag. Only during a 
5B-4B conversion operation (Mi = HIGH, Mq = LOW) is the 
INV flag activated. Figures 7 and 8 show the INV flag mode 
definitions and the intermediate INVALID logic equation 
respectively 



In this case, an active-LOW output is desired so the active-HIGH 
form of the INV signal is developed internally. Ones are grouped 
in the intermediate combinatorial logic function (INVALID) and 
the true version of the data is selected. The complete PALASM 
software equation for INV is given in Figure 9. 
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A3 = Y3*Y2-HY3*SoUT 



Figure 4. 5B-to-4B Conversion K-IMap for Y3 Output 
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Figure 5. 4B-to-5B Conversion K-iMap for Y3 Output 



Y3:= EN*Y3 

+ EN*Mi*Mo*^ 
+ EN * Ml * Mo * SoUT 
+ EN • Ml * Mo * Y3 • SoUT 
+ EN'Mi^Mo*"^*"^ 
+ In * Ml * Mo * D3 • Do 
+ EFI*Mi* Mo*Di* Do 



HOLD 

SERIAL SHIFT IN 
SERIAL SHIFT OUT 
CONVERT SERIAL 
INPUT AND LOAD 
CONVERT PARALLEL 
INPUT AND LOAD 



Figure 6. PALASM Equation for Y3 
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Figure 7. INV Flag Mode Definitions 
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Figure 8. Intermediate Equation for INVALID 



INV:= GIF* INV 

ClF'Mi^Mo* Y3* Y2 

CIF«Mi*mB*yS*s5ut 

cTf*Mi •M5*Y2*Yi* Yo 

CiF • Mi • Mo • Y3 * Y2 • Yi • Yo « SoUT 



+ ;HOLD INV FLAG 

+ ;SET INV FLAG IF INVALID IS TRUE 



Figure 9. PALASiVI Equation for INV 
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TITLE 4B-5B ENCODER/DECODER 

PATTERN PAT003 

REVISION 01 

AUTHOR WARREN MILLER 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/06/87 

CHIP ENCO_DECO PAL16R6 

CK Ml MO D3 D2 Dl DO /EN /CIF'GND 
/E • SIN /INV YO Yl Y2 Y3 SOUT /H VCC 

EQUATIONS 



/SpUT := EN*/SOUT 

/EN*/M1*/M0*/SIN 
/EN*/M1* MO*/YO 
/EN* M1*/M0*/SIN 
/EN* Ml* MO* D3* Dl 
/EN* Ml* MO* D3* DO 



HOLD 

SERIAL SHIFT IN 

SERIAL SHIFT OUT 

CONVERT SERIAL INPUT AND LOAD 

CONVERT PARALLEL INPUT AND LOAD 



/YO := EN*/YO 

+ /EN*/M1*/MO*/SOUT 

+ /EN*/M1* M0*/Y1 

+ /EN* M1*/MO*/SOUT 

+ /EN* M1*/M0* Y3* Y2*/Y0 

+ /EN* Ml* M0*/D3* Dl 

+ /EN* Ml* M0*/D3* D2* DO 

/Yl := EN*/Y1 

+ /EN*/M1*/M0*/Y0 

+ /EN*/M1* M0*/Y2 

+ /EN* M1*/M0*/Y0 

+ /EN* M1*/M0* Y3* Y2 

+ /EN* Ml* M0*/D2 

/Y2 := EN*/Y2 

+ /EN*/M1*/M0*/Y1 

+ /EN*/M1* M0*/Y3 

+ /EN* M1*/M0*/Y1 

+ /EN* Ml* M0*/D3*/D1*/D0 

+ /EN* Ml* M0*/D3* D2*/D1 

+ /EN* Ml* MO* D3*/D1* DO 

/Y3 := EN*/Y3 

+ /EN*/M1*/M0*/Y2 

+ /EN*/M1* M0*/SOUT 

+ /EN* M1*/M0* Y3* SOUT 

+ /EN* M1*/M0* Y3*/Y2 

+ /EN* Ml* MO* D3*/D0 

+ /EN* Ml* MO* D1*/D0 

INV := /CIF* INV 

+ /CIF* Ml*/M0*/Y3*/Y2 

+ /CIF* M1*/M0*/Y2*/Y1*/Y0 

+ /CIF* Ml*/M0*/Y0*/SOUT 

+ /CIF* M1*/M0* Y3* Y2* Yl* YO* SOUT 

H = Y3* Y2* Yl* YO* SOUT 
; SIMULATION NOT INCLUDED 



HOLD INV FLAG 

SET INV FLAG IF INVALID TRUE 



Figure 10. PAL ASM Listing 
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Industrial control can be defined as simply as a basic state 
machine: process information is gathered and analyzed, and 
outputs feed back to the process to modify it based upon the 
analysis. Thus, PAL device applications are often state 
machines. 

A very basic industrial control system is a heater thermostat. 
Temperature is sensed, compared to a maximum upper or lower 
limit, and the heater is either turned on or off. A small state 
machine PLD can implement this function. 

Much of the circuitry in industrial control applications is analog or 
linear. Process information is gathered by sensors, and then the 
information is conditioned to eliminate noise. The conditioned 
signal is then converted to a digital signal, and processed by a 
digital processor. The control outputs are then converted to 
appropriate analog signals to modify the process (Figure 1). 

Data Acquisition 

The sensing part of the system is data acquisition. Data acquisi- 
tion is important not only in industrial control applications, but also 
in scientific monitoring applications. Data acquisition requires the 
capture of analog signals and conversion to useful information 
that can be analyzed. This first requires conditioning of the input, 
to filter out noise. The signal can then be analyzed with analog 
circuitry, but the power of digital microprocessors makes conver- 
sion to digital logic more efficient. 

More logic is being incorporated into the acquisition circuitry. The 
analog circuitry for signal conditioning is a natural addition to a 
sensor. Analog-to-digital conversion (ADC) can even be per- 
formed in the same unit. Advanced sensors may even incorpo- 
rate a microprocessor to provide immediate local analysis of the 
data, offloading the central processor. In these applications, PAL 
devices can form the glue logic for the microprocessor. 

Data Analysis 

Data analysis has been performed in the past by a central 
computer. The central computer would have the task of polling all 
of the sensors and analyzing the incoming data. However, in a 
system where thousands or millions of sensors are active, or 
where immediate response to process changes is required, this 
system is inefficient. 



The type of system more commonly used today includes distrib- 
uted control and processing (Figure 2). Microcomputers or 
programmable controllers can perform low-level processing and 
react with control signals immediately. Often single-board com- 
puters with added analog conversion circuitry are used. 

A LAN connection between a number of intelligent sensors and 
processors provides a very efficient system. Simple RS-232 links 
connect sensors. Buses such as Multibus or VME bus are 
commonly used to connect the boards together. Standards such 
as MAP (Manufacturing Automation Protocol) are being devel- 
oped for future LAN-based systems, allowing dissimilar devices 
to be connected in a network. Fiber-optic links will provide the 
data reliability over long distances in confined areas required for 
industrial control. 
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Figure 2. 



In these areas, standard computer applications for PAL devices 
are numerous. Bus interface is another key application area for 
communication within distributed systems. 

Control 

The most critical part of the system is control. The system must 
respond to the analyzed information with signals that modify the 
process under observation. In a system such as a nuclear 
reactor, immediate and accurate response is the most important 
objective. 
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PID Algorithm 

For smooth control, the PID algorithm for analysis and control is 
often used. P stands for Proportional: the incoming data is 
compared to a reference number, and then the process is 
modified in proportion to the difference between the desired and 
measured values. For example, a heater thermostat that meas- 
ures air temperature of 55 degrees, when its reference value was 
set at 65 degrees, would turn on the heater, and reduce the 
amount of heat as the temperature got closer to the reference 
value. 

Proportional analysis alone often causes undershoot or over- 
shoot. If the heater works quickly, it can heat the air well past the 
reference value. And if it works slowly, it may take a long time to 
heat the air back to the reference value. 

Forthese reasons, the Integral (I) part of the algorithm is added. 
The integral is taken of the difference between the reference 
value and measured value; in effect, this measures how long the 
measured value has been off from the reference value. Using the 
integral prevents undershoot and overshoot. 

However, the result can be a control output that causes the 
measured value to continue oscillating around the reference 
value. For example, if the heater is turned on at 64 degrees and 
an air conditioner is turned on at 66 degrees, the temperature will 
oscillate rapidly around the reference point of 65 degrees. 

For this reason, a Derivative (D) input is used. The derivative 
looks at the rate of change of the measured value, and will slow 
down the response if the measured value is rapidly approaching 
the reference value. 

The complete PID algorithm is 

OUT •= Kp X ERR + Ki X JeRR + Kd x dERR/dt, 

combining present, past, and future input considerations. Selec- 
tionof the proper constants allows the control responseto be both 
rapid and smooth. 

The PID algorithm requires mathematical processing capabili- 
ties, similar to those used in digital signal processing (see the 
section on Digital Signal Processing, page 2-283). 

Control circuitry is also combining analog and digital logic. Smart 
power ICs can perform logic functions while receiving and send- 
ing high-power signals. These ICs can perform simple functions 
likeovervoltage protection, cutting off acircuit if the powersurges. 

Applications 

Applications for industrial control systems are varied, and include 
the following: 

• Factory Automation 

Automated distribution centers 
Power grid monitoring and control 

• Motor Control 

Robots 

Engine testing 
Torque control 



• Transportation 

Air traffic control 
Subway system control 

• Automotive Electronics 

• Satellite Communication 

• Simulators 

Performance Requirements 

The key requirement of industrial control systems is the need for 
real-time processing. An air-traffic control system cannot be 
allowed to slow down if overloaded. During overloading in real 
systems, the information provided on display screens is reduced, 
instead of completely losing all information on a plane. 

An interrupt handling system is more efficient than a standard 
polling system. Many I/O are required, with high DAC and ADC 
accuracy. For data analysis, a large memory and good memory 
management is important. Workstations with a high I/O speed 
are often used for central industrial controllers. 

PAL Device Usage 

Most of the circuitry in an industrial control application is analog 
or linear. Onlythe analyzer is primarily digital logic. The analyzer, 
whether local or central, uses a standard computer architecture. 

Programmable logic devices, especially CMOS PLDs, are used 
in industrial control much as they are used in other computer 
applications. High speed is important in real-time control applica- 
tions involving a large amount of data. 

CMOS PAL Devices 

Because of the harsh environment often found in factories or 
around motors, CMOS technology is required. Specifications for 
the industrial range CMOS ZPAL family are guaranteed over a 
wider temperature range, and a wider supply voltage range. In 
addition, CMOS inputs are less sensitive to noise fluctuations. 
The lower power consumed by all CMOS PAL devices allows a 
smaller power supply, important in the small areas controllers 
often must be placed. Also, it is easierfor a battery to back up the 
power supply for critical operations. The reduced heat is good for 
enclosed areas, and allows CMOS devices to be placed closer 
together, even in surface-mounted devices, to save room. 

The erasability of CMOS PAL devices allows changes to refer- 
ence values, PID control constants, or other process control 
changes. Erasability also allows easy field service upgrades or 
repair. 

PAL Device Applications 

PAL devices find applications in all levels of industrial control, 
from the low-level sensing equipment to the mainframe analysis 
computers. 

Low-Level Applications 

Low-level data acquisition and response circuitry can be based 
on an individual state machine PLD. For example, stepper 
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motors are often used for mechanical valves. Asingle PALdevice 
can be programmed to convert enable, direction, and stepping 
information into the appropriate signals to hold or step the motor 
(page 2-550). 

Another example is shaft encoding (page 2-558). Shaft encoders 
convert the position of a rotating shaft into a digital signal. The 
position is sensed optically, and the digital information is encoded 
into position, speed, and direction signals. 

More complex applications may be handled by dedicated state 
machine PLDs, including the PROSE device or the Am29PL1 41 . 

Medium-Range Applications 

Dedicated microcontrollers can be used in medium-range appli- 
cations, such as small controllers linked to a central computer. 
These applications require the intelligence of a microcontroller 
but not a complete microprocessor-based computer system. In 
these applications, PAL devices form the glue logic between the 
microcontroller and its surrounding chips. 

High-End Applications 

Dedicated local computers and central data processing comput- 
ers are the high-end applications for PLDs. Here, microproces- 
sors are the brains of the system and PLDs perform many of the 
support functions. 



Microprocessor support is a key area of application (see the 
section titled Microprocessor-Based Systems, page 2-131 ). This 
is especially true of distributed control systems, where several 
microprocessors perform local processing. A standard architec- 
ture can be modified for different areas of the process by program- 
ming the PALdevices differently. PALdevices also allow the local 
controllers to take up less space. 

Another key application area is bus interface for distributed 
systems (seethe section titled Bus Interface, page 2-325). Either 
synchronous or asynchronous buses may be used. 

Since industrial controllers are usually interrupt-driven for effi- 
cient response, interrupt logic forms another potential application 
area for PAL devices. 

Digital signal processing is often performed in the analysis 
section of the industrial controller, usually in a central processor. 
The applications would not require the speed or complexity of 
most DSP applications, but would use similar circuitry. See the 
section titled Digital Signal Processing. 
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Functional Description 

stepper motors and linear actuators are used in a variety of 
applications requiring precise rotational and/or linear move- 
ment. Examples are printers, floppy disk drives, mechanical 
valves, etc. Stepper motors are two-phase permanent magnet 
motors which provide discrete angular movement every time the 
polarity of a winding is changed. In linear actuators the angular 
movement is converted to linear movement via a load screw. In 
essence, they are dc motors without brushes, where the user 
provides commutation with external logic. 

In these examples, the control and drive circuitry for such a 
motor is implemented digitally using PAL devices. 

Circuit Operation 

One type of drive circuit, unipolar drive, is shown in Figure 1 . Two 
drive sequences are given in Tables la and 1 b. Angular rotation 
is achieved by saturating the transistor drivers in the sequence 
shown in the appropriate table (full or half step). Now, assume 
the circuit of Figure 1 is connected to a stepper motor designed 
for 7.5° steps. By following the step sequence of Table la (full 
step), the shaft will rotate 7.5° each time the state is changed. If 
the sequence of Table 1 b is followed, a 3.75° (half-step) rotation 
will result for each change of state. For both step sequences, the 
direction can be reversed by stepping backwards through the 
table (step 4-3-2-1 -4-etc.). 
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Table 1a 
FULL STEP SEQUENCE 
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Table 1b 
HALF STEP SEQUENCE 
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PAL Device implementation 

In this application, one PAL16R4 can be used to provide the logic 
levels required to drive two stepper motors in the full step mode. 
Due to the high current drive required (100-400 mA/phase), 
external inverting high current buffers must be used (ULN 2001 
or equivalent). In the design, the following features are provided 
within the PAL device: 



• Enable/Disable inputs to enable stepping in either section. 
(E inputs). 

• Select clockwise or counter-clockwise rotation. 

• Set the motor to logic state step 1. 

A block diagram/pinout is shown in Figure 2. 
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Figure 2 



A function table for each motor control section is given below. 
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Hold motor in current position 
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Set outputs to step 1 levels 
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Step motor clockwise 
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Step motor counter-clockwise 



The full step sequence (Table_1a) can be_simplified from 4 
outputs to 2 outputs since Q1 = QO and Q3 = Q2. The sequence 
can then be expressed as follows: 



QOn+1 = 1, Qlp+i = 0, Q2n+i = 1, Q3n+i = 

when El = 1 orE2= 1: 

QOn+1 = QOn- Q1n+1 = Q^n. Q2n+i = 02^, 03^+1 = Q3 



D = 


D = 1 


STEP 
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The step sequences can be converted to equations by use of a 
Karnaugh map. 

QOn 



Q2r 



QO 



n+1 





3 








c 







<z 


1 J 






QOn+1 = Q2n • D + Q2n • D 



These equations make up the first design file. 

Conclusion 

Although this example could be used "as is" in a stepper motor 
application, the programmability of PAL devices could allow for 
any desired modifications. Changes to the circuit might include: 

1. Drive only one stepper motor, using a PAL16R6. The other 
flip-flops could be used as a programmable counter, allowing 
for different speed settings. 

2. Drive only one stepper motor, using the extra inputs and 
outputs to handle other circuit functions. 

3. Drive only one stepper motor, using a PAL16R6. The other 
flip-flops could be used as a 4-bit position counter. 

4. The substitution of a PAL16R8, and another inverting buffer 
would allow the driving and control of four stepper motors. 

5. Reprogram for half-step operation. This is done in the second 
design example. 
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Q2n+i = QOn • D + QOn • D 



Factor in El and E2: 

QOn+1 = El • E2 • Q2n • D + El • E2 • Q2n • D 

Q2n+i = eT • E2 • QOp • D + El • E2 • QOn • D 

Express the set function as an equation: 
QOn+1 = Ei • IT- S Q2n+i = El • E2 • S 



Express the hold function (when El or E2 = 1) 
QOn+1 = QOn • El + QOp • E2 



Q2 



n+1 



Q2n • El + Q2n • E2 



Combining all the above: 



QOn+1 
Q1n+1 
Q2n+1 
Q3n+1 



El • E2 • S + QOp • El + QOp • E2 + El • E2 • Q2n • D + El • E2 • Q2n • D 

Qpn+i 

El • E2 • S + Q2n • El + Q22 • E2 + El • E2 • QOp • D + El • E2 • QOn • D 



= Q2 



n+1 
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TITLE 

PATTERN 

REVISION 

AUTHOR 

COMPANY 

DATE 



STEPPER MOTOR CONTROLLER 

P7015 

1.00 

DAVE SACKETT 

DEVOE COMPANY, INDIANAPOLIS, 

02/23/81 



CHIP SMC PAL16R4 

CLK /ElA /E2A SA DA /ElB /E2B SB DB GND /OC 
/Q3B /QIB /Q2B /QOB /Q2A /QOA /Q3A /QIA VCC 

EQUATIONS 



INDIANA 



QOA := Q0A*/E1A 

+ QOA* /E2A 
+ SA * ElA* E2A 
+ /Q2A* ElA* E2A* DA 
+ Q2A* ElA* E2A*/DA 



QIA = /QOA 
QIA.TRST 



VCC 



Q2A := Q2A*/E1A 

+ Q2A* /E2A 
+ SA * ElA* E2A 
+ QOA* ElA* E2A* DA 
+ /QOA* ElA* E2A*/DA 

Q3A = /Q2A 

Q3A.TRST = \/CC 

QOB := Q0B*/E1B 

+ QOB* /E2B 
+ SB * ElB* E2B 
+ /Q2B* ElB* E2B* DB 
+ 02B* ElB* E2B*/DB 

QIB = /QOB 

QIB.TRST ^ VCC 



HOLD IF NOT El 

HOLD IF NOT E2 

STEP 1 IF SET 

LOAD /Q2A IF COUNTER-CLOCKWISE 

LOAD Q2A IF CLOCKWISE 



HOLD IF NOT El 

HOLD IF NOT E2 

STEP 1 IF SET 

LOAD QOA IF COUNTER-CLOCKWISE 

LOAD /QOA IF CLOCKWISE 



HOLD IF NOT El 

HOLD IF NOT E2 

STEP 1 IF SET 

LOAD /Q2B IF COUNTER-CLOCKWISE 

LOAD Q2B IF CLOCKWISE 



IR— dIHii 



Q2B := Q2B*/E1B 

+ Q2B* /E2B 
+ SB * ElB* E2B 
+ QOB* ElB* E2B* DB 
+ /QOB* ElB* E2B*/DB 

Q3B = /Q2B 

Q3B.TRST = VCC 

SIMULATION 

SETF OC ElA E2A SA ElB E2B SB 
CLOCKF CLK 



HOLD IF NOT El 

HOLD IF NOT E2 

STEP 1 IF SET 

LOAD QOB IF COUNTER-CLOCKWISE 

LOAD /QOB IF CLOCKWISE 



^ 
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CHECK QOA /QIA Q2A /Q3A QOB /QIB Q2B /Q3B 

SETF /ElA /E2A /ElB /E2B 

CLOCKF CLK 

CHECK QOA /QIA 02A /Q3A QOB /QIB Q2B /Q3B 

SETF ElA E2A /SA /DA ElB E2B /SB DB 

CLOCKF CLK 

CHECK QOA /QIA /Q2A Q3A /QOB QIB 02B /Q3B 

CLOCKF CLK 

CHECK /QOA QIA /Q2A Q3A /QOB QIB /02B Q3B 

CLOCKF CLK 

CHECK /QOA QIA Q2A /Q3A QOB /QIB /Q2B Q3B 

CLOCKF CLK 

CHECK QOA /QIA Q2A /Q3A QOB /QIB Q2B /Q3B 

CLOCKF CLK 

CHECK QOA /QIA /Q2A Q3A /QOB QIB Q2B /Q3B 

CLOCKF CLK 

CHECK /QOA QIA /Q2A Q3A /QOB QIB /Q2B Q3B 

SETF DA /DB 

CLOCKF CLK 

CHECK QOA /QIA /Q2A Q3A /QOB QIB Q2B /Q3B 

SETF /ElA /ElB 

CLOCKF CLK 

CHECK QOA /QIA /Q2A Q3A /QOB QIB Q2B /Q3B 

SETF ElA /E2A ElB /E2B 

CLOCKF CLK 

CHECK QOA /QIA /Q2A Q3A /QOB QIB Q2B /Q3B 

SETF SA SB 
CLOCKF CLK 
CHECK QOA /QIA /Q2A Q3A /QOB QIB Q2B /Q3B 

DESCRIPTION 

THIS PAL16R4 PROVIDES THE LOGIC LEVELS REQUIRED TO DRIVE TWO 
STEPPER MOTORS IN THE FULL STEP MODE. 

THE FOLLOWING OPERATIONS MAY BE PERFORMED FOR EACH STEPPER 
MOTOR CONTROLLER INDIVIDUALLY: 



CLK /El /E2 



OPERATION 



X H X X X HOLD MOTOR IN CURRENT POSITION 

X X H X X HOLD MOTOR IN CURRENT POSITION 

C L L H X SET OUTPUTS TO STEP 1 LEVELS 

C L L L L STEP MOTOR CLOCKWISE 

C L L L H STEP MOTOR COUNTER-CLOCKWISE 
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TITLE 

PATTERN 

REVISION 

AUTHOR 

COMPANY 

DATE 



DUAL STEPPER MOTOR CONTROLLER 

P7094 

1.00 

COLI/SACKETTE 

MONOLITHIC MEMORIES 

12/07/82 



CHIP DUAL_SMC PAL16R8 

CLK /ENA SETA ROTA MODEA /ENB SETS ROTB MODEB GND 

/OC /SWIB /SW2B /SW3B /SW4B /SW4A /SW3A /SW2A /SWIA VCC 

EQUATIONS 



SW1A:= SWIA 

+ 
+ 
+ 
+ 
+ 



*/ENA 

/SW2A */SW4A* ENA* ROTA* MODEA 

SW3A * ENA* ROTA*/MODEA 

/SW2A*/SW3A * ENA*/ROTA* MODEA 

SW4A* ENA*/ROTA*/MODEA 

ENA * 



SETA 



HOLD SWIA: DISABLE 
HALF- STEP MTR CW 
FULL-STEP MTR CW 
HALF-STEP MTR CCW 
FULL-STEP MTR CCW 
SET A TO STEP 1 



SW2A:= SW2A */ENA 

+/SW1A */SW3A * ENA* ROTA* MODEA*/SETA 

+ SW4A* ENA* ROTA*/MODEA*/SETA 

+/SW1A */SW4A* ENA*/ROTA* MODEA*/SETA 

+ SW3A * ENA*/ROTA*/MODEA*/SETA 



SW3A:= 



+/SW1A 

+ SW2A 
+ /SW2A 
+ SWIA 

+ 



SW3A */ENA 

*/SW4A* ENA* ROTA* MODEA 

* ENA* ROTA*/MODEA 
*/SW4A* ENA*/ROTA* MODEA 

* ENA*/ROTA*/MODEA 
ENA * 



SETA 



SW4A: 



SW4A*/ENA 
+ /SW2A*/SW3A * ENA* ROTA* MODEA*/SETA 
+ SWIA * ENA* ROTA*/MODEA*/SETA 

*/SW3A * ENA*/ROTA* MODEA*/SETA 
SW2A * ENA*/ROTA*/MODEA*/SETA 



+/SW1A 

+ 



SW1B:= SWIB */ENB 

+ /SW2B */SW4B* ENB* ROTB* MODEB 
+ SW3B * ENB* ROTB*/MODEB 

+ /SW2B*/SW3B * ENB*/ROTB* MODEB 
+ SW4B* ENB*/ROTB*/MODEB 

+ ENB * 



SETB 



:HOLD SW2A: DISABLE 

; HALF-STEP MTR CW 
; FULL-STEP MTR CW 
; HALF-STEP MTR CCW 
: FULL-STEP MTR CCW 

HOLD SW3A: DISABLE 
HALF-STEP MTR CW 
FULL-STEP MTR CW 
HALF-STEP MTR CCW 
FULL-STEP MTR CCW 
SET A TO STEP 1 

;HOLD SW4A: DISABLE 
: HALF-STEP MTR CW 
: FULL-STEP MTR CW 
: HALF-STEP MTR CCW 
: FULL-STEP MTR CCW 

HOLD SWIB: DISABLE 
HALF-STEP MTR CW 
FULL-STEP MTR CW 
HALF-STEP MTR CCW 
FULL-STEP MTR CCW 
SET B TO STEP 1 



SW2B:= SW2B */ENB 

+/SW1B */SW3B * ENB* ROTB* MODEB*/SETB 

+ SW4B* ENB* ROTB*/MODEB*/SETB 

+/SW1B */SW4B* ENB*/ROTB* MODEB*/SETB 

+ SW3B * ENB*/ROTB*/MODEB*/SETB 



:HOLD SW2B: DISABLE 
; HALF-STEP MTR CW 
; FULL-STEP MTR CW 
; HALF-STEP MTR CCW 
: FULL-STEP MTR CCW 
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SW3B:= 

+/SW1B 



+ SW2B 
+ /SW2B 
+ SWIB 
+ 



SW3B */ENB 

*/SW4B* ENB* ROTB* MODEB 

* ENB* ROTB*/MODEB 
*/SW4B* ENB*/ROTB* MODEB 

* ENB*/ROTB*/MODEB 

ENB * SETB 



SW4B:= SW4B*/ENB 

+ /SW2B*/SW3B * ENB* ROTB* MODEB*/SETB 

+ SWIB * ENB* ROTB*/MODEB*/SETB 

+/SW1B */SW3B * ENB*/ROTB* MODEB*/SETB 

+ SW2B * ENB*/ROTB*/MODEB*/SETB 



HOLD SW3B: DISABLE 
HALF-STEP MTR CW 
FULL-STEP MTR CW 
HALF-STEP MTR CCW 
FULL-STEP MTR CCW 
SET B TO STEP 1 

HOLD SW4B: DISABLE 
: HALF-STEP MTR CW 
: FULL-STEP MTR CW 
: HALF-STEP MTR CCW 
•FULL-STEP MTR CCW 



SIMULATION 

SETF OC ENA SETA ENB SETB 

CLOCKF CLK 

CHECK SWIA /SW2A SW3A /SW4A SWIB /SW2B SW3B /SW4B 

SETF /SETA ROTA MODEA /SETB /ROTB MODEB 

CLOCKF CLK 

CHECK SWIA /SW2A /SW3A /SW4A /SWIB /SW2B SW3B /SW4B 

CLOCKF CLK 

CHECK SWIA /SW2A /SW3A SW4A /SWIB SW2B SW3B /SW4B 

SETF /ENA /ENB 

CLOCKF CLK 

CHECK SWIA /SW2A /SW3A SW4A /SWIB SW2B SW3B /SW4B 

SETF ENA ENB 

CLOCKF CLK 

CHECK /SWIA /SW2A /SW3A SW4A /SWIB SW2B /SW3B /SW4B 

CLOCKF CLK 

CHECK /SWIA SW2A /SW3A SW4A /SWIB SW2B /SW3B SW4B 

CLOCKF CLK 

CHECK /SWIA SW2A /SW3A /SW4A /SWIB /SW2B /SW3B SW4B 

CLOCKF CLK 

CHECK /SWIA SW2A SW3A /SW4A SWIB /SW2B /SW3B SW4B 

CLOCKF CLK 

CHECK /SWIA /SW2A SW3A /SW4A SWIB /SW2B /SW3B /SW4B 

CLOCKF CLK 

CHECK SWIA /SW2A SW3A /SW4A SWIB /SW2B SW3B /SW4B 

SETF /ENA /ROTA /MODEA /ENB ROTB /MODEB 

CLOCKF CLK 

CHECK SWIA /SW2A SW3A /SW4A SWIB /SW2B SW3B /SW4B 

SETF ENA ENB 
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CLOCKF CLK 

CHECK /SWIA SW2A SW3A /SW4A SWIB /SW2B /SW3B SW4B 

CLOCKF CLK 

CHECK /SWIA SW2A /SW3A SW4A /SWIB SW2B /SW3B SW4B 

CLOCKF CLK 

CHECK SWIA /SW2A /SW3A SW4A /SWIB SW2B SW3B /SW4B 

CLOCKF CLK 

CHECK SWIA /SW2A SW3A /SW4A SWIB /SW2B SW3B /SW4B 



DESCRIPTION 

CLK /OC /EN SET ROT MODE SW1-SW4 COMMENTS 



X H X X X X Z HI-Z 

C L H X X X HOLD HOLD MOTOR POSITION 

C L L H X X 1 SET MOTOR POSITION TO STEP 1 

C L L L H H SW PLUS 1 HALF-STEP MOTOR CLOCKWISE 

C L L L H L SW PLUS 2 FULL-STEP MOTOR CLOCKWISE 

C L L L L H SW MINUS 1 HALF-STEP MOTOR COUNTERCLOCKWI 

C L L L L L SW MINUS 2 FULL-STEP MOTOR COUNTERCLOCKWI 
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The trend away from analog systems to digital numerical control 
systems focuses new attention on instruments that convert the 
analog output of position-sensing devices into digital format. 

The recording of the position of moveable parts of a machine 
requires high accuracy and noise immunity which can be 
attained through shaft encoder circuits of the type used in speed 
controllers and optical position-sensing devices. 



Principles of Shaft Encoding 

Most commonly used are opto-electrical encoders. The advan- 
tages of this type of circuit are direct shaft-to-digital encoding, a 
minimum number of power supplies, low power requirements, 
low cost and high speed. 



-oVcc 



CIRCUIT WHICH PRODUCES MODULATED 

OUTPUT SIGNALS USING LIGHT SENSORS 

(PHOTO DIODES) 




OPTICAL DISC 




- LIGHT 
-SENSORS 



Thus, if power is interrupted an incremental encoder must have 
its zero position reestablished. 

The two signals from the light sensors have to be amplified and 
converted into digital format using Schmitt triggers. 



JF 



Optical encoders measure shaft rotation by detecting the light 
which passes through a rotating code disc and a fixed slit. 

It consists of a number of light sources and sensors whose paths 
are interrupted by a disc that has transparent and opaque areas. 

By using various concentric patterns, the shaft position can be 
defined. Light shines through the disc onto sensors. A sensor 
turns on when the light passes through its corresponding trans- 
parent part of the disc giving an output that depends on the 
opacity of the segment. The combined output of all the segments 
is digital information representing the disc and the shaft position. 

— An absolute encoder has a number of concentric tracks on the 
code disc, which provide parallel readout of shaft angle with- 
out counting pulses. The code patterns on the disc are a kind 
of storage. 

Thus, the readout is also present after a power interruption. 

— Incremental encoders, which will be described here, have a 
single code track on the disc. Angular position is determined 
by counting pulses produced by the modulated light falling 
onto the photodetectors. 

Direction sensing is obtained by the use of quadrature signals 
which are provided by appropriate phasing of the paths. 



The two signals are out of phase by a quarter of a period. The 
direction of movement can be determined by which signal leads 
or lags the other. 

Out of this phase relationship the shaft encoder produces 
information about direction and speed for a following up/ 
down-counter. 

To avoid random discrepancies during switching operations 
shaft encoders should be built using synchronous clocked 
circuitry. 

The diagram below shows a typical circuit diagram for shaft 
encoding. 
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The phase relationship of the two signals PHIO and PHI90 is 
determined by the four registers, and their outputs are encoded. 

The underlying principle of the above circuit is the time delay 
between the clock pulses for the two signals PHIO and PHI90. 

S1 = PHI0atCLK S2 = PHIO at CLK + 1 

SI = PHI90 at CLK S4 = PHI90 at CLK + 1 

A transition in PHIO or PHI90 from L to H or vice versa causes a 
change in the data stored in the registers, which is encoded for 
direction sensing. 

The logic equations for the encoder circuitry are given below 

PHIO leads PHI90: SI * S2 * S3 * /S4 

+/S1 * /S2 * /S3 * S4 

+ SI * /S2 * /S3 * /S4 

+/S1 * S2 * S3 * S4 



PHIO lags PHI90: 



/SI * /S2 * S3 * /S4 

+ SI * S2 * /S3 * S4 

+ SI * /S2 * S3 * S4 

+/S1 * S2 * /S3 * /S4 



the input PHIO lags the input PH190 by 90°. In this case, four 
pulses per square wave are presented at the DOWN-output. Note 
that both the UP and DOWN outputs of the counter are normally 
held high. 

To ensure that no shaft encoder transition is missed, the clock 
frequency should be at least SxNxS, where N is the number of 
pulses produced by the encoder for each shaft revolution and S 
is the maximum speed in revolutions per second to be expected. 

Most commonly CLK frequencies in the rangeof 1 MHz are used 
for optimum circuit operation. 

Synchronous two-channel shaft encoders as described above 
are relatively insensitive to encoder phase errors. They use no 
temperamental Mono-Flops and can detect any illegal transition 
states generated by the circuit. 

Interference on the input lines is ignored if it occurs between two 
clock cycles. Random noise on both input lines results in one UP 
count for each DOWN count so that in the end the digital infor- 
mation remains unchanged. Hence synchronous shaft encoders 
are extremely useful in electrically noisy environments. 

The PAL Device Applications are examples of such circuits 
using a single PAL device each. 



SHAFT ENCODER #1 
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When the shaft rotates in a clockwise direction, the input PHIO 
leads the input PHI90 by 90° and the logic will generate pulses 
only at the UP-output. 
On the other hand, when the shaft's rotation is counterclockwise 
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TITLE 

PATTERN 

REVISION 

AUTHOR 

COMPANY 

DATE 



SHAFT ENCODER No. 1 

P7097 

1.00 

WILLY VOLDAN 

MONOLITHIC MEMORIES 

09/09/82 



CHIP SEl PAL16R4 

CLK PHIO PHI90 X4 NC NC NC NC /SSET GND 
/OC DOWN NC S4 S3 S2 SI NC UP VCC 



EQUATIONS 


/SI :- 

+ 


/PHIO 
SSET 


/S2 : = 

+ 


/SI 
SSET 


/S3 : = 

+ 


/PHI90 
SSET 


/S4 : = 
+ 


/S3 
SSET 



; CHECK FOR PHIO 
;INITIALIZE S1=L 

; CHECK FOR SI 

; INITIALIZE S2=L 

; CHECK FOR PHI 90 
; INITIALIZE S3=L 

; CHECK FOR S3 

; INITIALIZE S4=L 



/DOWN = SI* S2* S3*/S4* PHIO* PHI90 

+ /Sl*/S2*/S3* S4*/PHI0*/PHI90 

+ Sl*/S2*/S3*/S4* PHI0*/PHI90 

+ /SI* S2* S3* S4*/PHI0* PHI90 

DOWN.TRST = VCC 

/UP = /Sl*/S2* S3*/S4*/PHI0* PHI90 

+ SI* S2*/S3* S4* PHI0*/PHI90 

+ Sl*/S2* S3* S4* PHIO* PHI90 

+ /SI* S2*/S3*/S4*/PHI0*/PHI90 

UP.TRST = VCC 



PHIO LEADS PHI90 

PHIO LEADS PHI90 

PHIO LEADS PHI 90 

PHIO LEADS PHI90 



PHI90 LEADS PHIO 
PHI90 LEADS PHIO 
PHI90 LEADS PHIO 
PHI90 LEADS PHIO 



SIMULATION 

SETF OC SSET 

CLOCKF CLK 

CHECK /S4 /S3 /S2 /SI UP DOWN 

SETF /SSET /PHIO /PHI90 

CLOCKF CLK 

CHECK /S4 /S3 /S2 /SI UP DOWN 

CLOCKF CLK 

CHECK /S4 /S3 /S2 /SI UP DOWN 

SETF PHI 90 
CLOCKF CLK 
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CHECK /S4 S3 /S2 /SI /UP DOWN 

CLOCKF CLK 

CHECK S4 S3 /S2 /SI UP DOWN 

SETF PHIO 

CLOCKF CLK 

CHECK S4 S3 /S2 SI /UP DOWN 

CLOCKF CLK 

CHECK S4 S3 S2 SI UP DOWN 

SETF /PHI90 

CLOCKF CLK 

CHECK S4 /S3 S2 SI /UP DOWN 

CLOCKF CLK 

CHECK /S4 /S3 S2 SI UP DOWN 

SETF /PHIO 
CLOCKF CLK 
CHECK /S4 /S3 S2 /SI /UP DOWN 

CLOCKF CLK 

CHECK /S4 /S3 /S2 /SI UP DOWN 

SETF PHI90 
CLOCKF CLK 
CHECK /S4 S3 /S2 /SI /UP DOWN 

CLOCKF CLK 

CHECK S4 S3 /S2 /SI UP DOWN 

SETF PHIO 

CLOCKF CLK 

CHECK S4 S3 /S2 SI /UP DOWN 

CLOCKF CLK 

CHECK S4 S3 S2 SI UP DOWN 

SETF /PHI90 

CLOCKF CLK 

CHECK S4 /S3 S2 SI /UP DOWN 

CLOCKF CLK 

CHECK /S4 /S3 S2 SI UP DOWN 

SETF /PHIO 
CLOCKF CLK 
CHECK /S4 /S3 S2 /SI /UP DOWN 

CLOCKF CLK 

CHECK /S4 /S3 /S2 /SI UP DOWN 
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SETF PHI90 
CLOCKF CLK 
CHECK /S4 S3 /S2 /SI /UP DOWN 

CLOCKF CLK 

CHECK S4 S3 /S2 /SI UP DOWN 

SETF PHIO 

CLOCKF CLK 

CHECK S4 S3 /S2 SI /UP DOWN 

CLOCKF CLK 

CHECK S4 S3 S2 SI UP DOWN 

SETF /PHI90 

CLOCKF CLK 

CHECK S4 /S3 S2 SI /UP DOWN 

CLOCKF CLK 

CHECK /S4 /S3 S2 SI UP DOWN 

SETF /PHIO 
CLOCKF CLK 
CHECK /S4 /S3 S2 /SI /UP DOWN 

CLOCKF CLK 

CHECK /S4 /S3 /S2 /SI UP DOWN 

SETF SSET 

CLOCKF CLK 

CHECK /S4 /S3 /S2 /SI UP DOWN 

SETF /SSET 
CLOCKF CLK 
CHECK /S4 /S3 /S2 /SI UP DOWN 

CLOCKF CLK 

CHECK /S4 /S3 /S2 /SI UP DOWN 

SETF PHIO 

CLOCKF CLK 

CHECK /S4 /S3 /S2 SI UP /DOWN 

CLOCKF CLK 

CHECK /S4 /S3 S2 SI UP DOWN 

SETF PHI90 
CLOCKF CLK 
CHECK /S4 S3 S2 SI UP /DOWN 

CLOCKF CLK 

CHECK S4 S3 S2 SI UP DOWN 

SETF /PHIO 
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CLOCKF CLK 

CHECK S4 S3 S2 /SI UP /DOWN 

CLOCKF CLK 

CHECK S4 S3 /S2 /SI UP DOWN 

SETF /PHI90 

CLOCKF CLK 

CHECK S4 /S3 /S2 /SI UP /DOWN 

CLOCKF CLK 

CHECK /S4 /S3 /S2 /SI UP DOWN 

SETF PHIO 

CLOCKF CLK 

CHECK /S4 /S3 /S2 SI UP /DOWN 

CLOCKF CLK 

CHECK /S4 /S3 S2 SI UP DOWN 

SETF PHI90 
CLOCKF CLK 
CHECK /S4 S3 S2 SI UP /DOWN 

CLOCKF CLK 

CHECK S4 S3 S2 SI UP DOWN 

SETF /PHIO 
CLOCKF CLK 
CHECK S4 S3 S2 /SI UP /DOWN 

CLOCKF CLK 

CHECK S4 S3 /S2 /SI UP DOWN 

SETF /PHI90 

CLOCKF CLK 

CHECK S4 /S3 /S2 /SI UP /DOWN 

CLOCKF CLK 

CHECK /S4 /S3 /S2 /SI UP DOWN 

SETF PHIO 

CLOCKF CLK 

CHECK /S4 /S3 /S2 SI UP /DOWN 

CLOCKF CLK 

CHECK /S4 /S3 S2 Si UP DOWN 

SETF PHI90 
CLOCKF CLK 
CHECK /S4 S3 S2 SI UP /DOWN 

CLOCKF CLK 

CHECK S4 S3 S2 SI UP DOWN 
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SETF /PHIO 
CLOCKF CLK 
CHECK S4 S3 S2 /SI UP /DOWN 

CLOCKF CLK 

CHECK S4 S3 /S2 /SI UP DOWN 

SETF /PHI90 

CLOCKF CLK 

CHECK S4 /S3 /S2 /SI UP /DOWN 

CLOCKF CLK 

CHECK /S4 /S3 /S2 /SI UP DOWN 

DESCRIPTION: 

THIS PAL16R4 IMPLEMENTS A TWO CHANNEL SHAFT ENCODER OF THE 
TYPE USED IN SPEED CONTROLLERS AND OPTICAL DEVICES. 

BOTH THE "UP" AND "DOWN" OUTPUTS OF THE PAL DEVICE ARE 
NORMALLY HIGH. 

WHEN THE SIGNAL AT THE "PHIO" INPUT LEADS THE SIGNAL AT THE 
"PHI90" INPUT, THE "DOWN" OUTPUT ALTERNATES BETWEEN HIGH AND 
LOW LEVELS AT HALF THE "CLK" FREQUENCY RATE. ALSO, WHEN THE 
SIGNAL AT THE "PHIO" INPUT LAGS THE SIGNAL AT THE "PHI90" 
INPUT, THE "UP" OUTPUT ALTERNATES BETWEEN HIGH AND LOW 
LEVELS AT HALF THE "CLK" FREQUENCY RATE. 

THE SHAFT ENCODER FEATURES THE CONFIGURATION AND OUTPUT 
POLARITY TO DRIVE AN 74S193 TYPE UP/DOWN COUNTER. 

THIS DESIGN WITH GLITCH-FREE OUTPUTS WILL BE EXTREMELY 
USEFUL IN ELECTRICALLY NOISY ENVIRONMENTS. THE PINOUT IS 
GIVEN AS A FIRST PROPOSAL AND CAN BE CHANGED ACCORDING TO 
THE PC BOARD LAYOUT. 
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TITLE 

PATTERN 

REVISION 

AUTHOR 

COMPANY 

DATE 



SHAFT ENCODER No. 2 

P7098 

1.00 

WILLY VOLDAN 

MONOLITHIC MEMORIES 

09/09/82 



CHIP SE2 PAL16R8 

CLK PHIO PHI90 X4 NC NC NC NC /SSET GND 
/OC UD NC S4 S3 S2 SI NC COUNT VCC 



EQUATIONS 






/SI 


= /PHIO 






+ 


SSET 




/S3 


; = 


/PHI90 






+ 


SSET 




/S2 


: = 


SI 






+ 


SSET 




/S4 


: = 


S3 






+ 


SSET 




/COUNT 


: = 


SI* S2*/S3* S4 






+ 


/Sl*/S2* S3*/S4 






+ 


/SI* S2*/S3*/S4* 


X4 




+ 


Sl*/S2* S3* S4* 


X4 




+ 


Si* S2* S3*/S4 






+ 


/Sl*/S2*/S3* S4 






+ 


/SI* S2* S3* S4* 


X4 




+ 


Sl*/S2*/S3*/S4* 


X4 


/UD 


: = 


/Si* S2*/S3* S4 






+ 


/SI* S2* S3* S4 






+ 


/SI* S2* S3*/S4 






+ 


SI* S2* S3*/S4 






+ 


Sl*/S2* S3*/S4 






+ 


Sl*/S2*/S3*/S4 






+ 


Sl*/S2*/S3* S4 






+ 


/Sl*/S2*/S3* S4 




SIMULATION 





SETF OC SSET 

CLOCKF CLK 

CHECK /SI /S2 /S3 /S4 

SETF /SSET /PHIO /PHI90 /X4 

CLOCKF CLK 

CHECK /SI S2 /S3 S4 COUNT UD 



; CHECK FOR PHIO 
; INITIALIZE S1=L 

; CHECK FOR PHI 90 
; INITIALIZE S3=L 

; CHECK FOR /SI 
; INITIALIZE S2=L 

; CHECK FOR /S3 
; INITIALIZE S4=L 

THIS OUTPUT ALTERNATES 
BETWEEN HIGH AND LOW WITH 
HALF OR QUARTER THE 
CLK FREQUENCY 



THIS OUTPUT DETERMINES 
IF SIGNAL PHIO LEADS 
OR LAGS SIGNAL PHI90 
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SETF PHIO 

CLOCKF CLK 

CHECK SI S2 /S3 S4 COUNT /UD 

CLOCKF CLK 

CHECK SI /S2 /S3 S4 /COUNT UD 

SETF PHI90 
CLOCKF CLK 
CHECK SI /S2 S3 S4 COUNT /UD 

CLOCKF CLK 

CHECK SI /S2 S3 /S4 COUNT UD 

SETF /PHIO 
CLOCKF CLK 
CHECK /SI /S2 S3 /S4 COUNT /UD 

CLOCKF CLK 

CHECK /SI S2 S3 /S4 /COUNT UD 

SETF /PHI90 

CLOCKF CLK 

CHECK /<^1 S2 /S3 /S4 COUNT /UD 

CLOCKF CLK 

CHECK /SI S2 /S3 S4 COUNT UD 

SETF PHIO 

CLOCKF CLK 

CHECK SI S2 /S3 S4 COUNT /UD 

CLOCKF CLK 

CHECK SI /S2 /S3 S4 /COUNT UD 

SETF PHI90 
CLOCKF CLK 
CHECK SI /S2 S3 S4 COUNT /UD 

CLOCKF CLK 

CHECK SI /S2 S3 /S4 COUNT UD 

SETF /PHIO 
CLOCKF CLK 
CHECK /SI /S2 S3 /S4 COUNT /UD 

CLOCKF CLK 

CHECK /SI S2 S3 /S4 /COUNT UD 

SETF /PHI90 X4 

CLOCKF CLK 

CHECK /SI S2 /S3 /S4 COUNT /UD 
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CLOCKF CLK 

CHECK /SI S2 /S3 S4 /COUNT UD 

SETF PHIO 

CLOCKF CLK 

CHECK SI S2 /S3 S4 COUNT /UD 

CLOCKF CLK 

CHECK SI /S2 /S3 S4 /COUNT UD 

SETF PHI90 
CLOCKF CLK 
CHECK SI /S2 S3 S4 COUNT /UD 

CLOCKF CLK 

CHECK SI /S2 S3 /S4 /COUNT UD 

SETF /PHIO 
CLOCKF CLK 
CHECK /SI /S2 S3 /S4 COUNT /UD 

CLOCKF CLK 

CHECK /SI S2 S3 /S4 /COUNT UD 

SETF /PHI90 

CLOCKF CLK 

CHECK /SI S2 /S3 /S4 COUNT /UD 

CLOCKF CLK 

CHECK /SI S2 /S3 S4 /COUNT UD 

SETF PHIO 

CLOCKF CLK 

CHECK SI S2 /S3 S4 COUNT /UD 

CLOCKF CLK 

CHECK SI /S2 /S3 S4 /COUNT UD 

SETF PHI90 
CLOCKF CLK 
CHECK SI /S2 S3 S4 COUNT /UD 

CLOCKF CLK 

CHECK SI /S2 S3 /S4 /COUNT UD 

SETF SSET 

CLOCKF CLK 

CHECK /SI /S2 /S3 /S4 COUNT /UD 

SETF /SSET /PHIO /PHI90 /X4 

CLOCKF CLK 

CHECK /SI S2 /S3 S4 COUNT UD 

SETF PHI90 
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CLOCKF CLK 

CHECK /SI S2 S3 S4 COUNT /UD 

CLOCKF CLK 

CHECK /SI S2 S3 /S4 COUNT /UD 

SETF PHIO 

CLOCKF CLK 

CHECK SI S2 S3 /S4 COUNT /UD 

CLOCKF CLK 

CHECK SI /S2 S3 /S4 /COUNT /UD 

SETF /PHI90 

CLOCKF CLK 

CHECK SI /S2 /S3 /S4 COUNT /UD 

CLOCKF CLK 

CHECK SI /S2 /S3 S4 COUNT /UD 

SETF /PHIO 
CLOCKF CLK 
CHECK /SI /S2 /S3 S4 COUNT /UD 

CLOCKF CLK 

CHECK /SI S2 /S3 S4 /COUNT /UD 

SETF PHI90 
CLOCKF CLK 
CHECK /SI S2 S3 S4 COUNT /UD 

CLOCKF CLK 

CHECK /SI S2 S3 /S4 COUNT /UD 

SETF PHIO 

CLOCKF CLK 

CHECK SI S2 S3 /S4 COUNT /UD 

CLOCKF CLK 

CHECK SI /S2 S3 /S4 /COUNT /UD 

SETF /PHI90 

CLOCKF CLK 

CHECK SI /S2 /S3 /S4 COUNT /UD 

CLOCKF CLK 

CHECK SI /S2 /S3 S4 COUNT /UD 

SETF /PHIO X4 

CLOCKF CLK 

CHECK /SI /S2 /S3 S4 COUNT /UD 

CLOCKF CLK 

CHECK /SI S2 /S3 S4 /COUNT /UD 
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SETF PHI90 
CLOCKF CLK 
CHECK /SI S2 S3 S4 COUNT /UD 

CLOCKF CLK 

CHECK /SI S2 S3 /S4 /COUNT /UD 

SETF PHIO 

CLOCKF CLK 

CHECK SI S2 S3 /S4 COUNT /UD 

CLOCKF CLK 

CHECK SI /S2 S3 /S4 /COUNT /UD 

SETF /PHI90 

CLOCKF CLK 

CHECK SI /S2 /S3 /S4 COUNT /UD 

CLOCKF CLK 

CHECK SI /S2 /S3 S4 /COUNT /UD 

SETF /PHIO 
CLOCKF CLK 
CHECK /SI /S2 /S3 S4 COUNT /UD 

CLOCKF CLK 

CHECK /SI S2 /S3 S4 /COUNT /UD 

SETF PHI90 
CLOCKF CLK 
CHECK /SI S2 S3 S4 COUNT /UD 

CLOCKF CLK 

CHECK /SI S2 S3 /S4 /COUNT /UD 

SETF PHIO 

CLOCKF CLK 

CHECK SI S2 S3 /S4 COUNT /UD 

CLOCKF CLK 

CHECK SI /S2 S3 /S4 /COUNT /UD 

DESCRIPTION 

THIS PAL16R8 IMPLEMENTS A TWO CHANNEL SHAFT ENCODER OF THE 
TYPE USED IN SPEED CONTROLLERS AND OPTICAL DEVICES. 

THE "COUNT" OUTPUT OF THE PAL DEVICE IS NORMALLY HIGH. DURING 
SHAFT ENCODING THIS OUTPUT ALTERNATES BETWEEN HIGH AND LOW. 

INPUT "X4" SELECTS BETWEEN HALF (X4=H) OR QUARTER (X4=L) CLK 
FREQUENCY OF THE "COUNTER" OUTPUT. 
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OUTPUT "UD" DETERMINES WHETHER SIGNAL PHIO LEADS (UD=H) OR 
LAGS (UD=L) SIGNAL PHI90. 

THE SHAFT ENCODER FEATURES THE CONFIGURATION AND OUTPUT 
POLARITY TO DRIVE A 7 4S697 TYPE UP/DOWN COUNTER. 

THIS DESIGN WITH GLITCH-FREE OUTPUTS WILL BE EXTREMELY USEFUL 
IN ELECTRICALLY NOISY ENVIRONMENTS. THE PINNING IS GIVEN AS 
A FIRST PROPOSAL AND CAN BE CHANGED ACCORDING TO THE PC-BOARD 
LAYOUT . 
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TITLE SHAFT ENCODER No. 3 (WITH INTERNAL 4-BIT UP/DOWN COUNTER) 

PATTERN P7099 

REVISION 1.00 

AUTHOR WILLY VOLDAN 

COMPANY MONOLITHIC MEMORIES 

DATE 09/09/82 



CHIP SE3 PAL20X10 

CLK PHIO PHI90 X4 /LD NC D3 D2 Dl DO /SSET 
GND /OC DOWN S4 S3 S2 SI Q3 Q2 Ql QC UP VCC 



EQUATIONS 


/SI : = 

+ 


/PHIO 
SSET 


/S2 : = 

+ 


/SI 
SSET 


/S3 : = 

+ 


/PHI90 
SSET 


/S4 : = 
+ 


/S3 
SSET 



/DOWN := Si* S2* S3*/S4* PHIO* PHI90* X4 

+ /Sl*/S2*/S3* S4*/PHI0*/PHI90* X4 

:+: Sl*/S2*/S3*/S4* PHI0*/PHI90 

+ /SI* S2* S3* S4*/PHI0* PHI90 

/UP := /Sl*/S2* S3*/S4*/PHI0* PHI90 

.+ SI* S2*/S3* S4* PHI0*/PHI90 

:+: Sl*/S2* S3* S4* PHIO* PHI90* X4 

+ /SI* S2*/S3*/S4*/PHI0*/PHI90* X4 

/QO := /SSET* LD*/DO 

+ /SSET*/LD*/QO 

:+: /SSET*/LD* UP*/DOWN 

+ /SSET*/LD*/UP* DOWN 

/Ql := /SSET* LD*/D1 

+ /SSET*/LD*/Q1 

:+; /SSET*/LD* UP*/DOWN*/Q0 

+ /SSET*/LD*/UP* DOWN* QO 

/Q2 := /SSET* LD*/D2 
+ /SSET*/LD*/Q2 
:+: /SSET*/LD* UP*/DOWN*/QO*/01 
+ /SSET*/LD*/UP* DOWN* QO* Ql 

/Q3 := /SSET* LD*/D3 
+ /SSET*/LD*/Q3 
:+: /SSET*/LD* UP*/DOWN*/QO*/Q1*/Q2 



CHECK FOR PHIO 
INITIALIZE S1=L 

CHECK FOR SI 
INITIALIZE S2=L 

CHECK FO^ PHI 90 
INITIALIZE S3=L 

CHECK FOR S3 

INITIALIZE S4=L 

PHIO LEADS PHI90:COUNT=F/2 
PHIO LEADS PHI90:COUNT=F/2 
PHIO LEADS PHI90:COUNT=F/4 
PHIO LEADS PHI90:COUNT=F/4 

PHI90 LEADS PHIO :C0UNT=F/4 
PHI90 LEADS PHIO :C0UNT=F/4 
PHI 90 LEADS PHIO ;C0UNT=F/2 
PHI90 LEADS PHIO :C0UNT=F/2 

LOAD DO (LSB) 
HOLD QO 

DECREMENT 
INCREMENT 

LOAD Dl 
HOLD Ql 
DECREMENT 
INCREMENT 

LOAD D2 
HOLD Q2 
DECREMENT 
INCREMENT 

LOAD D3 (MSB) 
HOLD Q3 
DECREMENT 
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+ /SSET*/LD*/UP* DOWN* QO* Ql* 02 ; INCREMENT 



SIMULATION 



SETF OC SSET /PHIO /PHI90 

CLOCKF CLK 

CHECK /SI /S2 /S3 /S4 Q3 02 01 00 

SETF /SSET LD D3 /D2 Dl /DO 

CLOCKF CLK 

CHECK 03 /02 01 /OO 

SETF /LD /X4 

CLOCKF CLK 

CHECK /SI /S2 /S3 /S4 UP DOWN 03 /02 01 /OO 

SETF PHIO 

CLOCKF CLK 

CHECK SI /S2 /S3 /S4 UP DOWN 03 /02 01 /OO 

CLOCKF CLK 

CHECK SI S2 /S3 /S4 UP /DOWN 03 /02 01 /OO 

SETF PHI90 
CLOCKF CLK 
CHECK SI S2 S3 /S4 UP DOWN 03 /02 /Ol QO 

CLOCKF CLK 

CHECK SI S2 S3 S4 UP DOWN 03 /Q2 /Ql QO 

SETF /PHIO 
CLOCKF CLK 
CHECK /SI S2 S3 S4 UP DOWN 03 /02 /Ql QO 

CLOCKF CLK 

CHECK /SI /S2 S3 S4 UP /DOWN 03 /02 /Ql QO 

SETF /PHI90 

CLOCKF CLK 

CHECK /SI /S2 /S3 S4 UP DOWN Q3 /Q2 /Ql /QO 

CLOCKF CLK 

CHECK /SI /S2 /S3 /S4 UP DOWN Q3 /02 /Ql /QO 

SETF PHIO 

CLOCKF CLK 

CHECK SI /S2 /S3 /S4 UP DOWN 03 /02 /Ql /QO 

CLOCKF CLK 

CHECK SI S2 /S3 /S4 UP /DOWN 03 /02 /Ql /QO 

SETF PHI90 
CLOCKF CLK 
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CHECK SI S2 S3 /S4 UP DOWN /Q3 Q2 Ql QO 

CLOCKF CLK 

CHECK SI S2 S3 S4 UP DOWN /Q3 Q2 Ql QO 

SETF /PHIO 
CLOCKF CLK 
CHECK /SI S2 S3 S4 UP DOWN /Q3 Q2 Ql QO 

CLOCKF CLK 

CHECK /SI /S2 S3 S4 UP /DOWN /Q3 Q2 Ql QO 

SETF X4 /PHI90 

CLOCKF CLK 

CHECK /SI /S2 /S3 S4 UP DOWN /03 02 Ql /QO 

CLOCKF CLK 

CHECK /SI /S2 /S3 /S4 UP /DOWN /Q3 02 Ql /QO 

SETF PHIO 

CLOCKF CLK 

CHECK SI /S2 /S3 /S4 UP DOWN /Q3 Q2 /Ql QO 

CLOCKF CLK 

CHECK SI S2 /S3 /S4 UP /DOWN /Q3 Q2 /Ql QO 

SETF PHI90 
CLOCKF CLK 
CHECK SI S2 S3 /S4 UP DOWN /Q3 Q2 /Ql /QO 

CLOCKF CLK 

CHECK SI S2 S3 S4 UP /DOWN /Q3 Q2 /Ql /QO 

SETF /PHIO 
CLOCKF CLK 
CHECK /SI S2 S3 S4 UP DOWN /03 /Q2 Ql QO 

CLOCKF CLK 

CHECK /SI /S2 S3 S4 UP /DOWN /Q3 /Q2 Ql QO 

SETF /PHI90 

CLOCKF CLK 

CHECK /SI /S2 /S3 S4 UP DOWN /Q3 /Q2 Ql /QO 

CLOCKF CLK 

CHECK /SI /S2 /S3 /S4 UP /DOWN /Q3 /Q2 Ql /QO 

SETF PHIO 

CLOCKF CLK 

CHECK SI /S2 /S3 /S4 UP DOWN /Q3 /Q2 /Ql QO 

CLOCKF CLK 

CHECK SI S2 /S3 /S4 UP /DOWN /Q3 /Q2 /Ql QO 
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SETF PHI 90 
CLOCKF CLK 
CHECK SI S2 S3 /S4 UP DOWN /Q3 /Q2 /Ql /QO 

CLOCKF CLK 

CHECK SI S2 S3 S4 UP /DOWN /03 /02 /Ql /QO 

SETF /PHIO 
CLOCKF CLK 
CHECK /SI S2 S3 S4 UP DOWN Q3 02 Ql QO 

CLOCKF CLK 

CHECK /Si /S2 S3 S4 UP /DOWN Q3 Q2 Ql QO 

SETF /PHI90 

CLOCKF CLK 

CHECK /SI /S2 /S3 S4 UP DOWN Q3 Q2 Ql /QO 

SETF SSET /PHIO /PHI90 
CLOCKF CLK 

SETF /SSET LD /D3 D2 /Dl DO 

CLOCKF CLK 

CHECK /Q3 Q2 /Ql QO 

SETF /LD /X4 

CLOCKF CLK 

CHECK /SI /S2 /S3 /S4 UP DOWN /Q3 Q2 /Ql QO 

SETF PHI90 
CLOCKF CLK 
CHECK /SI /S2 S3 /S4 UP DOWN /Q3 Q2 /Ql QO 

CLOCKF CLK 

CHECK /SI /S2 S3 S4 /UP DOWN /Q3 Q2 /Ql QO 

SETF PHIO 

CLOCKF CLK 

CHECK SI /S2 S3 S4 UP DOWN /Q3 Q2 Ql /QO 

CLOCKF CLK 

CHECK SI S2 S3 S4 UP DOWN /Q3 Q2 Ql /QO 

SETF /PHI90 

CLOCKF CLK 

CHECK SI S2 /S3 S4 UP DOWN /Q3 02 Ql /QO 

CLOCKF CLK 

CHECK SI S2 /S3 /S4 /UP DOWN /Q3 Q2 Ql /QO 

SETF /PHIO 
CLOCKF CLK 
CHECK /SI S2 /S3 /S4 UP DOWN /Q3 Q2 Ql QO 
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CLOCKF CLK 

CHECK /SI /S2 /S3 /S4 UP DOWN /03 02 Ql QO 

SETF PHI 9 
CLOCKF CLK 
CHECK /SI /S2 S3 /S4 UP DOWN /03 02 01 00 

CLOCKF CLK 

CHECK /SI /S2 S3 S4 /UP DOWN /03 02 01 00 

SETF PHIO 

CLOCKF CLK 

CHECK SI /S2 S3 S4 UP DOWN 03 /02 /Ql /QQ 

CLOCKF CLK 

CHECK SI S2 S3 S4 UP DOWN 03 /02 /Ol /OO 

SETF /PHI90 

CLOCKF CLK 

CHECK SI S2 /S3 S4 UP DOWN Q3 /02 /Ol /OO 

CLOCKF CLK 

CHECK SI S2 /S3 /S4 /UP DOWN 03 /02 /Ql /OO 

SETF X4 /PHIO 

CLOCKF CLK 

CHECK /SI S2 /S3 /S4 UP DOWN 03 /02 /Ol 00 

CLOCKF CLK 

CHECK /SI /S2 /S3 /S4 /UP DOWN 03 /02 /Ol 00 

SETF PHI90 
CLOCKF CLK 
CHECK /SI /S2 S3 /S4 UP DOWN 03 /02 01 /OO 

CLOCKF CLK 

CHECK /SI /S2 S3 S4 /UP DOWN 03 /02 01 /OO 

SETF PHIO 

CLOCKF CLK 

CHECK SI /S2 S3 S4 UP DOWN 03 /02 Ql OO 

CLOCKF CLK 

CHECK SI S2 S3 S4 /UP DOWN 03 /02 01 00 

SETF /PHI 90 

CLOCKF CLK 

CHECK SI S2 /S3 S4 UP DOWN Q3 Q2 /Ql /QO 

CLOCKF CLK 

CHECK SI S2 /S3 /S4 /UP DOWN 03 02 /Ol /OO 

SETF /PHIO 
CLOCKF CLK 
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CHECK /SI S2 /S3 /S4 UP DOWN Q3 02 /Ql QO 

CLOCKF CLK 

CHECK /SI /S2 /S3 /S4 /UP DOWN Q3 02 /Ol 00 

SETF PHI90 
CLOCKF CLK 
CHECK /SI /S2 S3 /S4 UP DOWN Q3 02 Ql /QO 

CLOCKF CLK 

CHECK /SI /S2 S3 S4 /UP DOWN 03 Q2 Ql /QO 

SETF PHIO 

CLOCKF CLK 

CHECK SI /S2 S3 S4 UP DOWN Q3 Q2 Ql QO 

CLOCKF CLK 

CHECK SI S2 S3 S4 /UP DOWN Q3 Q2 Ql QO 

SETF /PHI90 

CLOCKF CLK 

CHECK SI S2 /S3 S4 UP DOWN /Q3 /Q2 /Ql /QO 

CLOCKF CLK 

CHECK SI S2 /S3 /S4 /UP DOWN /Q3 /Q2 /Ql /QO 

SETF /PHIO 
CLOCKF CLK 
CHECK /SI S2 /S3 /S4 UP DOWN /03 /02 /Ql QO 

SETF LD /D3 /D2 /Dl DO 

CLOCKF CLK 

CHECK /Q3 /Q2 /Ql QO 

SETF /D3 /D2 Dl DO 

CLOCKF CLK 

CHECK /Q3 /Q2 Ql QO 

SETF /D3 D2 /Dl DO 

CLOCKF CLK 

CHECK /Q3 Q2 /Ql QO 

SETF /D3 D2 Dl DO 

CLOCKF CLK 

CHECK /Q3 Q2 Ql QO 

SETF D3 /D2 /Dl DO 

CLOCKF CLK 

CHECK Q3 /Q2 /Ql QO 

SETF D3 /D2 Dl DO 

CLOCKF CLK 

CHECK 03 /02 Ql QO 
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SETF D3 D2 /Dl DO 

CLOCKF CLK 

CHECK Q3 Q2 /Ql QO 

SETF D3 D2 Dl DO 

CLOCKF CLK 

CHECK 03 02 01 QO 

DESCRIPTION 

THIS PAL20X10 IMPLEMENTS A TWO CHANNEL SHAFT ENCODER WITH AN 
INTERNAL 4-3IT UP/DOWN COUNTER. 

BOTH THE "UP" AND "DOWN" OUTPUTS OF THE PAL DEVICE ARE NORMALLY 
HIGH. 

WHEN THE SIGNAL AT THE "PHIO" INPUT LEADS THE SIGNAL AT THE 
"PHI90" INPUT, THE "DOWN" OUTPUT ALTERNATES BETWEEN HIGH AND 
LOW LEVELS AND THE COUNTER WILL COUNT DOWN. WHEN THE SIGNAL 
AT THE "PHIO" INPUT LEADS THE SIGNAL AT THE "PHI90" INPUT, 
THE "UP" OUTPUT ALTERNATES BETWEEN HIGH AND LOW LEVELS AND 
THE COUNTER WILL COUNT UP. 

INPUT "X4" SELECTS BETWEEN HALF (X4=H) OR QUARTER (X4=L) CLK 
FREQUENCY OF THE COUNTER OUTPUTS. 

THE INTERNAL 4-BIT SYNCHRONOUS COUNTER HAS COUNT UP, COUNT DOWN 
CAPABILITIES. ALSO, THE COUNTER CAN PARALLEL LOAD AND HOLD DATA 
INDEPENDENTLY OF THE SHAFT ENCODER SECTION. THE REGISTERS ARE 
SYNCHRONOUSLY INITIALIZED WHEN /SSET IS HELD LOW. 

THE CONTROL INPUTS PROVIDE THESE OPERATIONS WHICH OCCUR 
SYNCHRONOUSLY AT THE RISING EDGE OF THE CLOCK. 
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Military Applications 



Monolithic Memories/Advanced Micro Devices is a major partici- 
pant in the Command, Communication and Control (C^) military 
marketplace. Applications include programmable obsolescence 
solutions, surface mount designs, radar systems, missile guid- 
ance, aircraft graphic processors, microprocessor support logic, 
state-machine designs, military computer hardware, redundant 
solutions for backup systems, and reconfigurable hardware using 
Logic Cell Arrays. 

Programmable products offer an excellent solution to military ob- 
solescence difficulties. The dynamics of the semiconductor in- 
dustry has shortened product life cycles, creating a sunset tech- 
nology market. Programmable products offer a quick, low-cost 
replacement for obsolete parts, through direct pin compatibility or 
board emulation. 

MMI/AMD has long been a leader in military bipolar PAL, PROM 
and FIFO products. We will soon be introducing military zero- 
power CMOS PAL devices and military CMOS Logic Cell Arrays. 
With pin and function compatible PAL products offered in both 
bipolar and CMOS technologies. Monolithic Memories offers 



backup systems (redundant solutions) in case of af allure with the 
original design. So, whether the customer needs high-speed 
bipolar or low-power CMOS, we will be the only company offering 
both technologies for the military marketplace. 

Surface mount technology is an assembly technique when circuit 
boards are assembled with components bonded to metal pads on 
the board surface, instead of being inserted into through-holes 
like conventional Dual In-line packages (DIP). Surface mount 
components can be mounted on both sides of a circuit board, 
doubling the functional density. This lowers cost by reducing the 
number or size of boards required. The military need for smaller 
or denser-packed systems make surface mounting a must. We 
meet this challenge offering af ull line of products in surface mount 
packaging, including Cerpack (W) and Leadless Chip Carriers 
(LCC). 

For more information on military products and applications, see 
the military datasheet section (page 5-415). 
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PAL32VX10 


74 Configurable IADs 
1 00 Configurable Logic 

Blocks (CLBs) 
1-16 PT/CLB Output 
1800 Gate Equivalent 
TTL or CMOS Input 

Levels 
Registered/Direct 

Inputs 
174 Registers 
On Chip Clock Gen. 
Asynch. Set/Res. Inputs 
Three-State 
Security Features 


AmPAL22V10 


58 Configurable l/Os 
64 Configurable Logic 

Blocks (CLBs) 
1-16 PT/CLB Output 
1200 Gate Equivalent 
TTL or CMOS Input 

Levels 
Registered/Direct 

Inputs 
122 Registers 
On Chip Clock Gen. 
Async. Set/Res. Inputs 
Three-State 
Security Features 


32 Inputs 
10 Outputs 
8-16 PT/Output 
Async. Reset/Sync. 

Preset 
Sync. Reset/Async. 

Preset 
Dual Feedback 
Registered Inputs 
Register Bypass 
J-K,S-R, DorT 

Rip Ffops 
Polarity 
Prefoad 
Three-State 
Security Fuse 


22 Inputs 
10 Outputs 
8-16 PT/Output 
Async. Reset/Sync. 

Preset 
Register Bypass 
Polarity 
Preload 
Three-State 
Security Fuse 













FUNCTIONALITY 







PAL20S10 
PAL20RS10 
PAL20RS8 
PAL20RS4 






PAL20RA10 






PAL20L8 
PAL20R8 
PAL20R6 
PAL20R4 




PAL20L10 
PAL20X10 
PAL20X8 
PAL20X4 


PAL16L8 
PAL16R8 
PAL16R6 
PAL16R4 


20 Inputs 
10 Outputs 
4 PT/Output 
Async. Clock 
Async. Set/Reset 
Register Bypass 
Polarity 
Preload 
Three-State 
Security Fuse 


20 Inputs 
10 Outputs 
1-16 PT/Output 
PT Sharing 
Polarity 
Preload 
Three-State 
Security Fuse 


20 Inputs 
8 Outputs 
8 PT/Output 
Three-State 
Security Fuse 


20 Inputs 
10 Outputs 
4 PTADutput 
XOR 
Preload 
Three-State 
Security Fuse 


16 Inputs 

8 Outputs 

8 Product Terms 

per Output 

Three-State 

Security Fuse 

















Note: PT = Product Term 



Figure 1. Military Configurable Architectures 
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Radiation Hardness 



Spurred on by America's Strategic Defense Initiative, military 
equipment manufacturers are lool<ing for radiation hardened 
partsto put into strategic weapon and space systems. It has been 
stated tiiat some lovo! of radiation tolerance v/ill bo required in up 
to 50% of all military applications by 1 990. Due to this increased 
need for radiation-resistant integrated circuits, MMI and AfvID 
have embarked on a program to determine what radiation dose 
rates its products will withstand, and to provide increasingly 
radiation-tolerant products. 

The Galileo probe to Jupiter was bombarded with 1 00,000 rads 
(Si) within a one hour period. In contrast, the earth's surface 
absorbs 0.1 rads in a year. Military aircraft must withstand a 
minimum of 2,500 rads total dose. A nuclear explosion 
emits transient- ionizing radiation of at least 10' rads (Si) per 
second. Radiation dosage and type varies greatly with the envi- 
ronment. Table 1 shows radiation dosages and semiconductor 
tolerances. 

Radiation effects can be divided into two major categories; 
natural space effects and man-made nuclear effects. 

Space radiation consists mainly of cosmic rays and charged 
particles In radiation belts (Figure 1). Cosmic rays from the sun, 
made up of protons and nuclei, cause single event upsets (SEU). 
A SEU occurs when a high-energy particle passes through an in- 
tegrated circuit, changing an internal logic state. 



Radiation belts around planets, such as the Van Allen Belt circling 
the earth, carry trapped protons and electrons. These charged 
particles accumulate overtime in CMOS devices, causing thresh- 
old voltage shifts. In oxido-isolatod bipolar circuits this ionizing 
radiation (total dose) will cause an increase in leakage current. 

A nuclear explosion irradiates a large neutron fluence and high 
energy gamma rays (Figure 2). Neutron radiation damages the 
semiconductor lattice structure, creating recombination sites for 
minority carriers, which decreases bipolar transistor current gain. 
Neutrons induce no change to CMOS devices, which are 
majority-carrier structures. 

Nanoseconds after a nuclear blast, short, high-intensity electro- 
magnetic pulses lash out. Difficult to stop, these gamma rays 
generate huge numbers of holes and electrons, causing large 
currents, destroying both bipolar and CMOS circuits. CMOS is 
especially susceptible due to its inherently low latch-up sensitivity 
of 10' rads per second. 

Emerging technologies and process improvements will make the 
basic military hardness requirements for semiconductor devices 
a reality (Table 2). Gallium Arsenide (GaAs) products now meet 
a no-upset dose rate of 5 X 1 0" rads per second and a total dose 
of 10' rads. Whilejunction-isolated bipolar devices also meet in- 
herently high dose rates of up to 1 0'° rads/second and total doses 
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TECHNOLOGIES 
GALLIUM ARSENIDE 

JUNCTION ISOLATED BIPOLAR 
OXIDE ISOLATED CMOS 
OXIDE ISOLATED BIPOLAR 
SILICON ON SAPPHIRE (SOS) 

STANDARD BULK CMOS 
NMOS 



TOTAL DOSE 
RADs (Si) 



Table 1. Radiation Perspectives 
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Natural 

Cosmic Rays 

• Protons and Nuclei 

• Causes Single Event Upsets (SEUs) 

Charged Particles 

• Electrons and Protons Found in Radiation Belts 

• Accumulated Ionizing Radiation Over Time (Total Dose) 
Measured in rads (Si) 

• Causes Accumulation of Trapped Charges -> Shifting 
Threshold Voltages in CfvlOS 

• Increases Leakage Currents in Bipolar 



Nuclear Effects 
Man-Made 

Neutrons 

• Neutron Fluence treasured in Neutrons Per Square 
Centimeter 

• Damages Lattice Structure -> Decreases Gain 

• Affects Bipolar More Than CMOS 

Gamma Rays 

• Transient-Ionizing Radiation (Dose Rate) Measured in 
rads (Si) Per Second 

• High Energy Photons of 10' rads Per Second 

• Causes Large Photocurrents -> Increases Supply Current 

• Results in Latch-Up and Permanent Damage from Burn-Out 

• Difficult to Shield Against 

• Ruins CMOS 




SPACE EFFECTS 




vo^"'" sEU-cosMic RAYS '' ' E : : 3\}E : : 3 !^ 

(PROTONS) '.. ^^-^-^ ^-^ M_l_l/.. 



25,000 W 



IL"^ 



VAN ALLEN BELT 
(PROTONS AND ELECTRONS) 



=g^' 




NUCLEAR EFFECTS 



NUCLEAR EXPLOSION 
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Figure 1. Space Effects 



Figure 2. Nuclear Effects 
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of up to 10' rads, radiation-hardened CMOS circuits are not far 
behind with a typical dose rate of 1 0'° rads/second and a total 
dose of 1 0^ rads (Table 3). 

Process changes to improve the radiation tolerance of CMOS 
devices are not without penalties. Guard rings reduce latch-up 
sensitivity and leakage paths, but decrease circuit density. Thin- 
ner gate oxides increase radiation resistance but cause reliabil- 
ity problems due to hot electron effects. A dry oxide process 
produces less defects but increases manufacturing time. Done 
correctly, these sacrifices will produce a marketable 
radiation-hardened CMOS process. 



At Monolithic Memories, radiation data has been obtained for 
neutron fluence and dose rate effects. Products representing 
bipolar junction and oxide-isolated processes passed neutron 
irradiation levels of up to 10'^ neutrons per square centimeter 
(Table 4). Dose rate data obtained on the junction-isolated 
products showed recovery in fifty to seventy microseconds from 
a one microsecond pulse of 2 X 1 0'° rads (Si) per second (Table 
5). MMI and AMD will move from radiation testing towards new 
radiation-tolerant designs and processes as the need for radia- 
tion-hardened products increases. 

Space Effects 



RADIATION TYPE 


AIRCRAFT 


NUCLEAR 


SPACE 


Dose Rate 
with no upset 
(rads/second) 


10» 


10" 


— 


Dose Rate 
with no latch-up 
(rads/second) 


10'° 


10'= 


— 


Total Dose 
(rads (Si)) 


2500 


5000 


10^ 


Neutron Fluence 
(neutrons/square 
centimeter) 


100 


10'^ 


— 


Single Event Upset 
involving electrons 
(bits/square 
centimeter) 


— 


10-'° 


— 


Single Event Upset 
involving protons 
(bits/square 
centimeter) 


— 
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10-'* 
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106 


• GaAs 






• NMOS/SOS 

• CMOS/SOS 


ECL^— 

TTL • 

LST^L • 




• NMOS 

. PMOS 

. CMOS 


|2l . 










• MOST DISC 








DO 

(F 


SERi 

ADS/ 


ME 
S) 





Table 3. Typical Ionizing Dose Rate Upset Levels 



Table 2. Basic Military i-Iardness Requirements 
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PRODUCT 


DESCRIPTION 


PROCESS 


PASSED 

DATASHEET 

LIMITS UP TO 

(neutrons/cm^) 


PASSED 

FUNCTIONAL UP TO 

(neutrons/cm') 


PAL16R4AM 


20 Pin, 30 ns 
PAL Device 


Junction Isolated 
Diffused Bipolar 


4.5X10'' 


1 xlO'* 


PAL16R4BM 


20 Pin, 20 ns 
PAL Device 


Shallow-Junction Isolated 
Implanted Bipolar 


4.5x10" 


1 x10'« 


PAL16R4DM 


20 Pin, 15 ns 
PAL Device 


Oxide Isolated 
Bipolar 


4.5 xlO'^ 


1 xlO'' 


PAL20R4AM 


24 Pin, 30 ns 
PAL Device 


Junction isolated 
Diffused Bipolar 


8x10'^ 


1 XlO'* 


PAL20RA10M 


24 Pin, 

Asynchronous 

PAL Device 


Junction Isolated 
Diffused Bipolar 


9.6x10" 


1 xlO'^ 


PAL32R16M 


1500 Gate, 40 Pin 
MegaPAL™ Device 


Junction Isolated 
Diffused Bipolar 


1 x10'« 


1 xlO" 


53S1681 


16KPR0M 


Junction isolated 
Diffused Bipolar 


4.5x10" 


4x10" 


53RA1681A 


16K Registered 
PROM 


Junction Isolated 
Diffused Bipolar 


8x10" 


1 XlO'* 


53S3281 


32K PROM 


Junction isolated 
Diffused Bipolar 


1x10'* 


1 XlO" 



Table 4. Neutron Bombardment Data 
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PRODUCT 


DESCRIPTION 


PROCESS 


DOSE RATE 
(rads/second) 


PHOTOCURRENT 
(amps) 


PAL12H6M 


20 Pin Small 
PAL Device 


Junction Isolated 
Diffused Bipolar 


2.6x10'" 


4.2 


PAL14H4M 


20 Pin Small 
PAL Device 


Junction Isolated 
Diffused Bipolar 


2.1 xlO'" 


3.9 


PAL10L8M 


20 Pin Small 
PAL Device 


Junction Isolated 
Diffused Bipolar 


2.5x10"= 


4.2 


PAL12L6M 


20 Pin Small 
PAL Device 


Junction Isolated 
Diffused Bipolar 


2.2x10"> 


3.3 


PAL14L8M 


24 Pin Small 
PAL Device 


Junction Isolated 
Diffused Bipolar 


1.2x10'° 


3.6 


PAL16L6M 


20 Pin Small 
PAL Device 


Junction Isolated 
Diffused Bipolar 


2.3x10'° 


4.0 


PAL16L8M 


20 Pin Standard 
PAL Device 


Junction Isolated 
Diffused Bipolar 


2.5x10'" 


5.3 


PAL16R8M 


20 Pin Standard 
PAL Device 


Junction Isolated 
Diffused Bipolar 


1.9x10'° 


5.3 


PAL16R6M 


20 Pin Standard 
PAL Device 


Junction Isolated 
Diffused Bipolar 


2.4x10'° 


5.7 


PAL16R4M 


20 Pin Standard 
PAL Device 


Junction Isolated 
Diffused Bipolar 


2.1 x10'° 


5.5 


PAL20L10M 


24 Pin Exclusive-OR 
PAL Device 


Junction Isolated 
Diffused Bipolar 


2.1 X 10'° 


6.7 


PAL20X10M 


24 Pin Exclusive-OR 
PAL Device 


Junction Isolated 
Diffused Bipolar 


1.9x10'° 


5.9 


PAL20X8M 


24 Pin Exclusive-OR 
PAL Device 


Junction Isolated 
Diffused Bipolar 


2.1 x10'° 


5.8 



Table 5. Transient Upset Test Data 



^ 



Monollthio 



m 



Memories 



2-585 



Notes 




2-586 



£1 MonollthloifSJi Memories S^ 



Article Reprints 



Article Reprints 

Programmable Logic Device Preserves Pins, Product Terms (PAL32VX10) 2-589 

PLD Programmability Extends its Sway Over Complex I/O (PALG29M/MA16) 2-593 

PROSE Devices Simplify State Machine Design (PMS14R21) 2-601 

Designing a State Machine With a Programmable Sequencer (PMS14R21) 2-607 

FPCs and PLDs Simplify VME Bus Control (Am29PL141) 2-619 

Fuse-Programmable Chip Takes Command of Distributed Systems (Am29PL141) 2-633 

PAL Device Buries Registers, Brings State Machines to Life (AmPAL23S8) 2-639 

Programmable Event Generator Conquers Timing Restraints (Am2971) 2-644 

Wait-State Remover Improves System Performance (PAL22V10) 2-648 

PLDs Implement Encoder/Decoder for Disk Drives (PAL22V10) 2-651 

Mixing Data Paths Expands Options in System Design (PAL22V10) 2-650 

Programmable Logic Chip Rivals Gate Arrays in Flexibility (PAL22V10) 2-670 

XOR PLDs Simplify Design of Counters and Other Devices (PAL20X10) 2-676 

The PAL20RA10 Story —The Customization of a Standard Product (PAL20RA10) 2-683 

PLDs Abound: RAM-Based Logic Joins In 2-699 

Introduction to Programmable Array Logic 2-702 

Logical Alternatives in Supermini Design 2-71 1 

Conference Proceedings 

New PAL Device Architecture Extends Design Flexibility (PAL32VX10) 2-719 

PROSE Architecture and Design Methodology (PMS14R21) 2-724 

Blazing Fast PAL Devices Enable New Application Areas (PAL16R8-10, PAL10H20P/G8) 2-730 



.icIiilAll 



MonoinhiG\f\M Memories L\ 2-587 



Notes 




2-588 



,iaM 



Monolithic Vului nflemories 



^ 



DESIGN APPLICATIONS 



Programmable Logic Device 
Preserves Pins, Product Terms 

Chris Jay 

Monolithic Memories Inc., 2175 Mission College Blvd., Santa Clara, CA 95054; (408) 970-9700. 
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Programmable-logic devices continue to evolve, re- 
sponding to designers' demands for improved per- 
formance, functionality, and flexibility. Among the 
newest is the PAL32VX10, a programmable-array 
logic (PAL) device that enhances the architecture 
of the earlier PAL22V 10. The result is a more flexi- 
ble general-purpose device. 

As a functional and architectural superset of the 
22V10, the 32VX10 can be programmed to carry 
out any function that its predecessor can. It also 
adds two features: regis- 



Buried registers and 
exclusive-OR gates 
malce ttiis PAL device 
a sound foundation 
on wfiicfi to build 
state macfiines tfiat 
conserve I/O pins. 



ters that can be buried 
for designing state ma- 
chines, and exclusive- 
OR (XOR) gates for 
counter and JK func- 
tions (Electronic 
Design, Jan. 8, p. 45). 

The number of inputs 
to the fuse array of the 
latest programmable- 
array logic (PAL) de- 
vice has been increased to 32 by the addition often 
buried feedback paths. A macrocell output pro- 
grammed as an input accesses the chip's feedback 
path, which is buried within the fuse array. 

As a result, buried state machines can be config- 
ured in the device. Any logic design calling for 
many input pins can be configured with dedicated 
input pins and any of the of macrocell I/O pins 
(Fig. 1). This option leaves pins free for configuring 
outputs as inputs, and other Boolean functions may 
be realized in buried registers. Thus, functionality 
of the macrocell need not be lost. 

The macrocell's XOR gate (denoted by the X in 
the part number) simplifies state-machine design 
because it promotes gate product-term economy. 
This economy is the ability to create complex state- 
machine designs without exhausting the limited 
number of product terms in each macrocell. 

For example, there are ten D-type registers in the 
32VX10, one in each of its ten macrocells. The 
XOR input to each register and no more than three 
Reprinted with permission from Electronic Design, 
Vol. 35, No. 13 (May 28, 1987). Copyright 1987 Hayden Publishing Co., Inc. 



product terms transform a D-type register to a JK 
register. Once the JK register is configured, subset 
functions like the T (toggle) and an SR (set-reset) 
may be derived. 

The macrocell's product-term economy can be a 
result of the hold state inherently defined in the JK 
truth table — a function not offered by the D-type 
register alone (see the table). For a D-type register 
to hold its contents in a "set" state during several 
states, additional product terms must hold that 
condition until it is relaxed. In many tasks, apply- 
ing product terms as holding functions could ex- 
haust the limited supply in a given cell and compro- 
mise a state-machine design. 

A practical task for the hold function is in a line- 
sync generator. The blanking and line-sync pulses 
are programmed as JK functions, and a hold condi- 
tion is invoked for their active duration. 

The XOR gate also influences the output polari- 
ty of a logic signal. Active-high or -low logic signals 
may be created at the output pins. In PAL devices 
without polarity control, the output can be inverted 
by applying DeMorgan's Theorem to a sum-of- 
products architecture. This approach, however, 
brings with it the added baggage of more product 
terms. For example, one term containing N literals 
(Boolean variables) is inverted by creating the sum 
of N negated literals, with each negated literal us- 
ing a product term. 

Each of the 32VX10's ten macrocells is config- 
ured around a sum-of-products and XOR input to a 
D-type register, which is clocked from a dedicated 
input pin. Each macrocell can be programmed for a 
combinational or registered output. One product 
term, associated with the control to bypass the reg- 
ister, selects the path dynamically, if needed. 

Typically, the registers are programmed as a re- 
fresh counter for dynamic RAM. When a host mi- 
croprocessor must access the RAM for read and 
write cycles, its address-line outputs bypass the reg- 
isters, which contain the current refresh count for 
the RAMs. Instead, the outputs address the memo- 
ry inputs directly. Afterwards, the address outputs 



1:1 



MonoUiMo 



m 



Memories 



2-589 



Programmable Logic Device Preserves Pins, Product Terms 



from the registers are enabled again to refresh the RAM. 

In fast video applications, a similar philosophy is in ef- 
fect. The host microprocessor bypasses the contents of 
the video-memory address counter to update the video 
RAM. It is important, though, that any counter-incre- 
ment signal be disabled, when the registers are bypassed; 
an active clock input would overwrite any contents of the 
D-type registers. In a video RAM system, this might not 
be a disadvantage, but for dynamic-RAM arrays, over- 
writing a refresh counter could lead to loss of data. 

One product term might also dynamically influence 
the logic signal's output polarity. With a registered out- 
put, a dedicated register-output-polarity multiplexer se- 
lects either a Q or Q output, routing the selected signal to 
the output pin. One programmable polarity fuse selects 
the required path. This fuse does not affect the program- 
mable polarity associated with combinational outputs. 

The global product terms from the fuse array are dedi- 
cated to resetting the ten registers synchronously and also 
presetting them asynchronously. The action of these two 
product terms can be selected by the programmable-po- 
larity multiplexer for individual registered cells. 

For instance, for a registered output, the asynchro- 



nous-preset gate can be programmed to a synchronous- 
reset gate or vice versa. The single product-term input to 
the XOR gate can select the output polarity for either 
combinational or registered outputs. 

The second added feature of the new PAL device, its 
conversion of an internal D-type register to a JK-register 
cell, is demonstrated by five Boolean equations formed 
from a sum of products: 



Q*J+Q*K(I) 
Q*J+Q*K(2) 



Q*J+Q' 
Q*J+Q' 
Q*J+Q' 



K(3) 
K(4) 
K(5) 



Clock input Q_ 



Fuse 
array 



rh 



^^-D 



Synchronous reset O 






Output Enable 




Distributed product 
terms 8, 10, 12, 14, and 16 



n 



n 



HD 



Register bypass control 



Register bypass 



Multiplexer 



H> 



> Q 

P 



Multiplexer 



Buried register 
feedback 



Feedback from input pin 



Asynchronous preset 



1. Each of the ten macrocells in Monolithic Memories' PAL32VX10 PAL device 
is driven by an exclusive-OR gate, a feature that promotes frugal use of 
product terms. For example, the gates reduce the number of terms needed 
to Implement JK registers. The macrocells can also feed output and input 
signals back to burled registers. As a result, designing state machines with 
the device is easier. 



where the Boolean operators are: 

for an inversion; 
:+: for an XOR operation; 
+ for an OR operation; 
* for an AND operation; and 
:= is the state-machine operator "becomes equal to" 

Note that all five Boolean equations have true or com- 
plement conditions for either J_or K_; and a mixture of J 

and K or J and K functions. 

Three of the equations require 
the XOR function. Moreover, J 
and K active-high conditions 
can be used in equation 3 only 
with an XOR operator. 

The Boolean operators are 
compatible with the Palasm as- 
sembler-support software. Pa- 
lasm is a computer-aided 
(CAD) tool that generates pro- 
gramming information for 
PAL devices. The Boolean ex- 
pression information describes 
the operation of the device to be 
programmed. 

A good design specification 
done by a CAD tool like Pa- 
lasm should come with a com- 
plete comment field to describe 
the function of the Boolean 
equations. Palasm accepts a 
comment field in the descrip- 
tion of a logic-design file. When 
a semicolon precedes a state- 
ment, Palasm ignores that 
statement during the assembly 
process. 

The functional description of 
the logic circuits is in terms of 
Boolean statements. A later 
version of Palasm will accept 
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state-machine logic descriptions on a higher level. Then, 
designers will be able to deal with logic statements con- 
taining JK functions without describing them as a series 
of a sum of products, as is now the case. 

The ability to create subsets of a JK register gives the 
logic designer more flexibility to design sequential-logic 
circuits. One of the subset functions, for example, is the 
toggle (T) flip-flop, which lends itself to counter applica- 
tions. Binary counters require that the contents of any 
general register, Q^,, remain in a hold state until the prod- 
uct term of all the lesser significant registers goes to a log- 
ic high. Then the register must change state or toggle after 
the arrival of the next clock pulse. The T function may be 
created by tying the J and K inputs together and treating 
this composite as the T input. A hold state is maintained 
when T = 0, and a change of state results when T = 1 . 

The application of the toggle function to binary 
counters can be developed by considering a general regis- 
ter Qn in a binary count sequence, where the lesser signifi- 
cant registers are Q^.i, Qn.2>- • -Qi' Qo- Using equation 3, 
the JK description for a general register is given as 

Qn:-Qn:+:K*Qn+J*Qn (6) 

A hold condition must apply when the product term of 
all lesser significant registers is a logic zero, or: 

Qn-,*Qn-2*-Qi*Qo = 

and a toggle condition must apply when 

Qn-i*Qn-2*-Q.*Qo=1 
This develops to 

J=K = Qn-,*Qn-2*--Qi*Qo(7) 

which represents the T input to a toggle register. Substi- 
tuting equation 7 into 6 leads to 

Qn:-Qn:+:Qn*(Qn-.*Qn-2-Qi*Qo) 

+ Qn 

*(Qn-,*Qn-2*-Qi*Qo)(8) 

Substituting equation 7 into 6 for J and K inputs leads 
to 

Qn:=Qn:+:(Qn + Qn) 
*Qn-.*Qn-2*--Q,*Qo(9) 

Because (Q^, + Q^) = I, equation 9 reduces to 

QN:=QN(holdQN) 

:+: Qn-1 *Qn-2*---Q,*Qo (toggle Qn) (10) 

Equation 10 is a general equation for a register in a bi- 
nary counter. Only two product terms are used because 
the XOR gate makes it possible to invoke the toggle func- 
tion as a JK subset. Without the XOR gate, the general 
equation for register N would be 



Qn == Qn 

* Qn-i*Qn-2*.-Qi*Qo (toggle Qn) 

+ QN*QN-T(holdQN) 

+ QN*QN-2(h0ldQN) 



+ QN*Q,(holdQN) 

+ QN*Q„(holdQN)(ll) 

Equation 1 1 requires N product terms as hold func- 
tions. This means that for a 10-bit counter, the most sig- 
nificant register, Qg, would need nine individual product 
terms as hold functions and one additional product term 
for a toggle function. 
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2. Thanks to its oasy implementation of the JK func- 
tion and its subsets, the 32VX10 mokes a compact 
line-address counter and line-sync generator for 
video tasks. 
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In contrast, equation 10 makes use of the XOR func- 
tion, requiring one product term to hold and one product 
term to toggle. No matter how significant the register in 
the count sequence, the inclusion of an XOR function in 
the equation eliminates large numbers of product terms. 
The advantage of the XOR gate increases for a 10-bit, up- 
down counter because the required number of product 
terms doubles. Q, would normally need 20 product terms 
but with an XOR gate, only three do the job. 

A practical outcome of configurable JK and T registers 
is the design of a line-sync generator circuit for control- 
ling a video RAM, and for synchronizing line-display in- 
formation to a CRT interface (Fig. 2). Memory-address 
outputs MAq through MA^,, which are derived from an 
internal counter, drive the address inputs to the video 
RAM. One complete line of video information is dis- 
played and surrounded by line-sync and blanking pulses 
from two macrocells programmed as JK registers. 

The internal memory-address counter is programmed 
to address an 80-character screen and then issue an ac- 
tive-high blanking pulse. The line-sync pulse goes active- 
high when the counter reaches a decimal count of 79 (4F 
hex) and is turned off at a decimal count of 103 (67 hex). 
The blanking pulse is negated and the counter returns to 
zero after reaching a decimal count of 107 (6B hex). 

The JK functions for turning on line-sync and blanking 
pulses come from decoding the count equations. The 
blanking pulse overlaps the line-sync pulse to give front 
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and back porches to the video line-scan signal. Also, be- 
cause the host shares access to the video memory, it c an 
update video information. In that case, a Write (WR) is 
driven low to produce three-state video-address outputs. 

When at work, the microprocessor can change the con- 
tents of the video RAM during frame- or line-signal fly- 
back periods to avoid displaying random data during ac- 
tive display periods. To make such changes possible, the 
WR input can be derived from a smaller combinational 
PAL device that decodes the microprocessor's address 
lines and maps the video RAM int o its a ddress space. 

An asynchronous reset input (RST) powers up the 
PAL to a known state. With its Count Enable Input 
(CEI) and Output (CEO) signals, the device can cascade 
with other circuitry, such as a frame-sync generator and a 
video-shift register. 

The memory-address counters are programmed as T 
fiip-Hops; line-sync, blanking and CEO f unctions as JK 
registers. An Enable-State Input (ENST) enables input 
states through 2 as count enable inputs in preference to 
the CEI signal. When in the low state, ENST qualifies in- 
put states through 2 as active-high count-enable inputs. 

To complement the line-sync generator design, the 
32VX10 is programmed as a 7-bit video-shift generator. 
Seven registers shift video data and three are pro- 
grammed as a 3-bit counter to control loading and shift- 
ing of video data in the shift register. When all are high, 
counter output states through 2 cause the shift register 
to load and the line-sync generator to increment. 

For the example circuit, another 32VX10 is pro- 
grammed to support the line-sync generator. As with the 
other 32 VX 1 Os, the frame-sync output is programmed as 
a JK function. The internal counter accesses row-charac- 
ter information as well as the contents of the video RAM. 
The 32VX10 video-shift register accepts dot information 
from the character generator and supplies a parallel-to- 
serial output for the video display. Any parallel-to-serial 
output format can be handled by the 32VX10. 

Readers interested in the PAL design specifications 
may contact the author for further details. D 
Chris Jay, logic-cell-array applications manager with 
Monolithic Memories, has been with the company since 
1983. Three years were spent as a field-application engi- 
neer in Northern Europe. Before that, he worked for Texas 
Instruments in Bedford, England and Fairchild Camera 
and Instrument in Bristol, England, performing logic de- 
sign and applications engineering, respectively. Jay earned 
a BSc (honors) degree from Essex University in Coulches- 
ter, England in 1977. 
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Electrically erasable CMOS parts, the 

densest 24-pin SLIM PALs yet, reach 

new heights in architectural flexibility 



Om Agrawal, Product Planning Manager 

Fares Mubarak, Senior Test Engineer 

Field-Programmable Logic Division 

Advanced Micro Devices, Inc., Sunnyvale, CA 



wo electrically erasable PAL 
devices being introduced here 
by Advanced Micro Devices 
offer by far the greatest flexibility of 
any programmable array logic 
(PAL) device to date. Several fea- 
tures contribute to the adaptability 
of these CMOS ICs — the highest yet 
number of product terms (188 and 
178) of any 24-pin PAL; the opti- 
mization of one device, the AmPAL- 
29M16, for synchronous designs and 
the other, the AmPAL29MA16, for 
asynchronous and glue logic applica- 
tions; and the convenience of some 
11,000 electrically programmable 
and erasable interconnecting cells 
(see Figs. 1 and 2). 

But the most distinctive enhance- 
ment is the programmable input and 
output bidirectionality of each de- 
vice's 16 I/O logic macrocells (I/O- 
LMs) . Until now, some program- 
mable logic devices (PLDs) have 



had output macrocells, but none has 
had input macrocells, let alone the 
I/O type. The I/O pin on each I/O- 
LM can in addition be configured as 
combinatorial, as an edge-triggered 
register, or as a transparent latch. 

What's more, on 8 of the 16 I/O 
pins, the associated 8 register/ 
latches can serve independently of 
the pins as programmable buried 
state registers. Current 40-pin 
CMOS EPLDs have only half that 
number. These internal registers 
offer designers an extra eight flip- 
flops for such jobs as keeping track 
of flags or generating timing and con- 
trol signals. 

The 11,000 or so interconnecting 
elements are electrically erasable 
(E') cells that define what each 
PAL device can do. They connect all 
array inputs to the programmable 
AND-OR array, and they configure 
the architecture of the I/OLMs. The 
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user programs these cells by charg- 
ing and discharging them, so that de- 
sign changes can be made quickly — 
far faster than for ultraviolet-light 
erasable PLDs, or EPLDs. 

The record number of product 
terms, 16 macrocells, and horde of 
E^ cells make the two newcomers 
the densest 24-pin SLIM PAL de- 
vices available, whether bipolar or 
CMOS. The ICs are five to six times 
denser than industry-standard, first- 
generation 20- and 24-pin EPLDs 
and E^PLDs and twice as dense as 
second-generation 24-pin parts. In 
fact, these CMOS PALs are denser 
even than 40-pin bipolar PLDs. 

Their 35-ns max input-to-output 
propagation delay and external clock 
frequency of 20 MHz max fixes the 



new CMOS devices in the medium 
speed range expected of bipolar 
imits. Maximum active power dissi- 
pation is 120 mA. Each comes in two 
logic levels— CMOS HC and CMOS 
HCT logic for TTL systems. (The 
trademark PAL belongs to Mono- 
lithic Memories.) 

From what follows, it will become 
clear that the basic I/O logic macro- 
cell structure is the same for both the 
synchronous 29M16 and asynchro- 
nous 29MA16. Differences arise, 
though, in the I/OLM control mech- 
anisms — their clocking, resetting 
and presetting, and output enabling. 
Also, the distribution of the product 
terms between logic and control 
functions is different for the two 
PALs. The testing and debugging 



Fig. 1. The architecture of the synchronous AmPAL29M16 E^ programmable 
logic device derives much of Its flexibility from Its 16 bidirectional Input and 
output logic macrocells. The macrocells may use either one or tv/o clocks, 
and each controls the output of on average 1 1 logic product terms. 



features, though, are identical. In 
conclusion, an application of the 
29M16 will be outlined — the use of 
nearly a dozen of these packages to 
build a 32-bit direct memory access 
(DMA) controller. 

I/OLM with a difference 

The flexibility of the I/O and stor- 
age elements of these parts lends 
them the look more of a gate array 
than a PAL device (see Figs. 1 and 
2) . Eight of the I/OLMs have single 
feedback to the AND/ OR array, and 
eight have dual feedback. Single 
feedback comes from either the I/O 
pin or the storage element, dual feed- 
back comes from both. 

It's when an I/O pin is used for 
input that its associated storage ele- 
ment can be turned to use as a buried 
register or latch. As many as all eight 
of the dual -feedback I/OLMs can 
be devoted to user-programmable 
buried-state register/ latches. These 
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are helpful for both synchronous and 
asynchronous state-machine designs. 
The presence, number, and pro- 
grammability of the E^ cells within 
the I/OLMs is the most significant 
architectural variable of these PAL 
devices. (Second-generation PLDs 
had three to four times fewer archi- 
tectural cells.) Is the macrocell to be 
an input or an output? An edge-trig- 
gered D flip-flop or a transparent 
latch? What are the macrocell's 
clocking, feedback, and output- 
enable schemes and output polarity? 
What if it frees up a buried state 
register? 

Programming the E^ cells 

All these decisions are made by 
programming the E" cells. Single- 
feedback macrocells have nine E- 
cells, dual-feedback macrocells have 
eight. In its virgin, erased (charged 
or disconnected) state, an E- cell's 
value is 1. In its programmed (dis- 
charged or connected) state, it is a 0. 



Both synchronous and asynchro- 
nous I/OLMs contain a configurable 
storage-element cell and five multi- 
plexers—two 2:1 multiplexers for se- 
lecting I/O and feedback path and 
three 4:1 multiplexers for selecting 
output path, clock, and output en- 
able (see Fig. 3). 

Whether a macrocell is to be an 
input or output and the nature of 
its storage element (edge-triggered 
register or transparent latch) is set 
by E- cells So and S3. In its virgin 
state, the I/OLM is an output cell 
and the storage element is an edge- 
triggered register. It can be changed 
by the user to an input latch or regis- 
ter or output latch by appropriate 
programming of S2 and S3. 

The polarity (active high or active 
low) of the cell's output and its se- 
quential (registered or latched) or 
combinatorial nature are determined 
by Sq and Sj. With both of these 
cells in the erased (charged) state, 
for example, the output is set to be 



Fig. 2. The 16 I/O macrocells in the asynchronous AmPAL29MA16 can each 
be allotted Its own product-term-driven clock. The PAL's need for more 
control terms leaves an average of 7 logic product terms per macrocell. 



active low and combinatorial. 

By programming, or discharging, 
cell Sq, the designer makes the out- 
put go active high. The OR-gate out- 
put bypasses the storage element 
and feeds the output-path selection 
multiplexer directly. 

On the other hand, by program- 
ming Sj, the designer makes the out- 
put sequential (registered/ latched) . 
The Q output of the storage element 
passes through the 4:1 multiplexer 
to an inverting output buffer and is 
fed back to the AND array inter- 
nally. This configuration is particu- 
larly useful for state machine de- 
signs. Programming both Sq and S^ 
results in an active-high, sequential 
output. 

The feedback multiplexer for the 
I/OLMs with single feedback is con- 
trolled by yet another E^ cell (Sg). 
This separate control of the feedback 
multiplexer makes for the ultimate 
in flexibility in state machine design. 
Each I/OLM, with registered, com- 
binatorial or latched outputs, choice 
of polarity, and dual feedback, has 
12 possible output configurations. 
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The timing needs of synchronous 
state machines are much simpler 
than those of asynchronous and glue 
logic — hence the difference in the 
two PALs' clocking, output-enable, 
and preset/ set arrangements and 
in the number of product terms as- 
signed to logic and control. 

For synchronous state-machine 
design, the flexibility of one or two 
synchronous clocks, preferably with 
polarity control, is often needed for 
individual macrocells. It is impor- 
tant for designing single-, dual-, or 
quadruple-state machines with a var- 
iable number of state registers. Pro- 
grammable clock polarity offers the 
ability to trigger certain groups, or 
banks, of registers on one edge and 
other groups on the opposite edge. 
The result is faster internal state 
machines. 

Early PAL devices had a single 



synchronous clock. Some newer syn- 
chronous CMOS erasable PAL de- 
vices offer dual clocks, but the two 
are selectable for a bank of 8 regis- 
ters at a time, rather than for 16 
macrocells individually. Thus, they 
are limited to, at most, two different 
state machines, each with an eight- 
register maximum. 

In addition, these PAL devices us- 
ually have two separate clock pins. 
So, if the designer needs a single- 
state machine with up to 16 registers, 
the second clock pin is wasted. 

Better clocking 

The 29M16S alleviate these draw- 
backs. Each I/OLM has a 4:1 pro- 
grammable clock-selection mecha- 
nism selected by two E^ cells, S4 and 
S5. For addressing synchronous 
logic, the 29M16 has one dedicated 
CLK/LE (latch enable) pin and 



Fig. 3. The I/O logic macrocell, in either the synchronous AmPAL29M16 gate 
array (with input gates shaded yellow) or the asynchronous AmPAL29MA16 
(Inputs on yellow background at far left), depends for its functionality on how 
the erasable S cells configure the multiplexers and storage cell. 



one CLK/LE /I (input) pin. For 
macrocells configured as registers, 
these pins are clock pins; for latches, 
they're latch-enable pins. 

Either the clock or latch-enable 
pin may be selected to trigger each 
storage element on either the rising 
or trailing edge. With this, system 
designers can implement up to a 
quadruple-state machine, with each 
meant to have a different number of 
registers. 

Also, since the device offers 16 
I/OLMs with input register/ latches, 
the PAL is applicable to pipelined 
systems with eight input latches and 
eight output registers. The dual- 
clock capability allows the PAL to 
run with synchronous or asynchro- 
nous inputs. The clocks' program- 
mable polarity allows the internal 
state machine to run at twice the 
speed of the external clock. 

Further, since only one pin is ded- 
icated as a clock/ latch enable, a sin- 
gle state machine need not waste the 
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second clock pin. It can be an extra 
input. 

While one or two synchronous 
clocks with programmable polarity 
are sufGcient for synchronous state 
machines, asynchronous and glue 
logic applications have more de- 
manding clock/ latch requirements. 
They call for more flexible PAL de- 
vices with combinatorial outputs and 
feedback and with edge-triggered 
registers and transparent latches 
having individual clock/ latch-enable 
capability (preferably with pro- 
grammable polarity) . 

In addition, they require individ- 
ual control of reset and preset and 
output-enable product terms, as well 
as slightly fewer product terms per 
output. Further, for asynchronous/ 
glue logic applications, the clock-se- 
lection and output-enable schemes 
should be independent. 

The 29MA16 addresses these 
needs with up to 16 product-term- 
driven asynchronous clock/ latch-en- 
able controls. Each of these controls 
is a product term driven from the 
AND-OR array as a function of cer- 
tain input signals. These, in turn, 
are controlled by the user according 
to the application. 

Like its synchronous counterpart, 
each I/OLM of the asynchronous 
29MA16 has a 4:1 multiplexer for 
clock selection. Unlike the other, 
however, it allows the user to select 
from either a common synchronous 
clock/ latch enable or an individual 
clock driven by a product term. 

Preset and reset 

The synchronous 29M16 uses 
common asynchronous reset and 
preset controls for all registered or 
latched inputs and outputs. When 
the product term for asynchronous 
preset is asserted, all the register/ 
latches are immediately loaded with 
a high, independent of the clock. 
Conversely, when the asynchronous 
reset product term is asserted, all the 
register/latches are immediately 
loaded with a low, independent of 
the clock. Actual output depends on 
the macrocell polarity. 

The a.synchronous 29MA16, on 
the other hand, offers individual 
asynchronous reset and preset con- 
trol for each of its storage elements. 



This feature, plus the individual 
clock capability, makes it easy for 
the device to replace a large amount 
of glue logic. 

As in all PAL devices, the size, 
organization, and distribution of the 
internal product terms of the 29M16 
and 29MA16's AND-OR array de- 
termine its logic capability. Both 
PAL devices can have 29 inputs, of 
which 24 issue from the I/O logic 



macrocells (8 from I/OLMs with 
single feedback and 16 from I/OLMs 
with dual feedback) , 1 comes from 
the I/OE pin, and 3 are dedicated. 
The 29MA16 adds a fourth dedi- 
cated input, but the 29M16 instead 
has the CLK/LE/I pin. 

Of the 29M16's 188 product terms, 
all of 176 are logic terms and 12 are 
control terms. The 176 logic product 
terms are distributed among the 16 



PALs move to ever-greater flexibility 



Important gauges of a PAL device's flexi- 
bility are the number and complexity of 
its input/output logic macrocells. With 
the introduction of its electrically eras- 
able, CMOS-based AmPAL29M16 and 
AmPAL29MA16, Advanced Micro Devices 
has brought more flexibility than ever to 
programmable logic devices. 

First-generation, bipolar fuse-pro- 
grammable PAL devices had at most 
eight outputs fixed at the factory as 
either registered or combinatorial with 
active high or low polarity. The second 
generation improved on this somewhat 
by adding 10 programmable outputs con- 
figurable as registered or combinatorial 
and active high or low. 

Although some second-generation 
CMOS EPROM-based PLDs offer as many 
as 16 logic cells, these can be configured 
only as outputs. Further, these outputs 
lack a transparent-latch capability, a re- 
quirement for both asynchronous and 
synchronous designs. 

Architectural flexibility also extends to 
features such as variable product terms, 
varying the number of product terms per 
output, and programmable buried states. 
Variable product terms, first introduced 
in bipolar PAL devices like the AmPAL- 
22V10, help the device match the appli- 
cation. Programmable buried states use 
internal storage elements for state-ma- 
chine design; until now, PAL devices 
lacked independent buried states. Such 
elements were used only in conjunction 
with I/O pins. Consequently, when an 
I/O pin was an input, its associated stor- 
age element was wasted. 

At first, PAL devices were produced 



with fuse-based bipolar technology; they 
could not be reprogrammed. Next genera- 
tion EPROM-based PLDs (or EPLDs) are 
reprogrammable, as are the still newer 
electrically erasable PLDs (E^PLDs). 

To be erased, EPLDs must be in ex- 
pensive windowed ceramic packages 
through which they're exposed to ultra- 
violet light for a lengthy time. Easier to 
reprogram, E'PLDs fit in inexpensive 
plastic packages. 

Complete testability is another sig- 
nificant advantage of E' technology. Un- 
like bipolar and EPLDs, which require 
additional testing circuitry, E^ devices 
use only the array and its operating cir- 
cuitry to make DC, AC, and functional 
tests. 

Finally, a PAL device must meet the 
specific demands of combinatorial, syn- 
chronous, and asynchronous applica- 
tions. While some CMOS PLDs have at- 
tempted to address the broad range of 
applications with a "one-size-fits-all" 
approach, this has limitations. The device 
tends to have extra features that show 
up in overhead while not providing others 
that are key. The result is expensive and 
less than optimal. 
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I/OLMs, as was shown in Fig. 1. The 
eight I/OLMs with dual feedback 
have eight product terms each, while 
of those with single feedback, four 
have 12 product terms and four have 
16. The average is 11 product terms 
per output. 

This variable distribution of prod- 
uct terms, as well as the number of 
product terms per output, allows 
more complex functions — such as 
specialized counters and more com- 
plex state machines — to be imple- 
mented in a single device. 

Of the 29M16's 12 control terms, 
two are used as common asynchro- 
nous reset and preset, and eight are 
output-enable product terms — one 
pair of common AND-XOR product 
terms per bank of four registers. The 
remaining two are used for observa- 
bility in testing and for preload of 



the device's storage element. 

As for the 29MA16, its smaller 
total of 178 product terms is divided 
into only 112 logic terms and all of 
66 control terms. The logic product 
terms are also distributed in a var- 
iable fashion (see Fig. 2 again); 
the average is 7 product terms per 
output. Of the control terms, four 
are distributed to each of the 16 
I/OLMs, and the remaining two are 
for observability and preload. 

Output enable control 

The I/O pin for both new devices 
may be configured as a dedicated (or 
permanently disabled) input, a dedi- 
cated output, or dynamically con- 
trolled. This flexibility results be- 
cause the connection from the I/O- 
LM to its I/O pin is controlled by a 
4:1 multiplexer. 



On both devices, three of the in- 
puts to this multiplexer come from 
ground, Vcc. and the I/0-enable pin. 
On the synchronous 29M16, the 
fourth input to the multiplexer is 
driven by the two AND-XOR prod- 
uct terms already mentioned as be- 
ing common for a bank of four 
I/OLMs. In synchronous state ma- 
chines, this arrangement allows the 
I/O pins to be controlled in groups 
of four. The AND-XOR product 
term control also allows for program- 
mable polarity, a useful feature for 
bus-control applications. 

On the 29MA16, this fourth input 
is driven by an individual product 
term from the AND-OR array. Such 
individual product-term control 
gives users maximum flexibility in 
configuring the I/O pins. 

Both the parts have built-in fea- 
tures to simplify testability and sys- 
tem-level debugging. These include 
a preload and a power-up reset for 
testability and an observability 
product term for debugging and for 
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tracing buried state registers. 

When the observability product 
term is asserted, it suppresses the 
combinatorial output data, prevent- 
ing the data from appearing on the 
I/O pin. It then presents the con- 
tents of the register/ latches on the 
output pin for each of the logic mac- 
rocells. 

The built-in power-up reset ini- 
tializes the register/ latches to a 
known state, a feature that simpli- 
fies state machine designs. A secvir- 
ity cell in both devices protects the 
logic against unauthorized copying. 

A multichannel DMA controller 

The AmPAL29M16/29MA16 are 
suitable for applications ranging 
from glue and general-purpose logic 
replacement to synchronous and 
asynchronous state machines. Fairly 
complex Mealy and Moore state ma- 



INITIALIZE 



chines can be readily designed, and 
the I/O latches support double pipe- 
lined systems. 

Figure 4 is the block diagram of a 
direct memory access (DMA) con- 
troller built for the 32-bit 68020 mi- 
croprocessor. It is a simple, four- 
channel, single-cycle, 32-bit mem- 
ory-to-memory data-transfer design. 

The 68020 processor accommo- 
dates memory blocks and peripheral 
widths ranging from 8 to 32 bits on 
its system bus. It also imposes cer- 
tain connectivity requirements on 
slave devices: 8-bit devices must go 
to the uppermost 8 bits of the data 
bus, while 16-bit devices must con- 
nect to the data bus's upper word. 

Consequently, issues related to the 
data width for the DMA operation 
are the data-transfer size (8, 16, or 
32 bits), the data-funneling scheme 
( for packing and unpacking data 




to support a flexible data-transfer 
scheme) , and block size. 

The DMA logic comprises two ad- 
dress pointers (one for source and 
one for destination); a 32-bit data 
latch; burst, wait, and block-length 
counters; an 8-bit command register; 
a 16 X 16-bit RAM; and the DMA 
sequence controller. This design uses 
a total of 11 CMOS AmPAL29M16s 
— 9 to implement pointers, latch, and 
counters and the remaining couple in 
the sequence controller. 

The two pointers have a width of 
30 bits divided into an upper 16-bit 
segment register field and a lower 
14-bit offset register field. The 16- 
segment register field allows data to 
be transferred in 64-K segments, 
each segment being a maximum of 
16-K blocks (or 64-K words, with 4 
words in each block) . 

The task of segment register main- 
tenance comes under CPU control. 
Thus, if the data to be transferred 
spans a segment boundary, the CPU 
must break up the transfer into two 
separate operations and set up seg- 
ment registers for it. 

The 14-bit offset register's value 
is loaded into the RAM locations by 
the CPU before the DMA operation 
begins. In this example the DMA 
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BRST_ZER = BURST COUNTER AT ZERO 
CNT_2ER = BLOCK COUNTER AT ZERO 
WAIT-JER = WAIT COUNTER AT ZERO 
DSACK = DATA ACKNOWLEDGED 
BG = BUS GRANTED 
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WS = WRITE STATE 



Fig. 5. The state diagram of the DMA controller's 
operation shows that it will cycle continuously 
through the read and write states until some other 
bus arbiter is granted control of the bus. 
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transfers only long words, so the 
least significant address bits Aj and 
Aq are stuck-at-zeroes. The offset 
register is incremented after every 
DMA read/write cycle unless the 
command register says not to. 

A 32-bit data latch temporarily 
holds the long-word data fetched 
from the source. During destination 
writes, this becomes the source for 
the data. The burst, wait, and block 
count registers are each of them 14- 
bit decrementers. 

The burst register counts down af- 
ter every DMA read/ write cycle. At 
zero, the data and address buses are 
relinquished by the DMA controller 
for a number of cycles defined in the 
wait register. 

The 14-bit block counter decre- 
menter, whose value is loaded into 
RAM locations by the CPU, keeps 
track of the number of long words 
transferred in each current DMA cy- 
cle. The counter is loaded with a 
value by the CPU before each new 
DMA operation and is decremented 
by the DMA control logic at each 
transfer. A count of zero indicates 
the end of current DMA operation. 



The control sequencer 

The DMA control sequencer has 
three jobs. It handles loading both 
the pointers and the registers from 
the RAM locations. It controls the 
DMA read/ write cycles. And it con- 
trols the interfaces to the CPU ( han- 
dling of burst and wait registers) and 
the memory. 

Because of I/O limitations in the 
sequence, the buried registers asso- 
ciated with inputs implement the 
state sequence. The output-enable 
structure enhances the individual 
I/O control of every macrocell, and 
the clock scheme is used by trigger- 
ing on both edges of the system clock 
to improve the DMA read/ write cy- 
cle throughput. 

According to the state diagram for 
the DMA sequence controller (see 
Fig. 5) , the device stays in the idle 
state until a DMA operation is 
started by the CPU's writing the 
GO bit in the command register. 
The controller then loads the point- 
ers and the counter registers from 
the RAM locations and requests 
the bus. After the bus is granted, it 
initiates memory-to-memory data 



transfers — read followed by write 
cycles. (One memory-to-memory 
transfer cycle is a single operation.) 

DMA preemption 

The DMA preemption is per- 
formed in write state WS3. When 
the external arbiter signals that the 
bus is needed by another potential 
bus master (by negating bus grant), 
the DMA controller gives up the bus 
and enters the bus-request state. It 
stays there until the bus control re- 
turns to the DMA controller. 

In state WS3, if the block coxmter 
reaches value 0, then it returns to the 
idle state. If the burst length has 
reached zero, but the counter value 
is not zero, then it goes to the wait 
state until the wait register value 
reaches zero. Then it transfers to the 
"load burst register" state. In state 
WS3, however, if both the coimter 
and burst-register values are not 
zero, and if no other bus arbiter 
needs the bus, the controller contin- 
ues memory-to-memory data trans- 
fers until it is done. D 
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PROSE Devices Simplify 
State IViacliine Design 



A bipolar device combines a PAL device for conditional branching 
with a PROM device for storing system control functions to make 
the design of small state machines easier. 



Control logic is required in a broad range of ap- 

Fplications — from low-level control of custom 
bit-slice microprocessors, to bus arbitration and 
timing generation in conventional microprocessors, 
to special-purpose needs like data encryption and 
decryption. Even though these applications vary in 
complexity, they all require a sequential-type device 
(a state machine, for example) that provides for se- 
quences of output signals based upon state tran- 
sitions and that includes some capability for 
branching. More complex sequencers provide capa- 
bility for looping and subroutining. 

Possible candidates for control logic range from 
Programmable Array Logic (PAL) devices, to the 
more complex Field Programmable Logic Se- 
quencer (FPLS) devices, to instruction-based 
microprogram controllers that rely on on-chip 
memory for expanded subroutine capabilities. 
While FPLS and PAL devices have limited product 
terms for many state-machine applications, the 
other microprogram controllers are sometimes 
more complex than required. For applications re- 
quiring control functions that small and medium 
state machines can represent, the Prose device from 
Monolithic Memories Inc (MMI) can facilitate con- 
trol logic. 

Combining a PAL device for conditional branch- 
ing and a PROM for storing the control functions 




Nick Schmltz 

Schmitz is a product planning manager for program- 
mable logic devices at Monolithic Memories Inc (San- 
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of the system. Prose is intended to function as a 
programmable sequencer (Prose = PROgrammable 
SEquencer). It offers a maximum of 128 states and 
up to four-way branching. A bipolar device oper- 
ating at up to 25 MHz, Prose functions as a Moore 
state machine. Under certain conditions, a Mealy 
state machine can also be modeled with Prose. 

Working faster and simpler 

Custom microprocessors, such as those built with 
bit-slice components, require control logic in the 
form of multiple IC microinstruction memories 
and sequencers to generate addresses for those 
memories. For complicated sets of instructions 
with many states and complex branching and 
subroutines, a microprogram controller makes 
sense. But in situations in which a more limited set 
of instructions and instruction transitions is re- 
quired, the single-chip Prose device is faster as well 
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Prose in a classic traffic-controller application 



Consider a four-way intersection. For each direc- 
tion (N, S, E and W), there are two sets of 
lights: one for the forward direction (F) and one for 
left turns (L). Each set has three lights: red (R), yel- 
low (Y) and green (G). For each of the four direc- 
tions, there are also two detectors (D): a left turn 
detector and a forward detector. 

In the master diagram for traffic-flow control (a), 
each box represents a set of states generated by 
the controller in response to the input conditions. 
For instance, DO NL is executed given the input 
conditions NLD */SLD (NLD and not SLD). 

Four Prose devices, a counter and an encoder 
are used to implement this controller (b). The in- 
puts to this controller are the six outputs of the en- 
coder and the carry out (CO) signal from the 
counter. The outputs are the control signals to the 
lights themselves. The encoder simply passes the 
NLD, SLD, ELD and WLD signals. It generates two 
new signals, NSD and EWD, as follows: 

NSD = NLD+NFD-i-SLD-t-SFD 
EWD = ELD-i-EFD-i-WLD+\A/FD 

Prose devices 1 through 3 generate the 24 out- 
puts that control the traffic lights, while Prose 4 
preloads the counter. Note that all four Prose de- 
vices receive the CO signal from the counter. CO 
tells the Prose devices when the preloaded 
number of cycles has been completed so they can 
generate the next state. The counter is used to 
conserve states in the Prose devices; that is, the 
devices simply cycle back to the same state until 
the desired number of cycles has been completed 
or until some condition detected by the sense de- 
tectors warrants jumping out of the loop early. 
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as simpler. For instance, when an ALU is used only 
for a multiply or filtering operation, Prose devices 
reduce chip count but can also generate the required 
instructions faster than the more complex micro- 
program controller. Prose devices can also be used 
for addressing the data memory to fetch the 
operands for the filtering operation. 

Consider also the case of a video controller. For 
scanning purposes, counters that operate in various 
sequences and count lengths are required. But in- 
stead of implementing these as actual counters, the 
sequences involved can be unlocked and imple- 
mented as state-machine transitions. And there's 
an advantage beyond the mere economy of parts. A 
count can be set or initiated and then left to poll 
various external hardware conditions while the mi- 
croprocessor performs other operations. 



Memory and bus arbitration applications offer 
further opportunities for using Prose devices. If 
two processors or other devices vie for memory ac- 
cess, a simple Prose state machine can implement 
the logic for deciding which device gets the bus 
and/or access to memory. In such a case, the heavy 
artillery of a microprogram controller and memory 
isn't required. 

As a sequencer for signal-processing applica- 
tions. Prose state machines offer speed and suffi- 
cient functionality without the overkill of a com- 
plex microsequencer paired with dedicated mem- 
ory. For simple algorithms, such as those involved 
in performing a fast Fourier transform, Prose de- 
vices can control the set of vectors that are mul- 
tiplied and added in the process. Via the instruction 
sets to the devices. Prose can control the operation 
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Consider a block such as DO NL. As we enter 
this bloci<, We have just completed flow In the east- 
west direction. The previous state has generated 
the preload for the counter so that as we enter the 
block DO NL with NLD* /SLD, the counter begins 
the count. Suppose that we have preloaded 110 
and count down. On the count following 000, CO 
goes high, and this tells the Prose devices to 
branch to another state. Figure (c) is a state transi- 
tion diagram for DO NL. The state "NLG NFG" re- 
cycles until CO or /NLD occurs. Figure (d) shows 
the details of DO NLSL; Figure (e) shows DO NFSF. 
These diagrams reflect all basic transition types in 
the controller. There are four blocks of the type DO 
NL, two of the type DO NLSL, and two of the type 
DO NFSF. In addition, there's one other block, ALL 
REDS, that contains only one state: all outputs red. 
So, from counting the possible states of the sys- 
tem according to branch type, we have 15 one-way 
branches, 10 two-way branches, three-way 
branches, and 4 four-way branches, resulting in 31 
of the 128 PROM locations used. 

This application uses four Prose devices— three 
forgenerating signals to control the lights and one 
for controlling the counter for state minimization. 
Although outputs are different, all fourdevices use 
the same inputs and execute the same transition 
table and thus can be considered to be in the same 
state, or to be generating a single state with a 
wider output word. This usage demonstrates how 
the Prose device can be expanded in width. When 
programming the devices, state transition and 
condition equations are identical and can be 
copied. Only the output equations are unique to 
each and must be programmed separately. 
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of the ALUs or multipliers and can control memory 
access when separate Prose devices are used. A 
Prose device is well suited to relatively simple oper- 
ations that don't involve complicated decision 
making. For complex digital signal processing op- 
erations, a programmable DSP device may be bet- 
ter, but it's not likely to be as fast as the dedicated 
hardware approach. 

In peripheral control, the simple state-machine 
approach can be efficient. Consider the case of run- 
length-limited code. Both encoding and decoding 
can be translated into state sequence diagrams, 
which examine the serial data stream as it's read, 
and can then be readily programmed into a Prose 
device. 

Industrial control and robotics also require sim- 
ple control functions. Tasks such as mechanical po- 



sitioning of a robot arm, simple decision making, 
calculating a trigonometric function and displaying 
several digits usually don't require a high-power 
microsequencer with stacks and pointers and mi- 
croprogram memory. What's required instead is a 
device that can store a limited number of states and 
allows simple branching upon conditions. 

Control logic alternatives 

A registered PAL device is a simple state machine 
and can be effectively used for simple control func- 
tions, such as counters, dynamic RAM controllers, 
interrupt controllers and certain types of video con- 
trollers. Combining a PAL device with memory 
such as a PROM, as MMI has done in Prose, gives 
the device much more powerful control capability. 
Another alternative for control logic is the FPLS, 
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Prose as a bus arbiter 



Prose can be handy in applications that require 
bus arbitration. Consider, for example, two 
microprocessors and refresh circuitry that require 
access to a blocl< of dynamic RAMs (a). 
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The arbiter, which is a Prose device, receives re- 
quests for access via condition codes CC(2-0). The 
device may also receive a synchronous initializa- 
tion request, /IS, which is intended to reset the sys- 
tem to an initial state. From these input signals, 
the arbiter must generate signals to control the ac- 
cess to the DRAMs. Thus it must generate ac- 
knowledge signals for the requesting devices — 
condition code acknowledge signals— /CCACK(2- 
0)— and /RAS (row address strobe) and /CAS 
(column address strobe) for the memories them- 
selves and MCS for row/column multiplexer con- 
trol. With this set of input and output signals, the 



Prose device can precisely control the access to 
memory. 

Once access is granted, a complete memory cy- 
cle requires five clock cycles. Three clocks are re- 
quired for access; two are required for DRAM 
precharging. /RAS is asserted from the second cy- 
cle to the end of the fourth, while /CAS is asserted 
from the third cycle to the end of the fourth. (Row 
and column addresses are asserted, as shown, 
prior to the assertion of /RAS and /CAS.) Both /RAS 
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CC0*/CC1*/CC2VIS 
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which has programmable AND and OR arrays as 
well as JK or clocked-RS flip-flops with internal 
registered feedback that allow sequencing and con- 
figurable I/O pins. FPLS devices have a fixed 
number of product terms that can be shared be- 
tween input logic, state transitions and output func- 
tions, dictated by the application needs. Because 
the Prose device has dedicated resources — the PAL 
and PROM arrays — these applications trade-offs 
aren't required. While not characteristic of the ar- 
chitecture, some FPLS sequencers operate at dif- 
ferent speeds depending on the logic functions 
implemented; for example, as the complexity of the 
logic functions increases, speed decreases. With the 
Prose device, however, operation speed is a con- 
stant. For many applications, FPLS devices and 
Prose devices are complementary; a design engineer 
should be able to use both, choosing the right de- 
vice for the application. 

Finally, some recently introduced instruction- 



based machines, similar to a PROM under control 
of a microsequencer, offer another option for con- 
trol logic. These machines are sophisticated devices 
with ample states for state machines, but they're 
too complex for simple control appHcations. More- 
over, user adaptability and customization is Hm- 
ited. Because they're powerful and have sufficient 
outputs, these machines can easily implement 
counter-type state machines. But they offer a 
limited choice of combinations of input literals for 
condition testing. While such devices form a power- 
ful microprogram controller/sequencer for gener- 
ating addresses, they don't provide a single-chip 
solution, and in many cases they're far more power- 
ful than the appHcation requires. 

Prose device overview 

The major building blocks in the Prose device are 
the PAL array, the PROM array, and the shadow 
and output registers. In brief, the PAL array gener- 
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and /CAS are removed during the fifth and the first 
cycle for DRAM precharging. 

The sequences of memory clock cycles with 
their associated control signals can be viewed as a 
set of state transitions (b). The condition codes, 
CC(2-0), are assigned as follows: CC2 is a refresh 
required; CC1 is a request from Processor 1; and 
ceo is a request from Processor 2. A refresh cycle 
differs from a regular memory cycle only in that 
/CAS isn't asserted. 

As shown in the transition diagram, A1 cycles 
upon itself if IS is H (or active low signal /IS is L), 
or if there's no request. It there's a refresh request 
and IS is L, the transition A1 to B1 occurs, initiating 
the cycle of states B1 through B5. This acknow- 
ledges the refresh request and generates the tim- 
ing required to carry out the refresh operation. 
Note that any time /IS occurs, the next transition is 
to A1. The same is true for the loops CI through C5 
(processor 1) and D1 through D5 (processor 2). 
Note also that at 85 if there's no request (and IS is 
L), the next transition is to A1. But if IS is L and C2 
is still true (another refresh request), the next tran- 
sition will be to B1. Furthermore, if CC± is L (no 
refresh request), IS is L, but CC1 is H (processor 1 
request), the transition will be to CI directly. 
Similarly, if IS, CC2 and CC1 are all L, and CCO is H, 
the transition will be directly to state D1. The tran- 
sition diagram shows a similar relationship of tran- 
sitions in the loops C1 through C5 and D1 through 
D5. There are 16 states, including 1 three-way, 12 
two-way, and 3 four-way states, which the Prose 
device can easily accommodate. 

Prose can also handle more complex configura- 
tions. Suppose there are four devices competing 



for bus and memory access. A problem arises in 
simply expanding the diagram (c) to accommodate 
the extra device. Doing this directly would require 
that A1 be a five-way branch. But the Prose is 
limited to four-way branching. 




PROCESSOR 1 PROCESSOR 2 



This difficulty is easily avoided, however, by in- 
serting another state, A2, between A1 and the tran- 
sition sequences for two of the requesting 
devices. Thus A2 serves as the embarkation point 
forthese transitions. Here, the condition codes are 
CC3 (refresh), CC2 (processor 1), CC1 (processor 2) 
and the CCO (an I/O port). 003 has the highest 
priority; COO has the lowest. Note, however, that 
D1 through DSaretheflveclockcyclesforthe refresh 
circuit. A refresh request always gets the highest 
priority but, since speed of execution isn't crucial for 
the refresh request, it's reached via A2. That is, the 
refresh request is never ignored, but it's adequate to 
service it through A2, allowing fasterservicing (by one 
clock cycle)ofprocessor1 orprocessor2 requests. 
A1goestoA2forOO3*/IS-HCO0*/OO2*/CC1*/IS;A2 
goes to D1 for 003 */IS. This configuration requires 
no one-way, 16 two-way, 1 three-way, and 5 four-way 
states— easily handled by Prose. 



ates the branch address bits for the memory (128 
words X 21 bits), which stores the system's next 
state addresses, device outputs and control func- 
tions. The shadow register is used for programming 
and diagnosis, and the output register provides syn- 
chronous operation and registered feedback. 

The PAL array is a 14H2-type: it has 14 comple- 
mentary inputs and two active high outputs. Each 
of the two outputs is a sum of eight product terms. 
Eight of the inputs, from an external source, are 
used for encoding of conditions for branch selec- 
tion. The other six inputs, from internal feedback 
from the PROM, are used for selection of the con- 
ditions that determine branching when decisions 
are involved in state transitions. 

The PROM consists of 128 21-bit words or loca- 
tions, each of which can be viewed as a state, or as 
containing state-related information. Eight bits at 
each location are intended as outputs. The other 13 
bits are intended as feedback: five are feedback to 



the PROM array itself for partially determining the 
next location to be addressed, and six are condition- 
select bits to the PAL array. They're used for prod- 
uct term selection/enabling. The remaining two 
bits are inputs to the exclusive OR gates. They're 
used to determine address bits A6 and A5 when 
branching isn't involved. 

When no branching is involved, CS(5-0) select no 
condition in the PAL array, so that no product 
term is wasted, and the PAL outputs are 
X1 = X2 = 0. Thus XFl and XFO are used to select 
the higher-order address bits — A6 and A5, in this 
case — and product terms are conserved for use 
when real conditional selection (branching) is in- 
volved. When conditional branching is involved, 
no more than four branches can be from one loca- 
tion, and all branch states will be in the same prima- 
ry location but in different quadrants. That is, 
address bit A(4-0) will be the same, and the condi- 
tions will toggle A6 and A5 to determine the exact 
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In the Prose device, the PAL ar- 
ray processes inputs to produce 
two decision variables that ac- 
cess the PROM array, which 
generates outputs from the ma- 
SDO chine and the next data address. 
Support logic is provided for 
serial scan access and diagnosis. 
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quadrant location of each of the possible next 
states. 

For diagnostic purposes, a shadow register 
breaks the data path. Data can be shifted in or out 
serially. The shadow register and the output register 
are operated by different clocks, so that data may 
be shifted without affecting device operation and 
spoiling data. This lets data (in this case, test vec- 
tors) be shifted into the shadow register and clocked 
onto the output register. A normal operation can 
then be performed, after which the data can be 
loaded onto the shadow register and shifted out to 
be analyzed. In addition, a multiplexer to the shad- 
ow register lets the register contain data from the 
output register or from the previous shadow regis- 
ter contents. 

Internally, programming of the Prose device 
takes place via the shadow register, is completely 
transparent to the user and is done individually for 
each half of the device. For programming the PAL 
part of the device, a certain pattern of data is shift- 
ed into the shadow register with Vcc raised to the 
programming supervoltage and the preset/enable 
pin also raised to the supervoltage afterwards. Indi- 
vidual fuses in the PAL device are blown and se- 
lected by the appropriate pattern. 

Programming of the PROM is accomplished in 
similar fashion. Vcc is raised to the supervoltage 
value; SDI is raised to the supervoltage afterwards. 
The bit pattern in the shadow register determines 
which fuse is to be blown. 



chine, the user inputs a list of outputs expected for 
each path to each state. (For a Moore state ma- 
chine, the outputs depend only on the state. For a 
Mealy machine, outputs depend on the inputs also.) 

User input to Palasm 2 (for state-machine entry) 
consists of four sections: declaration, state, equa- 
tions and simulation. The declaration section speci- 
fies the type of Prose device to be programmed. 
Currently, there's just one device in the family. 

The state section specifies the state transitions 
and the output details. The user specifies the type of 
state machine — Moore or Mealy — and the various 
defaults that should be used when the next state or 
the outputs can't be determined from the equa- 
tions. Furthermore, the user specifies state transi- 
tions and outputs via equations. The transition 
equations specify transitions from each current 
state. The output equations specify the outputs for 
each state, in the case of a Moore state machine, or 
the outputs expected upon the transition from one 
state to another, in the case of a Mealy state ma- 
chine. For the Moore machine, there's only one set 
of outputs for each state, but for the Mealy ma- 
chine, the outputs depend on the path to the state. 
The last section, the simulation section, contains in- 
structions that tell the software how to simulate ac- 
tual operation of the state machine and, therefore, 
how to test the device once it's programmed. CD 



Software support 

Prose state machines are fully supported by new 
programs in the Palasm suite — Proasm for assem- 
bly and Prosim for simulation. To program the 
Prose device, the user inputs a list of state names, 
transitions and conditions that cause transitions to 
subsequent states. When progamming a Moore ma- 
chine, the user also inputs a list of expected outputs 
for each state, and when programming a Mealy ma- 
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n the abstract, a computer, or at 
least parts of a computer, can be 
. defined by a state-machine di- 
rected-transition graph ( see Fig. 1 ) . 
A state machine is a small, locally 
controlled machine — not intended 
for a complete computation of a com- 
plex program. 

Experts generally divide state ma- 
chines into two basic classifications: 
algorithmic and synchronous-se- 
quential machines. Algorithmic ma- 
chines include the subcategoriei 
called Turing machines, Post ma- 
chines, and finite machines, which 
primarily describe the specific algo- 
rithm used in programming the com- 
puter. A basic difference between a 
synchronous sequential state ma- 
chine and an algorithmic machine is 
the presence or absence of stop 
states. A start state, of course, must 
be present in all machines or else 
they will not be initiated. 

A stop state in an algorithmic ma- 
chine (like a Turing) occurs when 
it has finished executing an algo- 
rithm. In a synchronous sequential 
machine, however, to achieve a stop 
state, the machine must branch to it- 
self — a type of wait state — until an- 
other operation is ready. The only 
time a synchronous sequential state 
machine may be actually halted is 
for an error or illegal command that 
forces it into looping on itself uncon- 
ditionally. But once in such a state, 
the state machine theoretically can't 
exit from it. The machine must be 
reset and started over again. 

Implementing a state machine 

Although devices such as PAL, 
PLA, and FPLS arrays can be used 
to implement such a state machine, a 
programmable sequencer (PROSE) 
device, designated PMS14R21, is a 
convenient, flexible, and economical 
way. It consists of a combination 
of a special PAL and a registered 
PROM that are optimized for use 
in local, intelligent-control state- 
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Fig. 2. The PMS14R21 programmable se- 
quencer has a special PAL array joined to 
a registered PROM by two OR gates. 



machine design (see Fig. 2) . 

The introduction of the PROSE 
device is a significant advance in the 
design of fuse -programmable state 
machines of small to middling size. 
The PAL array serves to select tran- 
sition conditions, and the PROM 
stores the state assignments. Also, 
two exclusive-OR gates help reduce 
the total number of states and prod- 
uct terms a state machine would 
need. One terminal (I/E) can be 



Fig. 1. Parts of a computer can be represented by a graph of a state machine 
showing Its directed transitions between states. An algorithmic machine (a) 
has stop state(s), whereas a synchronous machine (b) does not. 

good many other state machines. 
Also, timing restraints on external 
devices are greatly relaxed. Perhaps 
most important of all, the PROSE 
device provides a small, single-pack- 
age solution to state-machine design 
(for example, 24-pin, 3-in. Skinny- 
DIPs and 28-pin plastic leaded chip 
carriers) . 

Moore and Mealy models 

Two categories of state-machine 
models are generally implemented: 
the Moore and the Mealy state ma- 
chines. The Moore machine's out- 
puts depend only on the state; they 
are independent of inputs. A Mealy 
machine's outputs, however, depend 
on both the state and the machine's 
inputs. The Mealy, therefore, is a 
superset of a Moore state machine. 

A true Mealy state machine is too 
versatile to be defined in a general 
architecture. A Moore machine is 
much easier to define and simpler 
to implement. The simpler the state 
machine is, the faster it will be. On 
the other hand, for most applica- 
tions, the outputs of a state machine 
depend somehow on the inputs as 
well as on the state. 

The PROSE device can support 
the Moore model. Fortunately, the 
Mealy model can also be supported, 
provided that the PROSE device's 
output pattern can be pipelined by 
one clock cycle. Also, the number of 
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programmed to receive an asynchro- 
nous initialization signal (one that 
resets all registers to the high state) 
or an output enable (via an architec- 
tural fuse); others, to enable fault 
diagnosis. A power-up reset feature 
is also provided. 

Each PROSE device has eight in- 
puts, eight outputs, up to 128 states, 
and up to a four-way branching ca- 
pability. Many local control func- 
tions can be done faster than on a 



2-608 



MonolHMc 



m 



Memories 



Designing a State Machine with a Programmable Sequencer 



combinations of patterns in the sub- 
sequent states, and the state thus 
pipelined, must not exceed four. 

Controlling a state machine 

Naturally, a state machine must 
include a control system in addition 
to the operating portion outlined so 
far. The control system determines 
the next state on the basis of the cur- 
rent state plus a combination of in- 
puts to the control system. Of course, 
based on its state, the operating part 
of the system determines the outputs 
of the machine. 

The number of inputs, to a certain 
extent, determines the number of 
conditions (as inputs or combina- 
tions of input literals) from which 
the state machine can select. It can 
also determine whether encoding the 
conditions is necessary, which then 
becomes part of any external delay. 
A small, local, intelligent, arbitration 
unit may have as few as three to four 
states to as many as 30 to 40 states. 
A highly intelligent controller can 
have thousands of states. However, 
a state machine generally has a rela- 
tively small number of states to 
which branching occurs, because the 
latter states will eventually loop to 
each other (except maybe for the few 
starting states) . 

Conditional branching needed 

Because a state machine switches 
its states on the basis of both the 
current condition and inputs, fre- 
quently conditional branching is 
necessary, sometimes to more than 
two next states. Four-way branching 
is common. If more than four-way 
branching is required, up to 16-way 
branching can be carried out by two 
cycle; of four-way branches (see 
Fig. 3). 

But in applications that operate 
on single-cycle instructions, such a 
branch-fanout tree may not be de- 
sirable. Such single-cycle multiple 
branching can be done provided the 
machine allows losing a cycle for the 
fanout and has enough product terms 
to perform the branching on all the 
conditions. (A PAL or PLA device 
has a limited number of available 
product terms.) 

Similarly, the number of outputs 




Fig. 3. A state fanout tree of up to a 
16-way branch can be carried out In 
two cycles of four-way branches. 
This operation requires the state ma- 
chine to lose a cycle for the fanout 
and to have enough product terms 
available to perform the branching 
for all the conditions. 

determines whether encoding of op- 
erations control is necessary and 
how many operations are possible. 
Again, decoding of control signals 
externally will mean extra delay. In 
a single-device state machine, since 
a limited number of pins is available, 
the ratio of the number of outputs to 
the number of inputs is very critical. 
This ratio determines the applica- 



tion range of the device. 

The PROSE state machine uses a 
PAL array, designated 14H2, instead 
of a simple condition multiplexer, to 
avoid limiting the choice of possible 
transition conditions to just one in- 
put at a time. Rather, input literals 
at any time can be combined so that 
transition-condition encoding is not 
needed. The eight inputs (Iq to I7) 
to the PAL device are condition sig- 
nals. Together with the six addi- 
tional inputs (CSq to CSg), which 
carry condition-select feedback sig- 
nals, they determine the next state. 
These feedback signals serve only to 
select the conditions for branching. 

The six condition-select feedback 
signals to the PAL array simplify the 
selection of the transition conditions. 
Also, combinations of conditions to 
minimize product terms can be em- 
ployed. Since the output registers 
are all initialized to high states, and 
the next state after an initialization 
has to be known (state 127 for this 
device), all product terms in the 
PAL array must be disabled with 
this pattern (all high) to perform 
unconditional branching. Therefore, 
this pattern is reserved for disabling 
all product terms for unconditional 
branching. 

The 14 PAL inputs are comple- 
mented internally to help program 
two eight-product-term OR gates at 
each of two active-high outputs (Xq 
and Xi ) . The conditions controlling 
the next states must fit within these 
limits. The outputs of the PAL feed 
exclusiue-OR (EXOR) gates, which 
help conserve the need for state and 
product terms. 

To perform an unconditional, or 
one-way branch, to save a product 
term, Xq and Xj are at GO. For ex- 
ample, the new PROM address has 
A5 and Ag equal to 10 (binary) . Ac- 
cordingly, the EXOR feedback sig- 
nals XFq and XF^ are programmed 
as IO2 to force the EXOR outputs to 
10 unconditionally. 

For initialization, the CSq to CS5 
feedback pattern for an uncondi- 
tional branch is 111111. Thus, the 
operational feedback of CSq to CSg 
to the PAL must contain at least 
one — that is, patterns like 111111, 
11X1X1, or XXIXXX should not 
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A6 



A6 = 



A6 = 1 



A5< 



A5 = 



A5 = 1 






1 
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5 


6 


7 


64 


65 


66 


67 


68 


69 


70 
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10 
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103 


40 


41 


42 


43 


44 


45 


46 
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1 PRIMARY LOCATION 
be used to select any product term. 

Organization of the PROM array 

The PROM array is a fuse-pro- 
grammable array of 128 locations by 
21 outputs. Eight primary outputs 
are directly available (Qq to Q^). 
The actual functions are defined by 
the user. In addition, 13 more out- 
puts serve as feedback signals to help 
determine the next state. All the out- 
puts are registered at the rising edge 



Fig. 4. An absolute address map (a) 
and a primary location map (b) show 
flow a location wittiin eacfi of four 
PROM quadrants tias tfie same pri- 
mary location. If two states branch 
to the same state (c), that state 
would have to be duplicated if all the 
resulting next states exceed four. 



of the CLK input signal. 

The primary locations in the 
PROM are addressed by five inputs, 
Aq to A4, which are fed back directly 
from the PROM registered outputs 
to the PROM array. These 5 bits de- 
termine the next primary location. 

Therefore, 32 primary locations 
define the states. The remaining two 
address inputs to the PROM (bits 
A_- and A^ ) , which the PAL array 
and the exclusive-OR gates generate, 
address four PROM quadrants, in 
each of which branching can be per- 
formed to any of its 32 primary lo- 
cations (see Fig. 4a). 



Potentially, each of the 128 loca- 
tions thus addressed could define a 
difTerent state. In actuality, the num- 
ber implemented is usually fewer. 
For example, some states may need 
to be duplicated in difTerent primary 
locations. Consider a state A whose 
next states will be B, C, and D, and 
a state E whose next states will be 
D, F, and G. Although one next state, 
D, is common to the two conditional 
transfers, putting B, C, D, F, G with- 
in the same primary location (within 
the four quadrants) is impossible, 
since a primary location can accom- 
modate only four next states in all 
(see Fig. 4b). Therefore, two pri- 
mary locations must be occupied 
with the state D and its clone. 

Another example requiring dupli- 
cation of states occurs even when a 
state A branches to states B and C, 
and state D branches to states C, E, 
and F — state C being common (see 
Fig. 4c) . Although the total number 
of next states is four, if insufficient 
product terms are available, the 
states of A and D may still need to 
bo duplicated to achieve that result. 

In the worst case, if every next 
state had to be duplicated because 
of incompatible combinations, or if 
all states had four-way branches, the 
PROSE state machine could still ac- 
commodate 32 states. Each state 
would have to be pointed to a differ- 
ent primary location. 

Registers for tiie PROIVI 

To conserve terminal pins, the 13 
bits of feedback signals from the 21- 
bit output register are buried within 
the PROSE chip (see Fig. 5) . What 
is called a shadow register, together 
with a diagnostic-on-chip (DOC) 
scheme, enables testing various parts 
of the chip, including the buried-sig- 
nal conditions. 

The shadow and output registers 
(see Fig. 6a) are operated by differ- 
ent clocks — DCLK and CLK, re- 
spectively — in several modes with 
the MODE and SDI signals (see 
Table 1) . The output registers can 
multiplex data from the shadow reg- 
ister (to receive a test vector) or 
from the PROM array (to receive a 
test result). Similarly, the shadow 
register can multiplex data from the 
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output register (for the test result) ; 
the previous shadow-register bits (to 
shift in a test vector and/ or shift 
out a test result) ; and from the cur- 
rent shadow-register bits (as just a 
hold operation). 

The DOC-register concept also 
solves the programming issues for 
the PROSE device. A pattern in the 
shadow register identifies the fuse 
to be blown (sec Fig. 6b) . Bits PTg 
to PTj select one product term out 
of eight from each of the PAL's sum 
of terms. Of these, bits CSq to CSg 
and Iq to I7 select one input to the 
PAL array. The bit labeled 0/P se- 
lects a smn term, and the bit T/C 
selects either the true or complemen- 
tary value of the input selected. 

Similarly, the fuse to be blown in 
the PROM is identified by the pat- 
tern in the shadow register (see Fig. 
6c) . Aq to Ag is the absolute address 
for the PROM array, and OAq to 
OA^ selects 1 bit out of the 21 bits 
in a PROM word. 

Applications abound 

The PROSE device is designed to 
be used as a general-purpose state 
machine for controlling equipment 
or for data manipulation. The 128 
states and four-way branch capabili- 
ties make it possible to be used as a 
high-speed memory or bus arbitra- 
tor, a video-RAM arbitrator, an in- 
dustrial traffic controller, a signal- 
processor sequencer, a sequencer for 
mass storage or communications, a 
data encoder and decoder, and for 
many other applications. For exam- 
ple, the PROSE state machine can 
implement the run-length-limited 



Sm (FROM PREVIOUS BIT) 



Di (NORMAL DATA) 



MODE- 



CLK- 



SHADOW REGISTER 
MULTIPLEXER 



fv SHADOW 

/ REGISTER 



-►Si 



OUTPUT REGISTER 
MULTIPLEXER 



OUTPUT REGISTER 



Fig. 5. The block diagram for a diagnostic register bit includes both a shadow 
register and an output register. Thus a test vector and the test result can be 
shifted in and out of the shadow register without corrupting the output flow. 



code, designated 2,7 RLL (see Table 
2). It is popular in mass-storage 
equipment and can increase the ca- 
pacity of such a device by approxi- 
mately 50%. 

In any mass-storage media, a min- 
imum separation of at least one zero 





Table 1. 


Functions of the Diagnostic Registers 


IVIODE 


SDI 


CLK 


DCLK 


Shadow 
register 
(SREG) 


Output 
register 
(OREG) 


Operation 


L 


X 


C 


• 


hold 


OREG,-<PROM 


normal 


H 


X 


c 


* 


hold 


OREGi<SREGi 


load from SREG 


L 


X 


* 


c 


SREGi<SREGi-1 
SREGo-<SDI 


hold 


shift SREG 


H 


L 


* 


c 


SREGi<OREGi 


hold 


load to SREG 


H 


H 


* 


c 


hold 


hold 


no operation 


L = zero 
H = one 
X = don't car 


e 


C = clock pulse (low-to-high transition) 
* = stable clocl< or high-to-low transition 
i = bit number 





between adjacent logic ones is re- 
quired. In the 2,7 RLL coding/ de- 
coding system, at least two zeros 
must exist between any two adjacent 
ones. (Thus encoded patterns like 
11 or 101 are illegal because there 
are fewer than two zeros between ad- 
jacent ones.) 

Since the distance between ones 
determines the flux transition fre- 
quency on the magnetic media, a 
minimum of two zeros between ad- 
jacent ones means that flux transi- 
tions are at least three bit intervals 
apart. Therefore, the encoded flux 
density becomes 3/X (a binary 100 
in the space X, the minimiun on the 
medium between transitions) . Other 
schemes like MFM, however, require 
a density of 2/X (a binary 10 in the 
minimum space X). Accordingly, 
since three encoded bits occupy the 
space on the media that carried 2 
bits before encoding, the encoded 
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density is 3/2, or 1.5-times the orig- 
inal density. (Additionally, to syn- 
chronize data, a maximum of seven 
zeros can be present between any two 
adjacent ones.) 

Starting from the INIT, or reset 
state, the encoder/ decoder first car- 
ries out a three-way branch — to per- 
form encoding if the signal ENC is 
received, or to perform decoding if 
the signal DEC is received, or else 
to itself. Once the state machine 
starts, it continues on its own and 
will be stopped when the machine 
receives a reset signal at the initiali- 
zation input pin. 

Thus, in its initial state, the 
PROSE machine loops on itself until 
a control signal, CTj , goes low. Then 
a signal CTq must be set to com- 
mand either an encode or decode 
operation. An instruction (/CTq/ 
CTj), therefore, forces a jump to 
Ago, which starts the encoding op- 
eration of the state machine (see 
Fig. 7a) . Data inputs serially, before 
encoding, as M, and outputs encoded 
serially via Qg . 

The Qy port provides an output 
for decoding; it is not used during 
encoding. Tlierefore, in the states in 



Table 2 
An IBM 2,7 RLL Code Table 



Input pattern 


Output pattern 


000 


000100 


10 


0100 


DIG 


100100 


0010 


00100100 


11 


1000 


Oil 


001000 


0011 


00001000 



which encoding is performed, the la- 
bels OX and IX in Fig. 7b refer to the 
output bits of Q^ and Qq, respec- 
tively, where X (for the Q^ output) 
is a don't-care condition. 

With PROSE running at twice the 
uncoded data rate and employing a 
four-stage pipeline, if the input pat- 
tern for M is 

1111111100011000101 
10 11 

the encoded data output at Qg will be 

1000100010001000000100100000010 
00100010001001000... 

Similarly, the instruction (/CTq/ 
CT^) forces a jump to Cqq, which 
starts the decoding operation. Dur- 



ing decoding, Qg is not used, but 
now Q-j is the decoded serial data 
output. Therefore, the labels for the 
output pattern of Q are either OX 
or IX, where again X stands for a 
don't care. 

For example, if the input pattern 
before decoding is the serial data in- 
put N: 

1000100010001000000100100000010 
0010001000100..., 

the output pattern at Q^ will be 

11111111000110001010.... 

with each bit appearing twice, since 
all legal data is clocked twice. 

In a practical design, if erratic pat- 
terns occur in the decoded data 
stream, the encoder/ decoder should 
neither correct the data nor stop, but 
should make an intelligent guess at 
the error's probable nature and con- 
tinue without holding up the data 
stream. However, in this example, 
illegal patterns have been made to 
enter an error state, because this sit- 
uation is much easier to follow for 
illustration purposes. 



BIT I 



(a) 



3 



5 
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AO 


Al 


A2 


A3 


A4 


Qo 


Ql 


Q2 


Q3 


Q4 


Q5 


Q6 


Q7 


CSo 


CSi 


CS2 


CS3 


CS4 


CS5 


XFo 


XFi 



v 

PROM ARRAY 
ADDRESS 
FEEDBACK 



V 
OUTPUT PATTERN 



V 
PAL ARRAY 
FEEDBACK 



BIT 1 



(b) 



10 11 12 13 14 15 16 17 18 19 



PTO 


PTi 


PT2 


T/C 




















CSo 


CSi 


CS2 


CS3 


CS4 


CSj 


0/P 



~\r 



^A/-^ 



PRODUCT-TERM CONTROL 
CODE 



V 
UNUSED 



V 
PAL ARRAY 
FEEDBACK 



BIT 1 



(C) 



10 11 12 13 14 15 16 17 18 19 20 



Ao 


Al 


A2 


A3 


A4 








OAo 


OAi 


0A2 


OA3 


OA4 














A5 


A6 



V 


V 


V 


PROM ARRAY 


UNUSED 


PROM DATA 


ADDRESS 




BIT CODE 


FEEDBACK 







MOST- 
SIGNIFICANT 
PROM ADDRESS BITS 



Fig. 6. Bit assignments for the stiadow and output registers during normal operation (a) are 
quite different from tfie sfiadow and tfie output registers during PAL array programming (b) 
end from tfie sfiadow and tfie output registers during PROM array programming (c). 
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One three-way branch (INIT) , 53 
two-way branches, and 15 uncondi- 
tional branches occur in the 2,7 
RLL. The three-way branch will oc- 
cupy three quadrants of one primary 
location. The 53 two-way branches 
can be merged into 40 two-way 
branches and placed into 20 primary 
locations merely by combining with 
two identical next states. The 15 im- 
conditional branches can be merged 
into 8, which will occupy two pri- 
mary locations. Thus, the total num- 
ber of occupied primary locations is 
1 -f 20 -f 2 = 23, which is well under 
the maximum of 32. This state-mini- 
mization approach is primitive and 
merges out only the states whose sets 
of next states are the same or subsets 
of other remaining states. 

The PMS14R21 PROSE state ma- 
chine will be available in April for 
$25.95 each in quantities of 100. D 



(ENCODE) 



INIT (RESET STARTUP STATE) 
/CTf/CTo I /CTi'CTo 




Fig. 7. The state-machine diagrams of a 
2,7 RLL encoder (a) and Its decoder (b) 
are a typical example of the possible ap- 
plication of the PMS14R21 device. 
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THE DESIGN FILE 

The following specification was written in the PALASM® 2 for- 
mat, and can be compiled using the PALASM 2 and PROASM'" 
software packages developed by Monolithic Memories. 

There are two corrections to the said article : 

1) Table 2, the output pattern for input 010 is 100100. 

2) Table 2 (same table), the output pattern for input 0010 is 
00100100. 

; This specification was written by Frank Lee with help from 
; Jo-NingTa 



Header 



Title 


14R21.G17 


Pattern 


27rll.eg 


Revision 


03 


Author 


Frank Lee 


Company 


MMI 


Date 


3/6/87 


Chip 




CHIP PROSE-27RLL PMS14R21 



; Pin Definition 

CLK DCLK CTO CTl N M NC NC NC NC SDI GND 
/I SDO Q7 Q6 NC NC NC NC NC NC MODE VCC 

; String Definition 

; Used in section on output patterns 

STRING CQS VQ7*Q6' 

STRING SQ6 '/Q7* Q6' 

STRING SQ7' Q7VQ6' 
; State machine type : Moore machine, output pattern is dependent 
; on current state only. 
; Feature or architecture pin is used as reset (or initialization) pin. 

STATE MOORE_MACHINE MASTER_RESET 
; This design spec starts with description for state transitions, fol- 
; lowed by output patterns, and conditions for the branches. 

; State transitions 
; Power-up state 

; The PROSE device is initialized to all I's. On the first clock, it will 
; perform unconditional branch to state INIT 
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POWER UP: = VCC -> INIT 

The PROSE will stay at INIT until there is a command for it to 
start encoding (condition ENC) or decoding (DEC). AOO is the 
starting state for encoding and COO is the starting state for de- 
coding. 
INIT := ENC — > AOO + DEC — > COO + -> INIT 

Encoder design 

A00,A10,...,B00,... are states for the encoder, as defined in fig. 7. 
EIN is defined in the condition section. It is the same as input M. 
VCC is an always-true condition. 
AOO will branch to All on EIN, else to AlO 



AOO 
; AlO 
AlO 
All 
A20 
A21 
A30 
A31 
A32 
ASS 
A40 
A41 
A50 
A51 
A52 
ASS 
A60 
A70 
ATI 
BOO 
BIO 
Bll 
B20 
B21 



= EIN — > All +-> AlO 

always branches to A20 

=.VCC->A20 

==VCC->A21 

= EIN — > ASl H — > A30 

= EIN -> A33 + -> A32 

= VCC— >A40 

= VCC->A41 

= VCC->AOO 

= VCC -> BOO 

= EIN — > ASl H — > ASO 

= EIN — > ASS H — > AS2 

==VCC->AOO 

=-VCC->A60 

= VCC->A00 

= VCC->BOO 

= EIN — > A71 H — > A70 

= VCC->A20 

= VCC->A20 

= EIN — > Bll H — > BIO 

= VCC -> B20 

= VCC -> B21 

= EIN — > ASl H — > ASO 

= EIN — > ASS -] — > A32 



Decoder design 

C00,C10,...,D00,...,E00,... are states for the decoder, as defined in 
fig. 7. 

DIN is defined in the condition section. It is the same as input N. 
COO will branch to Cll on DIN, else to CIO 
COO := DIN — > Cll H — > CIO 
CIO := DIN — > C21 +— > C20 
J ERR is the error state for an illegal encoded pattern received. 



Cll 
C20 
C21 
C22 



= DIN — > ERR +-> C22 
= DIN — > CSl -\ — > CSO 
=. DIN — > ERR H — > CS2 
= DIN — > ERR H — > CSS 
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C30 : 


= DIN 


C31 : 


= DIN 


C32 ■ 


= DIN 


C33 


z:=DIN 


C40 


= DIN 


C41 


= DIN 


C42 


= DIN 


C43 


= DIN 


C50 


==DIN 


C51 


= DIN 


C52 


= DIN 


C53 


= DIN 


C60 


= DIN 


C61 


= DIN 


C70 


•=DIN 


C71 


.= DIN 


DOO 


— DIN 


DIO 


:^DIN 


Dll 


:=DIN 


D20 


:=:DIN 


D21 


:=DIN 


D22 


:=DIN 


EOO 


= DIN 


ElO 


= DIN 


Ell 


= DIN 


E20 


= DIN 


E21 


= DIN 


E22 


:=DIN 


E30 


: = DIN 


E31 


.=3 DIN 


E32 


:=DIN 


E33 


:=DIN 


E40 


:=DIN 


E41 


:=DIN 


E42 


:=DIN 


E43 


: = DIN 


E44 


:=DIN 


E45 


:=DIN 



->C41+- 
-> ERR + 
-> ERR + 
->C43H — 
->C50+- 
-> ERR + 
-> ERR + 
-> ERR + 
-> ERR + 
-> ERR + 
->C61+- 
->ERR + 
->ERR + 
->ERR + 
-> ERR + 
->ERR + 
->D11+- 
->D21+- 
-> ERR + 
->C31+- 
->ERR + 
->ERR + 
->E11+- 
->E21+- 
-> ERR + 
->E31 +- 
->ERR + 
->ERR + 
->E41+- 
-> ERR -f 
->ERR + 
->E45 +- 
->C50+- 
-> ERR + 

->ERR + 
->D11+- 

->E11+- 
->ERR + 



>C40 
->C42 
>D00 
>E00 
>ERR 
->C51 
->C52 
->C53 
->C60 
->C00 
->E00 
->D00 
— >C70 
->C71 
— >E00 
->D00 
->D10 
->D20 
— >D22 
->C30 
->C32 
->C33 
->E10 
->E20 
— >E22 
->E30 
->E32 
->E33 
->E40 
->E42 
->E43 
->E44 
->ERR 
->C51 
->C52 
->D10 
->E10 
->C53 



; Error stay until initialized 
ERR :=VCC->ERR 



;STAY UNTIL RESET 



<state>.OUTF refers to the output pattern for the <state>. 
CQS, SQ6, and SQ7 are output patterns. CQS means output bits 

6 and 7 are all O's. SQ6 means output bit 6 is 1 and output bit 

7 is 0. SQ7 means output bit 7 is 1 and output 6 is 0. In all 
cases, output bits 0-5 are all don't cares as far as this design 
is concerned. 
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dTI 



INIT.OUTF 


-CQS 


AOO.OUTF : 


= CQS 


AIO.OUTF : 


-CQS 


All.OUTF : 


= CQS 


A20.OUTF : 


= SQ6 


A21.0UTF : 


-:SQ6 


A30.OUTF : 


= CQS 


A31.0UTF : 


= CQS 


A32.0UTF : 


= CQS 


A33.0UTF : 


= CQS 


A40.OUTF ■ 


= CQS 


A41.0UTF 


= CQS 


A50.OUTF 


= CQS 


A51.0UTF 


= CQS 


A52.0UTF 


= SQ6 


A53.0UTF 


= CQS 


A60.OUTF 


= CQS 


A70.OUTF 


-SQ6 


A71.0UTF 


= CQS 


BOO.OUTF 


= CQS 


BIO.OUTF 


^SQ6 


Bll.OUTF 


= SQ6 


B20.OUTF 


= CQS 


B21.0UTF 


= CQS 


COO.OUTF 


==CQS 


CIO.OUTF 


= CQS 


Cll.OUTF 


= CQS 


C20.OUTF 


= CQS 


C21.0UTF 


= CQS 


C22.0UTF 


==CQS 


C30.OUTF 


= CQS 


C31.0UTF 


=:CQS 


C32.0UTF 


= CQS 


C33.0UTF 


.= CQS 


C40.OUTF 


:=CQS 


C41.0UTF 


:=CQS 


C42.0UTF 


:=CQS 


C43 OUTF 


:=CQS 


C50.OUTF 


:=CQS 


C51.0UTF 


:=CQS 


C52.0UTF 


:=CQS 


C53.0UTF 


:=CQS 


C60.OUTF 


:=CQS 


C61.0UTF 


:=CQS 


C70.OUTF 


:-CQS 


C71.0UTF 


:-CQS 


DOO.OUTF 


: = CQS 


DIO.OUTF 


:=:SQ7 


Dll.OUTF 


: = SQ7 


D20.OUTF 


: = SQ7 


D21.0UTF 


: = SQ7 


D22.0UTF 


: = SQ7 
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EOO.OUTF: 


= CQS 


EIO.OUTF- 


= SQ7 


Ell.OUTF: 


= SQ7 


E20.OUTF: 


= SQ7 


E21.0UTF: 


= SQ7 


E22.0UTF: 


= SQ7 


E30.OUTF 


= SQ7 


E31.0UTF. 


= SQ7 


E32.0UTF- 


= SQ7 


E33.0UTF 


: = SQ7 


E40.OUTF 


: = SQ7 


E41.0UTF 


. = SQ7 


E42.0UTF 


: = SQ7 


E43.0UTF 


:-:SQ7 


E44.0UTF 


: = SQ7 


E45.0UTF 


:=SQ7 


ERR.OUTF 


r-CQS 



CONDITIONS 

The conditions for state transitions 

; encode command 
; decode command 



ENC = /CTIVCTO 
DEC = /CT1* CTO 



EIN = M 
DIN = N 



; encoder input (original NRZ data) 
; decoder input (encoded data) 



; SIMULATION 

; Simulation vectors not .shown here 
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Designer's Guide to 
VME Bus Control — Part 1 



By using fuse-programmable controllers (FPCs) and 
PLDs, you can implement VME Bus control in your 
computer system with a minimum of hardware. Just 
a few chips per plug-in module, and a bus arbiter, 
can perform all bus-control functions. This article, 
Part 1 of a 2-part series, describes the bus-arbitration 
process and control functions and shows you how to 
implement the VME Bus protocol in an FPC. Part 2 
unll describe the implementation of slave controllers, 
discuss interrupt handling, and provide tools for pro- 
gramming the FPC. 



Arthur Khu, Advanced Micro Devices 



Until recently, you needed many ICs — sequencers, mi- 
croprogrammed control stores, and other MSI/LSI 
chips — to implement V^ 3us control in a computer 
system. Now, however, you can use a minimum of 
hardware to handle the bus's intermodule communica- 
tion functions. Just a few fuse-programmable control- 
lers (FPCs) and programmable-logic devices (PLDs) 
relieve the CPU of all bus-control functions. 

A typical VME Bus-based computer system com- 
prises one or more master modules (eg, CPU boards), 
one or more slave modules (eg, cache/memory boards), 
a bus arbiter, and interrupt-handling circuitry. A mas- 
ter initiates a data transfer to a slave by requesting 
control of the data bus from the bus arbiter. Once bus 
control has been granted to a master, the master and 
slave exchange control signals according to predefined 



protocols that guarantee an orderly transfer of data 
between the communicating modules. The interrupt- 
handling circuitry services all interrupt requests. 

By using the streamlined design in Fig 1, you can 
implement most intermodule communication in a VME 
Bus-based system by using just two types of VLSI 
devices: the Am29PL141 fuse-programmable device 
and the AmPAL22V10 programmable-logic device. For 
the remainder of the bus-control functions, you'll need 
some bus-arbitration circuitry, which must occupy a 
particular slot on the VME Bus backplane, but which 
can reside on the same board with the bus master of 
highest priority. 

When you design VME Bus control into your system, 



C 




^ 



Fig 1 — You can implement VME Bus control in a computer system 
by using just a few fuse-programmable controllers (FPCs) and 
programmable-logic devices. 



Reprinted from EDN, October 2, 1987 

© 1987 CAHNERS PUBLISHING COMPANY 
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Ton can implement VME Bus control in 
your computer system by embedding the 
bus-control functions in state machines that 
comply with the VME Bus protocol. 



your first step is to consider the VME Bus protocols. 
These protocols specify the steps your circuitry must 
take to perform any bus-related operation, such as the 
transfer of data between two modules. You can describe 
these protocols by means of flow diagrams that show 
how the various interface signals reflect the interac- 
tions between the communicating modules. Fig 2, for 
example, shows how the priority bus arbiter resolves 
simultaneous bus requests from two modules that use 
the same request line. 

Next, you analyze the flow diagrams to pick out the 
functions that can be incorporated into FPCs or PLDs, 
and you define state machines for those functions. You 
can describe the state machines abstractly, by Boolean 
logic equations, or diagrammatically, by means of 
standard flow-chart symbols (eg, rectangles to repre- 
sent processes and diamonds to represent control-flow 
decisions). Finally, you must write programs for the 
state machines in a high-level or assembly language. 
You repeat this process to design each of the four main 
types of VME Bus controllers: bus arbiters, masters, 
slaves, and interrupt handlers. 

Before any master module can perform a data trans- 
fer, it must request control of the data bus from the bus 
arbiter; the arbiter must check to see whether the bus 
is free and must resolve any contention between two 
masters of equal priority. For example, consider a 
priority-option bus arbiter implemented on a single 
PLD. The arbiter will monitor the four bus-request 
lines (BRim) and assign the highest priority to BR3. 

As the flow diagram (Fig 2) shows, the arbiter 
grants the bus to the requesting module that's using 
the highest active request line, which is BRi in this 
example. To enable your bus arbiter to resolve simulta- 
neous bus requests from two or more modules that use 
the same request line, you must daisy-chain the associ- 
ated bus-grant signal to all the devices using that 
request line. Therefore, the arbiter must be in the first 
slot of the VME Bus system. The module that's closest 
to the bus arbiter will have the highest priority: If it 
requests the bus, it will receive the bus grant and lock 
out any modules farther down the chain (Listing 1). 
The AND/OR array of the PLD processes all bus 
requests in parallel, so that priority arbitration is 
complete in only one clock cycle. 



Priority arbitration options 

Sometimes, the arbiter must force the current bus 
master to relinquish control of the bus; this situation 
occurs, for example, when a bus master of higher 



LISTING 1 

IF (/BBSV(BR0+BR1+BR2+BR3)) THEN "if bus not busy and a 



BEGIN 

IF (BR3) THEN 

BG3IN := 1 ; 
IF (/BR3-BR2) THEN 

BG2IN := 1; 
IF (/BR3*/BR2'BR1) THEN 

BG1IN:- 1; 



IF (/BR3VBR2*/BRrBR0) THEN 

BGOIN:= 1 ; 
END; 



request line is active " 

"if BR3 is active, grant " 

" bus to device on BR3 " 

"activate bus grant " 

" daisy chain 2 " 
"if BRI is active and BR3" 

" and BR2 are not, then " 

" grant bus to device " 

" using request line 1 " 
"BR0-3andBG0INto 

" BG3IN are active low " 



IF (/BBSVBGIIN) THEN "if BG1IN is asserted in response to a request " 

BGIIN := 1; "over line BR1, then continue asserting BG1IN " 

"until requesting device drives BBSY active * 



LISTING 2 

IF (BBSY-(BR0+BR1+BR2+BR3)) THEN 
BEGIN 
IF(BR3-(/(MASTER[1:0J = 3) ) ) THEN 

' BCUR :- 1; . "assert BCLR if MASTER <> 3" 
IF(/BR3*BR2*(/(MASTER(1:01 = 3) ) ) THEN 

BCLR := 1; 
IF (MASTER[1:01 » 0) THEN 

BCLR:= 1; 
END; 



priority initiates a bus request. The arbiter examines 
the priorities of both the current bus master and the 
new requester. If these conditions meet the predefined 
bus-clear conditions, the arbiter asserts the bus-clear 
signal (BCLR). 

In the design in Fig 1, the bus arbiter keeps track of 
the current bus master's priority by recording the 
bus-request line that was used to gain control of the 
bus. For example, if the current bus master used BR2 to 
gain control of the bus, it sets two output registers 
called Bus_Master to the binary valu e 2. Eit her of the 
two following conditions will activate BCLR: 

• The value in Bus_Master is 2, 1, or and the 
active bus request line is 3 or 2. 

© The value in Bus_Master is 0, and any bus- 
request line is active. 

When the value in Bus-Master is 3, the arbiter will 
not honor any bus requests until the Bus Busy line 
(BBSY) is high. Listing 2 gives the logical e xpress ion of 
these conditions. Once having asserted BCLR, the 
arbiter holds this line in the active state until the 
current bus master releas es BBS Y. 

Be sure to define the BCLR in such a way that 
uninterruptible devices can use bus-request line 3, 
which has the highest priority. Devices that temporari- 



2-620 



^ 



Monolithic 



m 



Memories 



FPCs and PLDs Simplify VME Bus Control 



LOCATED IN SLOT 2 



REOUESTER B 



DRIVE (DEVICE 
WANTS BUS) HIGH 



U 



DETECT (DEVICE 
WANTS BUS) HIGH 
DRIVE Br T LOW 



LOCATED IN SLOT 1 



REQUESTER A 



ARBITRATION 
IN PROGRESS 



DRIVE (DEVICE 
WANTS BUS) HIGH 



DETECT (DEVICE 
■ WANTS BUS) HIGH 
DRIVE §RT LOW 



DETECT SGTiR LOW 
DRIVE BBSY LOW 



DETECT Sfr; LOW 
DRIVE BGIIN low 
I 



I OF DATA-TRANSFE 



MASTER A HAS CONTROL | 
OF DATA-TRANSFER BUS I 



lARBITRATION | 
I IN PROGRESS I 



DETECT BGIIN LOW 
DRIVE BBS? LOW 

I I 



RELEASE B^l 

DRIVE (DEVICE GRANTED 

BUS) HIGH 



DETECT (DEVICE 
GRANTED BUS) HIGH 

PERFORM DATA TRANSFERS 

I 

DRIVE (DEVICE 

WANTS BUS) LOW 



RELEASE BR1 DETECT SBSY LOW 

■ DRIVE (DEVICE DRIVE BGTTN HIGH 

GRANTED BUS) HIGH I 



DETECT B5l IN HIGH 



DETECT (DEVICE 
WANTS BUS) LOW 
RELEASE I 



I ^ 



DRIVE (DEVICE DETECT BBS? HIGH 

GRANTED BUS) LOW I 



DRIVE BCJi IN LOW 



DETECT (DEVICE , 

GRANTED BUS) LOW if 

DETECT BSTiN LOW 
BSTTIsl ffSTCJUf DRIVE eyOUT LOW 

"V\ DAIRY r.HAIfJ VV 



DAISY CHAIN 



• nn 



n 



DETECT (DEVICE GRANTED 
BUS) HIGH 

♦ 

PERFORM DATA TRANSFERS 

t 

DRIVE (DEVICE WANTS 

BUS) LOW 



DETECT (DEVICE GRANTED, 
BUS) LOW 




DETECT BBTTN HIGH 

I 

DETECT (DEVICE WANTS 

BUS) LOW 

-DRIVE (DEVICE GRANTED 

BUS) LOW 

RELEASE BBS? 



MASTER B HAS 
CONTROL OF DATA- 
TRANSFER BUS I 

DETECTBGIIN HIGH ^ 
DAISY CHAIN '"^ DRIVE BGlOQT HIGH 



DETECT BISY LOW 
DRIVE 65^1 IN HIGH 



I ARBITER WAITS FOR | 
j NEW BUS REQUESTS j 



Fig 2 — Flow diagrams help you define the VME Bus protocols. This flow diagram illustrates how the bus arbiter chooses between requesters 
that have the same priority level. 
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LISTING 3 



DEVICE "MASTER* (Am29PL141) "specify the device to use' 
DEFINE 

'define the test conditions" 
"master request Input " 

"bus grant in line ' 

"data transfer acknowledge line " 



tius_request 
bu8_grant 
dtack 
addr_strobe 



- to 

- t1 

- t2 

- t3 



"check address strobe bus line 
'other test conditions 



"define the control outputs' 
off__state - 0000#h 
as > eOOOOfh 

busmaster - 4CX)0#h 
arb_roq « 2000*h 
bbsy - 1000#h 



'off state 
"16th bit . 
"15th bit . 
"14th bit . 
»13th bit . 



1 : address strobe 
1 : inform master 
1 : drive BRn lino 
1 : drive BBSY line 



"signals going to the bus ere gated thru inverting high-current 
bus driver; these include AS,ARB_REQUEST, and BBSY 

slgnatl » 0001#h ; "other control signals " 

'FPC assembler format is - <label:output,statemenf;>,virith label 
optional" 
BEGIN , 

'wait for a bus request to be asserted high' 
(S) start :otf_state , while (not bus_request) wait 

else goto pt(get_bus) ; 
(d) bu»_granted "assert BBSY and BUSMASTER signals' 
: bbsy+busmaster 

, if (addr_8trobe « 1) then 
goto pl(bu8_granted) ; 
@ 'AS(L) must be HIGH before continuing; this means previous bus 
master is not driving the bus anymore ' 

"other statements" 



"BUS ACQUISITION microcode subroutine: the control output 
ARB_REQUEST Is asserted (BRn line) until the /BQlNn signal 
is received active LOW; if the bus request line from the master 
goes LOW before the bus is granted (e.g., master cancels bus 
request), then turn off the ARB_REQUEST output and return 
to the wait stale START ' 

(b) get_bus: arb_requo8t , If (bua_grant - 0)then 
goto pl(btJ8_grantod) ; 
arb_request , if ^us_request) then 
goto pl(get_J5us) ; 
off_state , if (not bus_request) then 

goto pKstart) ; 
END. 'end of source code* 



CPU MODULE 
(MASTER) 



FROM BUS AAA 
ARBITER ^\ £^ AA 



FPC Ain29Pt141 



SEO-CYCLE 
BUSJEOUEST 

SK3NALTOBUS 
BUS-MASTER AUBITOl 

OTACK^enR LVTORD. BB, BflSY. 

(— EN AS OS..DS, lAOL BEBR 



74AS240 



<: 




Am29828 



^ 



Fig 3 — You can implement the requester logic for a master module 
in an FPC. The FPC handles all Ims-acquisition and data-transfer 
protocols. 



ly can be suspended to accommodate interrupts and 
higher-priority operations should be assigned to bus- 
request line 0. The BCLR conditions will vary with 
your application, but you can modify them just by 
redefining the high-level logic expressions. 



Master modules 

The master modules of VME Bus systems initiate all 
data transfers over the bus. A system can have one 
master (the CPU unit) or several (multiple processors, 
DMA controllers). A master module must control the 
bus before it can perform any data transfers. Most of 
the bus-control logic resides in the requester section of 
the module; this section handles bus acquisition and 
communication with other master or slave devices in 
the system. 

You can microprogram all the requester functions 
into a single FPC, which serves as the interface be- 
tween the master device and the VME Bus (Fig 3). The 
FPCs microprogram handles the bus-acquisition proto- 
col. Upon receiving the bus-grant signal from the 
arbiter, the microprogram informs the master device 
that it has control of the bus and may initiate a data 
transfer. After completing the transfer, the master 
device releases the bus-request signa l, thus causing the 
FPC to free the bus by releasing the BBSY signal. The 
FPC may also relinquish control of the bus at the 
request of the bus arbiter. 

In designing your application, you need to extract the 
bus-acquisition phase of the requester from the flow 
diagram in Fig 2 and translate this information into a 
state diagram (Fig 4), from which you must write the 
corresponding FPC assembler source code. Listing 3 
gives an example of this code. 

FPC-controUed data transfers 

Once a master module becomes the bus master (that 
is, once it gains control of the bus), it can begin 
transferring data to a slave module. A data-transfer 
flow diagram (Fig 5) shows the necessary handshaking 
signals for transferring 32-bit data between master and 
slave. The state diagram for the master module's FPC 
(Fig 4) combines the bus-acquisition (requester) and 
data-transfer-control functions of the master module. 
An FPC assembler (which the manufacturer of the FPC 
provides) simplifies the task of microprogramming this 
state machine into an FPC. 

Handling unanswered data-transfer requests 

To prevent bus lockups, which malfunctioning slave 
devices may cause, you should implement a bus-time- 
out (BTOn) option in the master module by writing a 
microcode routine that uses the 6-bit counter in the 
master's FPC. (Listing 4 gives an example of such a 
routine.) The bus-time-out option permits a bus master 
to abort a data-transfer cycle if the slave does not 
respond within n microseconds. 

At the start of every data-transfer operation, the 
FPC microprogram loads a value into the 6-bit counter, 
tests fo r the data-transfer acknowledge signal 
(DTACK), decrements the counter and tests it for zero, 
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© 




ACTIVATE BBSY 
INFORM MASTER 
LT MASTER BUS 
BLOCKjyS GRAhn" OUT 
DRIVE BR HIGH 



LATCH ADDRESS 

ONTO BUS 

DRIVE LWOnS LOW FOA 

U-BU DATA TRANSFERS 

DRIVE \XB( HK2H 

TO INDICATE A 

DATA-TRANSFER CYCLE 



CHECK IF SEQUENTIAL 
TRANSFER CYCLE 



DRive 

WWTE 
LOW 





DRIVE STROBES 
0S^~iLOW 




BUS 
ERROR 
ROUTINE 





DRIVE 
HIGH 




NO 




YES 

1 


DRIVE BB5V HIGH 

(Bn MUST BE 

RELEASED 30 nSEC 

BEFORE BBS? 

RELEASE CWORD, 

RCK 

DEACTIVATE MASTEfl- 

GRANTEO-BUS 

SIGNAL 












WAIT FOR MASTER 
TO REQUEST BUS 
00 TO START 



Fig 4 — You can derive detailed state-machine diagrams from your flow diagrams. In this diagram for a master requester, the 
bus-acquisition phase is surrounded by a dashed line. Note the close correspondence between the state diagram and the assembly-language 
prrogram in Listing 3. 
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I MASTER MODULEn 

ADDRESS THE SLAVE 
PRESENT ADDRESS 
PRESENT ADDRESS MODIFIER 
DRIVE CVTORD LOW 
DRIVE (XCK HIGH 
DRIVE XS LOW 



SPECIFY DATA DIRECTION 
DRIVE WRITE LOW 



SPECIFY DATA WIDTH 
WAIT UNTIL tSUCK HIGH AND 
BERR HIGH (INDICATES 
THAT PREVIOUS SLAVE IS 
NO LONGER DRIVING DATA DUS) 
DRIVE ISSiT, LOW 



TERMINATE CYCLE 

RECEIVE DWCK LOW 

IF LAST CYCLE THEN 
RELEASE ADDRESS AND 
ADDRESS MODIFIER LINES 
RELEASE DATA LINES 
RELEASE CWORD 
RELEASE ISCK 

DRIVE OSi, HIGH 

DRIVE AS HIGH 



\ SLAVE MODULE } 



PROCESS ADDRESS 

RECEIVE ADDRESS 
RECEIVE ADDRESS MODIFIER 
RECEIVE LWORO LOW 
RECEIVE tACK HIGH 
RECEIVE AS LOW 
IF ADDRESS IS VALID 

FOR THIS SLAVE 
THEN GENERATE DEVICE SELECT 
ELSE TAKE NO FURTHER ACTION 



STORE DATA 

RECEIVE WRITE LOW 
RECEIVE DS]' LOW 
RECEIVE DSo LOW 
LATCH DATA FROM DATA 

LINES DOo-3, 
WRITE DATA INTO 

SELECTED DEVICE 



RESPOND TO MASTER 
" DRIVE OYACK LOW 



I FOR 



FOR SIMPLICITY. THE ASSUMPTION 
' IS MADE THAT NO-TRANSFER 
I CAUSES A BUS ERROR 



IN I 



END TERMINATION 
IF LAST CYCLE THEN 

RELEASE D^S; 

RELEASE fS 
ELSE GO TO ADDRESS THE SLAVE 



ACKNOWLEDGE TERMINATION 
RECEIVE AS, OSo-i HIGH 
RELEASE DTACK 



Fig 5 — This flow diagram shows the signals you'll need to perform 
a 32-bit data transfer once a master has acquired control of the 
bus. 



The bus-time-out option permits a bus 
master to abort a data-transfer cycle if a 
slave does not respond within a specified 
time. 



LISTING 4 

"Begin transfer : Bus time out loop contains Instructions 1 and 
2. If count reg Is loaded with 63 and FPC Is running at 20 MHz, 
then total time before bus times out is (63+1)'2*50 ns " 

walt_loop: DSO + DS1 . if (DTACK » 0) then "1" 

goto pl(transfer_complete) ; 
DSO + DS1 , while (creg <> 0) "2" 

loop to pl(wait_loop) ; 
DSO + DS1 , if (creg » 0) then 

goto pl(bus time_out) 



transfer_complete: "goto to this section if handshaking signal 
was received" 



bus_time_out; "execute this section when counter is 0" 

BTO_slgnal "activate bus time out signal" 



and fina lly loops b ack to th e instruction that tests 
DTACK. As soon as DTACK is detected, the program 
branches to the section of code that handles a normal 
data-transfer operation. 

If the counter value reaches zero, however, the 
microprogram must branch to a section of code that can 
handle situations in which data-transfer requests are 
not completed; such situations are, of course, entirely 
dependent on the user and the application. To calculate 
the time that will elapse before the program generates 
a bus-time-out signal, multiply the number of instruc- 
tions in the time-out loop by the system cycle time by 
the initial value (plus 1) that the program has loaded 
into the count register. EDN 
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Designer's Guide to 
VME Bus Control — Part 2 



By using fuse-programmable controllers (FPCs) and 
PLDs, you can implement VME Bus control in your 
computer system with a minimum of hardware. Part 
1 (October 2, pg 187) of this 2-part series described 
the bus-arbitration process and control functions and 
showed how to implement the VME Bus data-transfer 
protocol in an FPC. This second and final part de- 
scribes the implementation of similar controllers for 
slave modules and interrupt handlers, and it pro- 
vides tools for programmiiig the FPC. 



Arthur Khu, Advanced Micro Devices 

When you offload a system's bus-control functions from 
the CPU to a hardware bus controller, you considerably 
reduce the time these functions require, thereby im- 
proving the data-transfer rate over the bus. You can 
substantially reduce the cost and chip count of such a 
bus controller by implementing the bus-control logic in 
VLSI devices such as fuse-programmable controllers 
(FPCs) and programmable logic devices (PLDs). 

The VME Bus protocol requires that all data trans- 
fers over the bus be initiated by a master module. Your 
system can include several master modules, such as 
CPUs or DMA controllers; a bus-arbiter module arbi- 
trates simultaneous data-transfer requests from two or 
more masters. No data transfer can take place until the 
requesting master has been given control of the bus by 
the bus arbiter. 



r/EWbRYEbAJifi (SUVE) 



-DATA 



ADDRESS 

OEcooen 



INTERRUPT 
REQUEST \ 



LATCH_DATA 
STORE-DATA 



PRESENT_DATA 



WRITE 
ADDR,VAilia 



FPCAni29PL14t 




ADDRESS MODIFIERS AMo-s 



Fig 1 — This device controller, which is implemented with an FPC 
and a PLD, handles data-transfer and interrupt protocols for a slave 
device in a VME Bus system. 

Because a slave can't initiate a data transfer, your 
system must include an interrupt mechanism that al- 
lows a slave to request service from a master. You C3»" 
implement such a mechanism in your system by design- 
ing a slave subsystem like the one in Fig L This slave 
subsystem comprises a slave module and a slave inter- 
rupt controller. The slave interrupt controller, which 
consists of an Am29PL141 FPC and an AmPAL22V10 
PLD, serves as the interface between the slave subsys- 
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The interrupt controller for a slave device 
implements the interrupt protocol in 
hardware. 



tern and the VME Bus structure. Once the master has 
initialized the slave, it can issue commands to the slave. 
The slave performs these tasks in the background, 
leaving the master free to continue its own operations. 
When the slave is ready to return the status or result of 
a task to the master, it issues an interrupt request. 

The VME Bus single-cycle data-transfer protocol 
defines the interactions between master and slave con- 
trollers (Fig 2). When the master drives the address 
strobe (AS) low, the slave controller latches the address 
presented on the bus, and a separate address-decoding 
unit on the slave board decodes the address. If the 
address is not valid (ie, if it's not in the range associated 
with this slave), the slave takes no further action. 
However, if the address selects this slave subsystem. 
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Fig 2 — The slave controller recognizes its own address and receives 
a data word from the bus master in this flow diagram for single-cycle 
transfer. 
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Fig 3 — This state-machine diagram for Fig I's slave controller is derived from the flow diagram in Fig 2. The state machine has four modes, 
which handle interrupts as well as single, block, atid read-modify -vrrite data transfers. 
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The VME Bus protocol requires that all 
data transfers over the bus he initiated by 
a master module. 



the slave looks for signals from the master that specify 
whether a read or a write operation should take place. 
After presenting or storing data, the slave controller 
drives the data-transfer-acknowledge signal (DTACK) 
low to inform the master of the successful transfer. 

From the VME Bus-protocol flow diagrams (Fig 2), 
you can derive a state diagram (Fig 3) for the slave- 
controller state machines; Fig 4 shows the resulting 
timing pattern for the slave controller. You can develop 
microcode for the slave controller's single-cycle transfer 
mode from the state diagram and the timing pattern. 
Use the address-modifier lines (AM0.5) to specify the 
other three slave operating modes (sequential, or block, 
transfers; read-modify-write transfers; and interrupt 
cycle). To recognize these special modes, the slave 
controllers on each slave board constantly monitor the 
six address-modifier lines. 

When the code presented on the address-modifier 
lines specifies a block-transfer operation, the master 
retains control of the bus throughout the operation by 



holding AS and BBSY low. For a block transfer, bus 
arbitration takes place only once, before the start of the 
operation; for single-cycle transfers, bus arbitration 
takes place before the transfer of each word. A block 
transfer, therefore, takes less time than does the corre- 
sponding number of single-word transfers. 

At the start of a block-transfer operation, all the 
slaves load the address presented on the bus into their 
address counters and decoders, but only the slave 
whose memory range encompasses the decoded address 
responds to the data-transfer request. As each word 
transfer is completed, all the slaves increment (or 
decrement) their address counters and decode the new 
address. This procedure is necessary because the mem- 
ory block being transferred may reside on more than 
one slave memory board. 

In the slave subsystem in Fig 1, the PLD decodes 
control signals from the bus and slave board and sends 
two signals, OPERo and OPER,, to the slave's FPC. The 
FPC decodes these two signals, along with inputs from 




Fig 4 — You'll need to generate a timing diagram for each of the slave controller's operating modes. This diagram shows the timing for a 
single-cycle data transfer. 
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the slave, bus, and address-decoding units on the slave 
board, to determine which of the four possible slave 
operating modes to execute. The four modes, which are 
designated by binary codes, specify the following oper- 
ations: 

• (00) Perform a single-cycle transfer 

• (01) Perform a sequential-cycle transfer 

• (10) Perform a read-modify-write cycle 

• (11) Perform an interrupt cycle. 

If OPERo and OPER, are both high, the FPC operates 
as an interrupter by branching to an interrupt subrou- 
tine (Fig 3). 

When the slave requests an interrupt, the FPC 
generates an interrupt-request signal and waits for the 
interru pt-acknow ledge signal (JACK) and daisy-chain 
signal (lACKIN). When the data strobes become ac- 
tive, the PLD reads a 3-bit value from the address bus 
(A1.3); this value indicates which interrupt-request line 
was acknowledged. The PLD decodes these three bits 
to determine whether their value matches its own 



request level; if it finds no match, no further action 
occurs. If it does find a match, however, the PLD routes 
a valid signal to the FPC, indicating that the interrupt 
handler has acknowledged the slave's interrupt re- 
quest. The FPC then signals the slave board to put its 
status or identification byte on the data bus for the 
interrupt handler to use as an interrupt vector. The 
slave FPC waits in a loop until the interrupt handler 
drives the lACK signal high to signify that interrupt 
service is complete. 

Besides containing master, slave, and bus-arbiter 
modules, a VME Bus system usually has an interrupt- 
handler module that handles external I/O or special 
system events (time-out or overflow errors, for exam- 
ple). You can reduce the. logic complexity of the inter- 
rupt-handler module in your system by offloading some 
of the initial interrupt-recognition tasks to an inter- 
rupt-handling preprocessor (IHP). You can use a PLD 
as the IHP, programming it to preprocess interrupt 
requests, obtain control of the bus, and handle hand- 
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Fig 5 — The interrupt-handling preprocessor monitors the seven VME Bus interrupt-request lines (IK,:), identifies the interrupting device, 
and tells the interrupt handler when to begin servicing the interrupt. This timing diagram shows the signal states that exist before and during 
the transfer of the identification and status bytes from the slave-interrupt subsystem to the interrupt handler. 
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Whm the slave module is in block-transfer 
mode, bus arbitration takes place only once 
before the start of the operation. 



shaking signals (such as interrupt-acknowledge sig- 
nals). Only when the IHP latches the interrupt vector 
will control pass to the interrupt-handler module. 

To define interrupt-request processing, bus acquisi- 
tion, and the interrupt vector's transfer phase, you'll 
have to use logic equations written in high-level Boole- 
an notation. In the design in Fig 1, the PLD monitors 
seven interrupt-request lines and four data-transfer 
control inputs, and it sends 10 control signals to the 
interrupt handler and the VME Bus drivers. The PLD 
monitors all interrupt-request lines according to the 
following logic equation: 



IF (IRl + IR2 -I- IR.3 + IR4 + IRS + 

IR6 + IR7) THEN 
BR3 : = 1 ; 

"THIS INTERRUPT HANDLER USES" 
"THE BR3 REQUEST LINE" 



(1) 



If any interrupt-request line is active, the PJiD asserts 
BR3 to initiate the bus-acquisition phase. 

The next step is to wait for the bus-grant-in signal. 
Only when BGIN3 is active will BBSY be active. The 
logical expression of this condition is given in the 
following equations: 



IF (BR3VBG3IN) THEN 
BR3 : = 1 ; 



(2) 



IF (BR3*BG3IN 
BBSY: 



BBSY*/SERVICE_DONE) THEN (3) 
1 ; 



Eq 2 continually as serts B R3 as long as BG3IN i s not 
acti ve; Eg 3 asserts BBSY only when request line BR3 
and BG3IN are active, or if service is not complete after 
the IHP asserts BBSY. 
When the IHP is the bus master, it puts the 3-bit 



Development tools help you program FPCs 

The Am29PL141 is a single-chip fuse-programma- 
ble controller (FPC) that can implement state ma- 
chines and distributed control functions. You can 
express and functionally verify these state ma- 
chines and functions by using three FPC-develop- 
ment tools provided by the manufacturer: an as- 
sembler, a test-vector generator, and a simulator. 
These development tools, which are written in C, 
run on an IBM PC or compatible computer under 
MS-DOS. Source code for the programs is avail- 
able, so you can port the software to other 
systems. 

The FPC has a control store that's resident in 
its PROM; the store is 64 words long and 32 bits 
wide. Instructions such as jumps, loops, and sub- 
routine calls in the PROM control store are condi- 
tionally executed by the 20-MHz internal seq- 
uencer. Each 32-bit instruction is partitioned into 
the following format: 

BITS: 1 5 1 3 6 16 

FIELD: OE OPCODE POL TEST DATA OUTPUT 



The output field contains the 16 control outputs 
that the FPC generates when it executes an in- 
struction. The upper eight output bits are con- 
trolled by the output enable (OE) bit; they may 



be high, low, or disabled. The lower eight output 
bits are always enabled. The op-code field speci- 
fies which of the 29 possible instructions the FPC 
will execute if the condition selected by the test 
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7b program the Am2$PL141 FPC to Implement VME Biu- 
control ftmettont, you can use the assembler, test-vector geTiera- 
tor, and simulator provided by the FPC's manufacturer. These 
development tools, which are written in C. run on an IBM PC or 
compatible computer under MS-DOS. 
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interrupt-line-acknowledge value on the bus and as- 
serts the data strobes. Upon receipt of the DTACK 
signal from the interrupter, the I HP strobes the status 
byte from the bus into a register on the interrupt- 
handler card. The IHP then informs the interrupt 
handler that a status/identification byte is ready and 
that the handler should begin servicing the interrupt. 
The IHP resolves interrupt priorities within a single 
clock cycle because all interrupt/input signal lines are 
processed in parallel by the logic array in the PLD. 
Listing 1 shows how you'd express the procedure in a 
logic-description language. 

As you can see from Listing 1, if both the IR7 and the 
BBSY signals are active in section 1, then the 3-bit 
value generated by the IHP will be binary 111 (decimal 
7), regardless of th e sta te of t he other interrupt lines. 
In section 2, if IRl and BBSY are active and the other 
six control signals are inactive, then the 3-bit output 
value will be binary 001 (decimal 1). In section 2, IRl is 
the highest priority line that is active. 

The remaining interrupt-preprocessing steps com- 
plete the transfer of the interrupt-yector byte; this 
transfer is described logically in the following 





LISTfNG 1 


IF (BBSY) THEN 




BEGUN 




(1) IF (IR7) THEN 


"IR7 was actlva " 


INTR12H)1 : - 7 : 


"a-bit value acknowtadglng 




IntwTupt Una 7 " 


IF (/in7*IR6)THEN 


"IR6 actlva, IR7 Inactiva 


INTRI2:)1 : - 6 : 


"acknowladgaintemiptHna 6" 


IF(/IH7'/IR6'IR5)THEN ''IR5 highest prtortty " | 


INTR[2:01 : - 6 ; 


"Interrupt Hne active " 


(2) IF (/IR7*/IR6*/IR6VIR<'/in3*/IR2*IR1)THEN 


INTHf2:0] : - 1 ; 


"acluiowledfle Interrupt Una 1" 


END: 





IF (BBSY) THEN 

BEGIN 

lACK := 1; 
(X) IF (DTACK) THEN 

LATCH.STATUS:= 1; 



(4) 



"begin interrupt acknowledge daisy" 
"chain; if device sends data " 

"transfer acknowledge (DTACK) " 
"signal, then latch the status " 



END: 
IF (lACK)THEN 
AS:= 1; 



"assert the address strobe signal to inform the 
interrupter that the interrupt acknowledge 
level is ready" 



field is true. The polarity bit (POL) determines 
whether a high input or a low input represents 
true for the test input selected. You can use the 
data field as an argument for the op-code field. 
For example, if you want the FPC to jump (op 
code 25) to location 34 (22hex) when the second bit 
in the test field is high (true), you put the follow- 
ing 32-bit word into the control store (though 
youll also have to define the OE bit and the 
output bits): 

OE 11001 010 100010 OUTPUT 

The assembler simplifies system design by al- 
lowing you to use a high-level language. Instead 
of specifying the bit patterns shown above, you 
can write the following section of code: 

OUTPUT , IF (T2 = 1) THEN 
GOTO PL (34)j 

The assembler translates this statement into the 
appropriate bit patterns; when the FPC executes 
the instruction, it generates the bit pattern de- 
fined by the symbol OUTPUT. 

Every microinstruction written in assembly 
language follows this format: 



< LABEL : > OUTPUT , STATEMENT ; 

The label field, which is optional, simplifies the 
writing of subroutme calls and conditional branch 
instructions, which you can express in IF-THEN- 
ELSE, WHILE-DO, or COMPARE forms. Once 
you've written all the instructions with the editor 
and translated them to executable form with the 
assembler, you can cause the assembler to gener- 
ate a JEDEC fuse map. 

Before physically programming the PROM con- 
trol store that you've designed, you can test your 
design with the help of the test-vector generator 
and simulator. The test-vector generator produces 
test vectors, from a user-generated truth table, 
and converts them into a JEDEC-standard test- 
vector file that the simulator can use. 

The simulator performs two important func- 
tions: It verifies the control flow of your design, 
and it checks the fuse map that the assembler 
generates. By using the simulator's interactive 
mode, you can inspect, modify, or preset any of 
the internal registers in the FPC. The simulator 
also has single-stepping and break-point features 
so that you can trace the operation of your design 
in detail. 
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An interrupt-handlin£i preprocessor can off- 
load some of the initial interrupt-recogni- 
tion tasks of your system's interrupt-han- 
dler module. 



Once the assertion of the DTACK signal indicates the 
successful transfer of the 8-bit status or identification 
byte, the IHP instructs the interrupt-handler module 
to begin the interrupt-service routine. This instruction 
from the IHP is logically defined as follows: 



IF (BBSY*LATCH_STATUS + BBSY* 

START_SERVICE*/SERVICE_DONE)THEN 
START SERVICE : = 1; 



(5) 



This definition states that the IHP constantly asserts 
the START_SERVICE signal until the interrupt han- 
dler generates a SERVICE-DONE signal, at which 
point the IHP drives START_SERVICE low. The logic 
described above generates the timing diagram shown in 
Fig 5. 



Development tools simplify controller design 

Development tools provided by FPC and PLD manu- 
facturers simplify the task of programming these de- 
vices as VME Bus controllers. Once you've analyzed the 
bus protocols and converted these into state-machine 
diagrams, you can write assembly-language programs 
and high-level logic equations to describe the state 
machines. The assembler and logic software will then 
process these programs and equations to fit into the 
FPC or PLD. For a summary of the programming tools 
available for the Am29PL141 FPC, see box, "Develop- 
ment tools help you program FPCs." EDN 
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The first fuse-programmable controller 

eliminates bulky and expensive designs, freeing distributed 

intelligence to carve out a greater niche for itself. 



in much the same way that cars and highways 
spawned the suburbs, standard microprocessor 
buses and add-on boards have distributed pro- 
cessing intelligence, revolutionizing the design of 
digital systems. Breaking systems into independent 
modules shortens the design cycle, eases upgrades, 
and accelerates fault diagnosis. But despite these ad- 
vantages, an essential element has been missing: a 
one-chip controller geared specifically to the needs of 
distributed intelligence. 

Without that critical ingredient, engineers have 
been forced to turn to less than optimal solutions. 
One approach relies on boards packed with as many 
as 35 SSI and MSI devices. Another tack is to go with 
a powerful— yet costly— VLSI chip. Alternatively, a 
programmable logic device can be pressed into ser- 
vice, but such circuitry lacks the computing power to 
control peripherals. 

The missing element, a fuse-programmable con- 
troller chip, is now here. By mixing intelligence and 
control, the Am29PL141 stakes out new territory for 
distributed systems. The 20-MHz IC combines for the 
first time all of the elements of an intelligent micro- 
code controller. Its powerful sequencing logic. steps 
through the controller's 64-by-32-bit pipelined 
PROM. That fuse-programmable memory stores a 
user-defined microprogram drawn from a set of 29 
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microinstructions, including a repertoire of jumps, 
multiple branches (or case statements), and sub- 
routine calls. All can be executed conditionally, de- 
pending upon the outcome of one of eight tests. In 
addition, a serial shadow register on the 28-pin chip 
helps designers diagnose system troubles right down 
to a particular IC. (In the past, expediency often dic- 
tated that complex trouble-shooting be avoided for 
as long as possible.) 

Four basic blocks 

The controller comprises four main functional 
blocks. Three of them— the microaddress control 
logic, condition code selector, and microinstruction 
decoder— form the cornerstone of the controller, the 
address sequencer. The fourth is a microprogram 
memory (64 by 32 bits) with a pipelined register and 
serial shadow register (Fig. 1). 

For the most part, the elements of the address 
sequencer are fairly typical. Nevertheless, the way in 
which they are organized and connected, as well as 
the instruction set, make the chip unique. For ex- 
ample, the microaddress control portion of the 
sequencer contains one register for counting loops 
and another for stacking subroutine return address- 
es. Yet either register can be employed to double the 
capacity of the other. Consequently, the chip can nest 
two levels of loop counting or two levels of subroutine 
branching (the instruction set reflects those abili- 
ties). And the high degree of interaction between ele- 
ments, particularly within the microaddress control 
logic, makes necessary a highly sophisticated micro- 
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instruction set. 

The microaddress control logic is the brain of the 
address sequencer, since it generates the addresses 
that access the microinstructions. At any time, the 
address that is called depends on the preceding in- 
struction and the outcome of any conditional tests. 

Within the control logic, a program-counter multi- 
plexer supplies the PROM's 6-bit address. The multi- 
plexer takes the address from a microprogram 
counter, incremented program counter, branch con- 
trol logic, or subroutine register. Because the pro- 
gram counter contains the address of the currently 
executing instruction, that instruction is executed 
again when the program counter is selected as the 
address source. As a result, the counter plays a fun- 
damental role in tallying loops and executing "wait 
until true" instructions. 

The incrementer holds the next address in the 
sequence, and is the expected source when no jumps 
are executed and no branch or subroutine conditions 
exist. When conditional statements like "if . . . 
then . . . else" and multiple branches pass the re- 
quired tests, or when unconditional jumps are exe- 
cuted, the branch control logic supplies the address. 
Finally, when the program calls a subroutine, the 
subroutine register supplies the necessary address. 

A multiplexer selects one of three address sources. 
If only one stack level is needed, the value stored in 
the subroutine register is chosen. When the count 
register feeds the subroutine register, however, it 
furnishes an additional stack level. The third source, 
the incrementer, supplies the subroutine's return 
addresses. 

Doing double duty 

If not needed for a second subroutine level, the 
count register can, among other functions, execute it- 
erative loops and time external events. To accom- 
plish the former, the controller loads the register 
with the number of iterations to be run. Each iter- 
ation decrements the register until it reaches zero. 
The zero-detection logic associated with the counter 
'informs the chip's microinstruction decoding logic 
when the register "bottoms out." 

Using the same logic, an instruction can be re- 
peated a set number of times. Repeated executions of 
the same instruction is a simple way to insert wait 
states and, therefore, build an interface to different 
microprocessors and peripherals. 

The count register is loaded from any of four 
sources: a decrementer, for normal loops; an instruc- 



tion field; the subroutine register; and the branch- 
control logic. The last derives a 6-bit value from a 
data field in a microinstruction. 

The branch-control logic, a powerful block within 
the sequencer, calculates the 6-bit value either by ap- 
plying the microinstruction data field directly or by 
using it to mask the chip's six test inputs. To through 
T5. In the second case, the masked input actually be- 
comes the branch address. Moreover, either the data 
field or masked test value serves as both a branch ad- 
dress and a count value. 

The same control logic also compares the masked 
test inputs to a constant in a microinstruction field. 
The outcome of this check affects a flip-flop. The lat- 
ter's condition itself becomes a factor in deciding 
conditional branch and subroutine instructions. If a 
match occurs, the flip-flop is set. Alternatively, the 
flip-flop remains unchanged if there is no match. 
Because the flip-flop does not change when there is 
no match, it is particularly useful for comparing 
ASCII characters and other 6-bit fields, as well as for 
successively checking the chip's test inputs. 

Controlling conditions 

A set flip-flop is one of the eight aforementioned 
tests that fulfills a conditional branch or subroutine. 
Through its condition-code selection logic, the con- 
troller is able to check each of its six test input lines, 
as well as the Condition Code input. Further, an 
exclusive-OR gate within the selection logic switches 
the meaning, or interpretation, of a test result. In 
other words, with no external hardware, a test condi- 
tion can be asserted either when a match occurs or 
when one does not. 

The final component of the chip's sequencer sec- 
tion is the microinstruction decoder. That pro- 
grammable logic array generates the IC's internal 
control signals based on the microinstruction being 
executed and the test results reported by the con- 
dition-code logic. 

The IC's fourth functional block comprises the 
fuse-programmable microprogram memory, pipe- 
lined register, and serial shadow register. The pipe- 
line register is 32 bits wide, and stores the micro- 
instruction being run. The next address is calculated 
by the sequencer and its contents is fetched from the 
microprogram memory. The upper 16 bits of the 
pipeline's output remain within the chip to sequence 
addresses and control internal functions. 

Only the 16 low-order bits link to the outside, as 
user-defined control lines. Of these, the upper byte is 
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1. The 20-MHz Am29PL141 is the first complete microprogrammable controller chip, making it an important 
building block for distributed processing systems. Its powerful sequencing logic steps the controller through 
its pipelined PROM. The fuse-programmable memory is 64 by 32 bits. 
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put in the high-impedance state by setting a micro- 
instruction's Output Enable bit to 0. Moreover, chips 
can be cascaded readily if more than 16 control bits 
are needed (Fig. 2). 

The serial shadow register, also 32-bits wide, sim- 
plifies device- and system-level diagnostics. It can be 
loaded in parallel with the contents of the pipelined 
register or loaded serially from the Serial Data Input 
pin. On the other hand, the serial register can also 
load the pipeline or shift data out serially. It also may 
simply hold the data sent to it. 

To check out the chip, an instruction is shifted 
serially into the shadow register and then loaded in 
parallel into the pipeline. Doing so forces the instruc- 
tion to be executed, and its results transferred back 



from the pipeline into the shadow register. From 
there it is shifted out for diagnosis. If all the shadow 
registers in a system are tied together, a series diag- 
nostic loop is created that isolates a problem down to 
a single chip. 

The shadow fuse 

A separate fuse must be blown to set up the serial 
shadow register. When that is done, four pins are 
redefined to handle diagnostics. Specifically, the 
Condition Code and Zero lines and Output Data Bits 
6 and 7 become, respectively, the Serial Data In 
(SDI), Serial Data Out (SDO), Diagnostic Clock 
(DCLK), and Mode control lines. 

The strength of any controller— and the advantage 
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2. When an application calls (or more than 16 control bits, two or more chips can be 
cascaded horizontally. The lower 16 bits of each ol the chip's 32-bit microinstructions 
(of which there are 29) serve to control a system's components. Eight of these 16 con- 
trol bits can be put into a high-impedance state under microinstruction control; the oth- 
er 8 bits are always enabled. 
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of a microprogrammed system that employs it— lies 
in an engineer's ability to specify the sequence in 
which microinstructions are executed. To ensure 
that ability, the controller executes all the basic 
high-level constructs required for structured micro- 
programming. Its 29 op codes include sequential in- 
structions, conditional instructions, dual branching 
forks, and multibranchingcase statements. Iterative 
executions, like For, While, When, and Until, round 
out the set. In addition. Jump, Jump to Subroutine, 
Loop, and Compare instructions allow designers to 
store very complex algorithms in the chip's 64-word 
memory. 

Instruction formats fall into two categories. The 
first is for general microinstructions; the second is 
for the chip's Compare instructions. The latter com- 
pare a 6-bit test input to a masked constant. The 
Compare instructions are well-suited for character 
searches, as well as key searches in a look-up table. 

A single-precision, floating-point peripheral board 



(Fig. 3) presents a good example of the part that the 
controller plays in a distributed system. As a micro- 
programmed design, the peripheral serves as an add- 
on math accelerator card that plays with different 
hosts and buses. The controller orchestrates the ac- 
tions of the floating-point processor, and various reg- 
isters, register files, and memory chips. 

Simple arithmetic 

The processor is simple to use, partly because it in- 
curs no pipeline delays. It conforms to IEEE and oth- 
er industry standards, and takes only a single clock 
cycle to add, subtract, or multiply. It needs five cycles 
to divide, using the Newton-Raphson method that in- 
verts one of the factors and multiplies. In operation, 
to divide X by Y the chip fetches the approximate in- 
verse of Y from a PROM-based table and multiplies 
it by X. One or two iterations of this method increase 
the initial accuracy. 

The floating-point board works with a microword 
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3. In a typical application, ttie controller oversees the workings of a floating-point pro- 
cessor board. Host instructions sent to the FIFO and instruction registers initiate sub- 
routines in the controller that generate the signals that run the board. Two controllers 
are employed to supply the necessary number of control signals. 
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of at least 25 bits, 9 more than available with one con- 
troller. Thus the design employs two controller chips. 
The floating-point processor requires five command 
bits. Three are instructions and two select the input 
source. It also needs three control bits to enable its 
trio of data registers. Two other chips (each a dual- 
access four-port register of 64 words by 18 bits) tem- 
porarily store commands. They accept 12 register ad- 
dress bits (6 for source and 6 for destination) from 
the host or the controller's microprogram memory. 

Data passes to and from the host through input 
and output registers on the board, which call for 
their own enable signals. Another bit is needed to ad- 
vance the FIFO instruction register. One is necessary 
to enable and another to select a status word. (The 
floating-point chip supplies status information, 
which is available to the controller through its test 
inputs as well as to the host through the register file.) 
Seven control bits are left for miscellaneous tasks. 

Operation begins when at least one 16-bit instruc- 
tion is loaded from the host into the peripheral's 
FIFO register. The instruction consists of a 4-bit op 
code, a 6-bit source-register address, and a 6-bit ad- 
dress for the second source register, which also stores 
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4. Loading an instruction into the FIFO starts the pe- 
ripheral and activates the controller, which loads an 
external instruction register and decodes the op 
code field. The op code initiates a subroutine in the 
controller, issuing the proper control signals, ad- 
vancing the FIFO register, and loading the next 
instruction. 



the results. Until an instruction is received, the con- 
troller is in the wait state (Fig. 4) . When an instruc- 
tion arrives, however, the FIFO's Empty signal acti- 
vates the controller, which then reads the command 
from the FIFO into a separate instruction register. 
The controller also loads the instruction's op code in- 
to its test inputs. It then masks the two unused test 
bits and jumps to a subroutine that performs the 
operation specified by the op code. After completing 
it, the controller advances the FIFO register to load 
the next instruction. 

The peripheral executes up to 16 op codes. The first 
eight are single-cycle operations and identical to 
those of the floating-point processor. They consist of 
addition, subtraction, multiplication, and format 
conversion instructions. The remaining op codes are 
used to load, store, and divide data, and a multiple cy- 
cle instruction multiplies and accumulates values. 
The four remaining op codes can be defined by the 
user to implement application-related operations. 

Software and hardware tools are a necessary part 
of such projects as the foregoing peripheral. The soft- 
ware assembles high-level microprograms and a 
JEDEC output file that specifies the fuse pattern to 
be burned into the PROM array. Currently, a pro- 
gram called Fuse Formatter, which runs on the IBM 
PC personal computer, lets designers enter hexadeci- 
mal code that corresponds to PROM data. From that 
code, the program creates a file that is downloaded 
directly to one of several PROM programmers. The 
latter blow the corresponding fuses in the micro- 
program memory and are the only required hard- 
ware tools. D 

Om Ayrawal is the product planning manager for 
programmable logic devices at AMD. He has designed 
16- and 32-bit minicomputers, and is the coauthor of a 
book on high-speed memory systems. Agrawal holds a 
PhD in electrical engineering and computer science 
from Iowa State University. He also received an MBA 
from the University of Santa Clara. 

As a senior product marketing engineer for the 
company's microprocessor division, Deepak Mithani, 
designs and markets bipolar microprocessors. He 
earned a BSE E from India's Maharaja Sayajirao 
University and an MSEE from the University of 
Wisconsin. 
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Advanced Micro Devices Inc., 901 Thonrpson PI., P.O. Box 3453, Sunnyvale, CA 94088; (408) 732-2400. 



Housed in a 20-pin 
DIP, the smallest PAL 
device to date for 
building state ma- 
chines sports registers 
that save I/O pins, yet 
It Is easy to test. 



Designers who build state machines from pro- 
grammable logic devices (PLDs) must develop 
good juggling skills. The job usually takes a careful 
balancing of the I/O pins, the product terms, and 
the registers that store states and output bits. But 
because these resources all are limited in a PLD, 
performance and compactness often have to be sac- 
rificed in favor of the desired function. 

Ideally, state-machine designers need a PLD 
with buried, or internal, state registers, compact 

packaging and high 

speed. Buried registers 
free valuable I/O pins; a 
small package saves 
precious board space; 
and high operating 
speed speaks for itself. 

Approaching the 
ideal is the AmPAL- 
23S8, the first 20-pin 
programmable array 
logic (PAL) device de- 
signed specifically for building state machines and 
sequencers. Its architecture is more flexible than 
any 24- or 28-pin PAL device or programmable 
logic array (PLA). For example, it has 14 edge- 
triggered, D-type flip-flops (Fig. 1). That alone 
compares favorably with 24-pin bipolar PAL de- 
vices, which have up to 10 registers, and with 24- 
and 28-pin PLAs, which have 12 to 14 registers. 
Only 40-pin PAL devices with 16 registers surpass 
the new chip. 

As significant is the fact that of the chip's regis- 
ters, six hold buried state bits. By definition, the 
buried registers do not merely free I/O pins: In- 
deed, they actually enhance the chip's value as a 
state machine. Driven by the chip's AND-OR 
array the buried state registers, along with the eight 
registers dedicated to I/O pins, make possible a 
wide range of states and outputs. 

Moreover, the buried state bits can be easily 
accessed, controlled, and observed. Although the 



first two attributes are common among PAL de- 
vices, the third has always presented problems. The 
23S8 is the first chip whose product-term array 
makes the buried state registers observable. 

Of the other eight registers, four serve in output 
logic macrocells and four store output states. Each 
macrocell contains three main blocks (Fig. 2). One 
block is the register, a rising-edge-triggered flip- 
flop sharing asynchronous reset and synchronous 
preset inputs with all the other flip-flops. The other 
two blocks are multiplexers, one of them selecting 
from one of four output paths, the other from one of 
two feedback paths. 

The logic chip is small, fast, and easy to program. 
Packaged in a 0.3-in. DIP, it is half the width of 
28- and 40-pin PAL and PLA type devices. More- 
over, it comes in 33- and 28.5-MHz versions, with 
propagation delays of 20 and 25 ns, respectively. 
The faster version is an ideal companion for new 
8-MHz, 16-bit microprocessors, as well as for 
16-MHz, 32-bit units. Even at 28.5 MHz, the new 
chip is faster than existing bipolar PLA-based se- 
quencers, which operate from 20 to 25 MHz. It also 
beats equivalent-density PAL devices operating at 
or below 25 MHz. 

Programming takes place by blowing reliable 
and fast platinum silicide fuses, and using available 
software packages. Like all PAL devices, the 23S8's 
fusible AND-OR array is a direct measure of its 
capability. With 6200 fuses, it is three to six times 
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the size of standard 20-pin bipolar PAL devices. In fact, it 
is larger than all other bipolar PLDs except for the 40-pin 
chips, which are three to four times as large and have 
some 8000 fuses. The new chip's fuse array handles all 1 4 
registers, with 6 to 1 2 logic products for each output, and 
up to 23 inputs — 9 of them dedicated, 8 for feedback, and 
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1. The AmPAL23S8 Is the first 20-pln PAL device de- 
signed specifically for building state machines 
and sequencers. It contains 14 registers, 6 of 
them buried and requiring no i/O pins, 4 dedi- 
cated to output macrocells, and 4 storing output 
bits. The states of the burled registers are easily 
observed, which simplifies debugging. 



6 for the buried states. 

The number and distribution of the chip's product 
terms are two other measures of its capability. It has alto- 
gether 1 35 product terms for logic and control functions. 
Moreover, the AND array that drives the 14 flip-flops is 
variably distributed (see the table ). 

The distribution of the 1 24 logic product terms helps 
adapt the chip to system requirements, while optimizing 
its internal resources. (Of the 1 1 other terms, 8 control 
output, the remaining 3 control preset, reset, and observ- 
ability, respectively.) For additional flexibility in de- 
signing powerful state machines, the chip can trade off the 
product terms of its buried registers. 

OBSERVABILITY CLOSE UP 

The chip's observable product term is generated by the 
AND-OR array, along with an ability to preload the 
buried registers. Observability helps check a buried regis- 
ter's state, so that a designer can monitor any register dur- 
ing debugging. 

Under control either of a single pin or of a combination 
of six input signals, the observable product term drives a 
set of inverting buffers. In normal operation, the buffers 
also serve the four dedicated output registers and two of 
the four macrocells. When activated, therefore, the ob- 
servable product term disables signal flow from those reg- 
isters and macrocells and simultaneously connects the six 
buried registers to their respective I/O pins. 

Flexibility in connguring a state machine derives from 
the independent control of output and feedback multi- 
plexers. Each output macrocell has three fuses, two to 
determine the output path and one the feedback path. The 
fuses can configure the macrocells in eight ways: a combi- 
natorial or a registered output with either an I/O pin or a 
registered feedback. Each of those four variations offers 
either active high or active low outputs. 

In the output path, fuse S, determines whether the out- 
put is active high or active low. The output-nature fuse, 
Sj, selects sequential or combinatorial operation. When 
both of those fuses are intact, a flip-flop's Q pin passes 
through an inverting buffer, making the output active low 
and registered. 

On the other hand, programming or blowing S i renders 
the output term active high; blowing S2 causes the output 
to bypass the register, making it combinatorial. A combi- 
natorial output with feedback can, moreover, provide 
more than one level of ORing logic. Finally, fuse S3 con- 
trols the feedback path, sending either the flip-flop's Q 
output or the I/O pin back into the array. 

Controlling each output's enable line through a prod- 
uct term from the AND array creates further options by 
transforming an I/O into a dedicated input, a dedicated 
output, or a dynamically controlled I/O pin. A designer 
can thus adjust the number of input and output pins, and 
have the dynamic I/O capability required by most buses. 
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Each output cnuble product term is further associated 
with an individual polarity fuse. This polarity fuse allows 
designers to modify a control signal to enable outputs us- 
ing De Morgan's theorem, which is especially useful for 
bus control applications. 
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2. Each programmable macroceli has three main 
blocks: a rising-edge-friggered D-typ© flip-flop, 
an output multiplexer, and a feedback multi- 
plexer. The output multiplexer selects from one of 
four output sources; the feedback multiplexer 
feeds either on I/O pin or the flip-flop's Q output 
into the programmable AND-OR array. 



AS o- 

Ai-Aj, CI 



MREQ O- 
RFCLK O- 



RMC Of 

Initialize O 



AmPAL23S8 

for timing 

and arbitration 



0,-0, 
0, 



Ao-A,0- 

SIZo-SIZ,0- 

PTRo-PRT,0- 

R/WO- 



AmPAL23S8 

<i 
V 

-I. 0: 

-Is 



Oo-Oj 



I (or data sizing q 
' and alignment ' 



COIJ^DD 



I 



Clock O- 



3. Two 23S8S lie at the heart of an advanced dynamic RAM controller that 
compares well in its data-transfer features— such as dynamic bus sizing and 
handling misaligned data — with recent 32-bit microprocessors. One of the two 
main PAL devices controls the timing and arbitration of the memory cycles; 
the other sizes and aligns data transfers. 



The chip's high speed and buried state registers make it 
an ideal foundation for a flexible and fast dynamic RAM 
controller that meets the complex requirements of the lat- 
est microprocessors. Even at a 16-MHz processor clock 
rate it can become a controller with few or no wait stales 
that might otherwise have to be built from scratch. 

There is, for example, no ready-made controller that 
can lake advantage of dynamic bus sizing the auto- 
matic adjustment to the amount of data movable in one 
bus cycle. Nor can any off-the-shelf unit handle misal- 
igned data, that is, when an operand falls outside its 
proper memory boundaries. Bus sizing and data align- 
ment are features of the recently arrived Motorola 68020 
and the Intel 80386, both 32-bit microprocessors. But a 
controller able to perform the two functions can be built 
around two 23S8s, one for timing and arbitration logic 
and the other for data sizing and alignment (Fig. 3). 

The timing and arbitration chip arbitrates among 
present, refresh, and CPU cycles; executes read or write, 
read-modify-write, and refresh cycles; and asserts inter- 
face signals. The chip first arbitrates between the proces- 
sor's Memory Request (MREQ) and Refresh Request 
(RFCLK) signals. It gives priority to the cycle currently 
being executed, follows this with the refresh cycle, and 
concludes with the processor cycle. Both intermittent and 
burst-mode refresh schemes are therefore possible. 

The design's timing relationships and arbitration re- 
quirements define a state-machine diagram (Fig. 4). The 
actual state machine 
employs five of the six 
available buried regis- 
ters. The sixth register 
is a flag and so acts as a 
small, independent slate 
machine. Either of two 
software packages 
Cupl from Personal 
CAD Systems inc.'s 
Assisted Technology 
Division (San Jose, 
Calif.) or Abel from 
Data I/O Corp. (Red- 
mond, Wash.) easily 
describes the state oper- 
ations. (Cupl is the lan- 
guage used in this par- 
ticular example.) 

No arbitration takes 
place during the exe- 
cution of a cycle, which 
is indicated by a state 
other than 0. The tim- 
ing and arbitration chip 
stores any refresh re- 
quest and respt)nds to it 



10-bit bus 
interface latcfi 

(Am29841) 
OE 



10-bit bus 
Interface latch 

(Am29841) 
OE 



Bl 



"■= 10-blt 
counter 
(AmPAL22V10) 



To dynamic RAM 
address lines 



-•^RASo-RASi 
-•O CASo-CASj 



-•^WE 

-•^ DSACKo-DSACK, 



U 



MonolUhiG 



m 



Memories 



^ 



2-641 



PAL Device Buries Registers, Brings State Machines to Life 



later, continuing to follow the current cycle through its 
various states. Thus the current state automatically takes 
priority, and no other cycle is started until the current one 
is complete. 

At the end of the cycle, at state 0, the timing and arbi- 
tration chip arbitrates between MREQ (a CPU cycle) 
and RFCLK. In the small independent state machine, 
the buried register flag. Bis, latches RFCLK, which may 
last no longer than one clock cycle. When RFCLK is 
high, Bs toggles to state 1 and stays there until the refresh 
cycle is executed. Then it toggles back to state to be 
ready for the next Rf'CLK. 

During refresh cycles, the timing and arbitration chip 
asserts the Refresh signal (RFSH), which supplies the re- 
freshed row address to the memory address bus. The chip 
also generates the Row and Column Address Enable 
Lines, ROWADD and COLADD, which multiplex ad- 
dresses onto the memory bus. At state 16, the chip re- 
moves ROWADD and asserts RFSH; state 17 asserts the 
Row Address Strobe signal (RAS) and refreshes all 
memory banks before removing RAS and RFSH. Re- 




Note: When signal designations appear 
above and beiow a line, the upper Indicates 
Inputs and the lower, outputs. 



4. A state diagram for ttte timing and arbitration ctiip reflects Its oper- 
ation for selecting and orctiestrating ttie memory cycles. The ctilp 
waits in state to hiondie eittier a processor cycle or a refresti cycle; 
the latter talces priority over any but a cycle currently in progress. The 
chip performs refresh, read, write, and reod-modify-write cycles. 



moving RFSH increments an external 1 0-bit refresh-row- 
address counter (AmPAL22V 10) to the next row. 

If no refresh request is pending during state and 
MREQ is asserted, a processor cycle begins and the ma- 
chine jumps to state 1 . Otherwise the machine stays in 
state and polls for either a refresh or processor cycle. 

The PAL chip responds to the processor's signal KMC 
to execute a read, a write, or a read-modify-write cycle, 
with the internal register? controlling timing for the dif- 
ferent cycles. If, for instance, the memory consists of 
4 Mwords, organized as four equal banks of 32-bit words, 
the chip asserts a RAS signal for each bank — RASo 
through RAS3. 

The strobe signal being asserted depends on address- 
select signals ADSELq and ADSEL,, which in most sys- 
tems are the high bits of the processor's address bus. After 
asserting the proper strobe, the machine jumps to state 2 
to manipulate the address-handling signals. For instance, 
to multiplex addresses, ROWADD and COLADD are 
again asserted. 

In state 2, the timing and arbitration chip removes 
ROWADD and asserts COL- 
ADD, along with SIG, the refer- 
ence that tells the data-sizing and 
alignment chip that a processor cy- 
cle is in progress. SIG syn- 
chronizes the two 23S8s and in- 
structs the data-sizing and 
alignment PAL to send a column- 
address strobe (CAS) to memory. 
The state machine remains in 
state 2 for an extra-cycle, and then 
moves to state 4 to allow time to ac- 
cess the data. There it decides, 
based on processor signal RMC, 
whether to execute a read, a write, 
or a read-modify-write cycle. Be- 
fore completing a read or write ac- 
cess, the timing and arbitration 
chip removes RAS and waits for 
one more cycle to allow for the 
RAS precharge time. Simulta- 
neously, it switches back to ROW- 
ADD from COLADD in prepara- 
tion for the next processor cycle, 
and disables SIG. 

When the RMC signal requests 
a read-modify-write cycle, the 
timing and arbitration chip auto- 
matically lengthens the cycle. It 
holds RAS until DSACK and WE 
signals have been asserted, and 
only then completes the cycle. 

The data-sizing and alignment 
chip works only during a CPU cy- 
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cle. It creates WE for the write and read-modify-write cy- 
cles, and it accepts SIG, which synchronizes the chip's 
different states (Fig. 5) with those of the timing and 
arbitration chip. 

Four of the data-sizing and alignment chip's buried 
state registers generate its state-machine function. The 
machine starts polling in state and stays there until it re- 
ceives SIG, the start of a processor cycle. 

The 68020 has a 32-bit maximum data width and a 
data bus divided into four byte-wide segments, each inde- 
pendently controlled by a CAS signal from the data- 
sizing and alignment chip. The processor can therefore 
read and write byte-wide data over any segment. 

The four CAS signals, CASo to CAS3, strobe the col- 
umn address from the high- to low-order memory bytes. 
The signals also allow the microprocessor to read and 
write across the low-low, low-middle, high-middle, and 
high-high bytes of a 32-bit data port. The data-sizing and 
alignment chip must assert all four CAS signals for a 
32-bit port; only CAS2 and CAS3 are needed to supply a 
16-bit port's low- and high-byte strobes; while for an 8-bit 
port, CAS3 is enough. 




Note: Where signal designations 
appear above and beiow a line. 
the upper indicates inputs and 
the lower, outputs. 



5. The data-sizing and alignment chip works only 
during a processor cycle and waits in state until 
one is requested. It \t\o processor calls (or a read 
or a write cycle, the chip asserts OSACK, CAS, 
and, (or a write cycle, WE. For a read-modity-write 
cycle, it asserts two OSACK signals, one each (or 
the read and write segments. 



The data-sizing and alignment chip asserts the CAS 
signals in state 1 , and selects them according to the size of 
the port in use, the width of the data being moved, and 
how the data aligns with the port. Coding on input pins 
PRTo and PRT, shows the port size; signals SIZq and 
SIZ| from the microprocessor show data width; and the 
microprocessor's least-significant address bits, Aq and 
A|, give the alignment. 

THE PROCESSORS PREFERENCE 

The size of the port dictates the amount of data that can 
be transferred in any one cycle. Based on the PRT signals 
the state machine asserts DSACK signals, which ac- 
knowledge the CPU and tell it how much data it can 
move, since it will always try to move as much as possible. 
The state machine synchronizes the acknowledgment 
timing, asserts the required CAS signals, and holds them 
until the read, write, or read-modify-write cycle is com- 
plete. 

A read cycle's DSACK signal is asserted in state 1 and 
remains until the end of the cycle at state 4. A write cycle 
is identical, except for the additional WE signal. But a 
read-modify-write cycle requires two DSACK signals, 
one for reading, the other for writing. The first DSACK is 
the same as that for a read or a write cycle; the second 
DSACK occurs in state 8 and remains until state 1 1, with 
a delayed WE asserted in state 8 by an internal 4-bit 
counter. D 

Om Agrawal is product planning manager for pro- 
grammable logic devices at Advanced Micro Devices. 
He has designed 16-bit and 32-bit minicomputers and 
is co-author of a book on high-speed memory systems. 
He holds a PhD in electrical engineering and computer 
science from Iowa State University and an MBA from 
the University of Santa Clara. 

Kapil Shankar is a senior product planning engineer 
for programmable logic and memory devices, and has 
designed advanced graphics systems. He has an MS in 
computer and systems engineering and an ME in elec- 
trical power engineering, both from Rensselaer 
Polytechnic Institute. 
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Conquers Timing Restraints 



Bruce Threewitt, Manager, Product Planning 
Advanced Micro Devices, Sunnyvale, CA 



When it comes to generating 
complex, high-resolution dig- 
ital waveforms. Advanced 
Micro Devices' Am2971 picks up 
where older parts leave off. Precise 
time delays once required the use of 
either a hybrid structure consisting 
of an analog delay line combined 
with digital logic or costly counters 
driven by high-frequency clocks. On 
the one hand, analog delay lines have 
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short enough tap-to-tap time delays, 
or resolutions, to be usable for han- 
dling the tighter timing relation- 
ships of dynamic and static RAMs. 
However, those lines cannot also ac- 
commodate the longer total delays 
needed by these devices. 

On the other hand, the desired 
waveforms can be obtained digitally 
from the outputs of one or more 
binary counters. But to achieve reso- 
lutions of 10 ns, counters must be 
driven with 100-MHz clock frequen- 
cies, which are difficult to distribute 
around a board. 

The programmable event genera- 
tor (PEG) a monolithic IC, offers 



a more elegant solution for the prob- 
lem of generating complex high- 
speed timing waveforms (see Fig. 
1). Applications for the PEG (see 
box, "A new PEG in the designer's 
tool box") range from simply cor- 
recting the clock skew that results 
from distributing a clock signal on 
a backplane to generating complex 
state-machine timing. The PEG's 12 
output lines can be programmed by 
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Fig. 1. In response to a logic tran- 
sition on Its trigger (TRIG) input, the 
PEG will generate 12 simultaneous 
user-programmed timing waveforms 
on Its Tt to 111 output lines. 

the user to assume either a logic- 
high or a logic-low level within each 
of 32 time slots or events. The fre- 
quency of these events — that is, the 
frequency of the output waveforms 
— is also programmable. But it is the 
precision and resolution of these 
waveforms that is the PEG's claim 
to fame (see Fig. 2) . 

Internal operation 

The PEG is an edge-triggered 
logic device that generates a pre- 
programmed digital waveform in 
response to a triggering signal. It 
consists of four basic blocks : a next- 
address and event store, a start-ad- 
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dress store, an oscillator and clock- 
control block, and control logic (see 
Fig. 3) . A total of 623 user-program- 
mable platinum-silicide fuses, simi- 
lar to those used in AMD's bipolar 
PROMs and programmable array 
logic devices, are located throughout 
these blocks. 

When the PEG's fuses are being 
programmed, the 12 timing-tap out- 
puts (Tg to Tji) are used as fuse- 
address inputs. A thorough descrip- 
tion of the procedure for program- 
ming the PEG can be found on its 
data sheet, so for the purpose of 
understanding what goes on inside 
the chip, let's assume that it has al- 
ready been programmed. 

An output-timing sequence is ini- 
tiated by a logic transition at the 
TRIG input. Two user-program- 
mable fuses are located in the trig- 
ger-polarity block. One of them is a 
polarity-select fuse, which when un- 



programmed will cause the timing 
sequence to start during a negative 
tran.sition of the TRIG input. If it 
is programmed, the sequence starts 
when a positive transition occurs. 

The second fuse in this block is 
used to define the end of a timing 
sequence. If this fuse is left unpro- 
grammed, the timing sequence is 
stopped on the trailing edge of TRIG 
pulse. Otherwise if the second fuse 
is programmed, the end of the tim- 
ing sequence is defined by the stop 
bits as programmed by the user into 
the next-address/event generator 
functional block. 

Customized waveforms 

A transition at the TRIG input 
latches three address bits. A,, to Aj, 
which are decoded by the start-ad- 
dress store, which serves as an 8 x 
5-bit mapping PROM. Each of its 
eight addresses represents a differ- 



ent starting point for the 12 output 
waveforms. For example, three dif- 
ferent addresses in this store could 
be used to initialize output wave- 
forms that implement the read-, 
write-, and page-mode timing se- 
quences of a dynamic RAM. 

The five bits of each address are 
used to select one of thirty-two 18- 
bit locations from the next-address 
and event store (also a PROM). 
Each 18-bit string contains three 
groups of information. The first 5 
bits define the next address of the 
desired timing sequence. The next 
12 bits define the logic levels on each 
of the timing-output lines (Tq to 
Tj 1 ) . The 18th bit is a sequence-stop 
bit. As the timing sequence pro- 
gresses from the first to the thirty- 
second event-store address, the out- 
put lines will produce 12 independent 
waveforms. 

The assumption so far is that the 
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Fig. 2. Tap-to-tap resolution of an 
analog delay line (a) is not indepen- 
dent of its total delay time. However, 
the PEG'S 12 programmable output 
waveforms (b) are independent of 
one another. Resolutions down to 
10 ns can be easily obtained. 
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f\g. 3. The PEG consists of tour basic blocks: a next-address and event store, 
a start-address store, an oscillator and clock-control block, and control logic. 



stop-function fuse has not been 
programmed. If it has, the timing 
sequence will no longer stop on the 
trailing edge of the TRIG pulse. The 
end of a timing sequence will instead 
be defined by the stop bit. When a 
timing sequence encounters a stop 
bit, the entire sequence comes to a 
halt, and the 12 timing-output lines 
remain at their current logic level. 

Each of the timing-waveform out- 
puts has a minimum useful cycle 
time of 40 ns (or 20 ns per change) . 
When the PEG is operating at its 
maximum internal-clock frequency 
(fe = 100 MHz), the outputs must 
be programmed to remain un- 
changed for at least two clock pe- 
riods for each change of logic level. 
That is, although an output can only 
change a minimum of every 20 ns, 
the timing resolution between events 
among taps may be as low as 10 ns. 
When the PEG's internal clock is 
programmed to operate at 50 MHz or 
slower, the timing waveform can be 
programmed to change once each 
clock period. And its clock-frequency 
range is low enough — from 7 to 20 
MHz — for the user to drive the 
PEG'S CLK input from the system 
clock via an internal clock input pin. 

Alternatively, the designer may 
opt for crystal control by connecting 
a 7 to 20-MHz crystal to the X, and 
Xj inputs of the PEG. And thanks 
to the five fuses within the oscillator 
and clock-control block, the user can 
also set the frequency of the output 



waveforms. Four of these fuses are 
used to select f^. 

When an external crystal is used, 
the PEG'S internal phase-locked loop 
(PLL) will treat the crystal's fre- 
quency as an input-reference fre- 
quency, f,, and will multiply it by a 
programmable value (1, 5, 10, %, %, 
or ^%) to obtain the desired value of 
fe (see Fig. 4). Also an output-clock 
frequency, f^, is available on the 
CLKOUT pin. This output clock— 
whose frequency is either % or Mo 
of the internal-clock frequency — 
may be used to synchronize the out- 

A new PEG in the 
designer's too! box 

Designers no longer have to grap- 
ple with low-resolution analog de- 
lay lines or costly timing logic 
when high-speed waveforms are 
needed. The Am2971 program- 
mable event generator, trade- 
marked PEG, from Advanced Mi- 
cro Devices has 12 output lines 
that can produce and accurately 
place logic transitions as short as 
20 ns, with a minimum waveform- 
to-waveform resolution of 10 ns. 



puts of the PEG with the remainder 
of the system. And speaking of sys- 
tems, the PEG comes in very handy 
for generating the specialty cycles 
offered by today's DRAMs. 

DRAM-timing appilcation 

Typically, the system designer 
must use more than one analog de- 
lay line and some exte rnal logic to 
produce the multiple CAS pulses 
needed for the page-mode cycles of 
some DRAMs. Timing designs using 
high-speed clocks and counters are 
equally cumbersome in this case. 
Once again, the PEG chip offers a 
streamlined solution because i t can 
readily generate the multiple CAS 
pulses. 

Another job for the PEG comes 
about as a result of the use of multi- 
plexed address lines on high-density 
DRAM boards. Because the many 
address lines present a relatively 
large capacitive load to the multi- 
plexer, significant time delays are 
introduced in the address-timing 
waveforms. 

Typically, a signal provided by the 
system switches one or more ad- 
dress-multiplexer ICs. This signal 
must be accurately timed to change 
state after time interval 1 in Fi(i. 5. 
The performance of the individual 
DRAM — and ultimately of the entire 
memory system in which it resides — 
depends on reducing interval 2 as 
much as possible. Interval 1 is 15 to 
20 ns in most standard DRAM sjie- 
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Fig. 4. The PEG can be clocked by 
an external crystal (a) or by the sys- 
tem clock. On-chip programmable 
clock-control logic (b) enables the 
user to set the internal-clock fre- 
quency, fc. 

cifications. Thus, a resolution of 10 
to 15 ns is desirable for the DRAM- 
timing waveforms — an easy job for 
the PEG. 

What's more, the PEG can be used 
to generate the necessary timing 
waveforms for multi-array mem- 
ories too. In video-DRAM systems, 
for example, the rising edge of the 
transfer-enable signal, TRG, must 
be accurately placed relative to the 
rising edge of the serial-port clock 
signal, SC. When the PEG is used to 
generate both waveforms, the de- 
signer has full control over the reso- 




luti on be tween the rising edges of 
the TRG and SC signals. 

More events and/or channels 

In some applications, 12 output 
channels are sufficient but more than 
32 event states are necessary. These 
additional states can be obtained by 
connecting two PEGs as shown in 
Fig. 6. After PEG 1 has cycled 
through its 32 states, its Tq line 
triggers PEG 2. The second PEG 
will then begin its timing sequence 
at the start address specified by PEG 
1. If necessary, the clock frequency 
of PEG 2 may be changed by using 
the clock output of PEG 1 as the 
clock input to PEG 2. Otherwise, the 
internal clocks and TRIG polarities 
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Fig. 5. In this DRAM delayed-write cycle, interval 1 must be kept to within 10 
or 15 ns, and interval 2 must be as short as possible. When mounted o n the 
same DRAM memory board, the PEG could be used to generate the RAS, 
CAS, and WE signals. 
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Fig. 6. By connecting two PEGs, de- 
signers can obtain timing waveforms 
that consist of more than 32 events. 

of the two PEG chips are indepen- 
dently programmable. 

If the waveforms must be nested, 
one of the output taps of PEG 2 can 
be used to remove the TRIG signal 
from PEG 1. PEGs can also be par- 
alleled for applications in which 32 
event states are sufficient but more 
than 12 output channels are needed. 
For example, when 12 to 24 channels 
are needed, the designer can simply 
drive two PEG chips with the same 
input signals and use as many of the 
output lines as necessary. D 



Monollihio 



m 



Memories 



2-647 



Wait-State Remover Improves 
System Performance 



The trends toward faster micropro- 
cessors and denser dynamic ran- 
dom-access memories have 
spariced a conflict. New-generation mi- 
croprocessors require machine cycle 
times of 150 to 250 ns (Table 1). But 
traditionally as DRAM density increases, 
cycle times slow. Needed is a way to 
combine high performance with zero- 
wait-state operation. 

Although hardware-intensive cache 
schemes can work, they are very costly. 
It is possible, however, to improve sys- 
tem performance with little additional 
hardware by substituting enhanced- 
page- and static-column-mode cycles for 
regular memory cycles, thus cutting 
down the number of wait states. The 
resulting system would be especially 
useful for small systems and subsystems 
with space restrictions. 

Our design incorporates a DRAM such 
as the Am90C256, a CMOS device that 
offers uninterrupted serial or random ac- 
cess to 512 bits of data, or the 
Am90C257, which combines dynamic 
storage and static decode to nearly dou- 
ble sequential access rates. These de- 
vices provide extended-page and static- 
column access modes at a small increase 
in cost. These modes permit data access 
times of 40 to 60 ns, enabling the mem- 
ory controller to remove wait states for 
several processor cycles. 

The only additional hardware is an 
AmPAL22V10, a 24-pin programmable 
logic device that offers an AND/OR logic 
structure for custom programming, 22 
inputs, and 10 outputs with 12 product 
terms each to provide timing and arbitra- 
tion for extended-page-mode and refresh 
cycles. Also needed are two devices to 
latch and compare row addresses and a 
counter to track the extended-page- 
mode cycles' time out. 

Usually, microprocessors perform a 
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FIGURE 1. Wait state removei arbitration scheme. 



number of continuous accesses in a lo- 
calized address range to fill up the in- 
struction pipeline or to store or retrieve 
data. Our scheme is based upon this 
locality of memory access, somewhat 
similar to cache schemes. It involves 
performing an extended-page-mode (or 
a static-column/ripple-mode) access cy- 
cle, instead of a regular memory cycle. 
The time required for the data to be read 
or written for this cycle is the same as for 
a regular memory cycle. The cycle's 
row address is stored and then compared 
to that of the subsequent cycle. This row 
address would be the same for a local- 



ized access cycle within the same row of 
up to 5 1 2 (for a 256K) or 1 054 (for a 1 M 
DRAM) column addresses. 

If the row address is the same, the 
system performs a fast-access, en- 
hanced-page-mode cycle that requires 
only the column address and no proces- 
sor wait state. If the addresses are differ- 
ent, the current extended-page-mode cy- 
cle is completed, and a fresh cycle is 
performed at the new row address. Com- 
pleting the previous cycle and initiating 
a new one adds 20 to 40 ns to the normal 
cycle, depending on the speed of the 
DRAM controller. Usually, one wait- 
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state processor cycle provides additional 
time of 50 to 125 ns, enough to squeeze 
in this overhead and not cause further 
wait states. 

Because row addresses can be com- 
pared during the same time interval as 
the address decoding and generation of a 
memory request or select, this action 
does not increase cycle time. The result 
is a regular cycle speed with one (or two) 
wait states for random memory accesses 
at different row addresses and fast, no- 
wait-state cycles for localized accesses 
at the same row address. Depending on 
the program, execution speed can in- 
crease 5% to 20%. 

The AmPAL22V10also acts as a tim- 
er and arbiter for the extended-page- 
mode and refresh cycles (Figure 1 and 
Table 2). Its refresh-cycle counter runs 
at 16 |xs. But the DRAM controller does 
not perform a refresh every time it re- 
ceives a request: because the page-mode 
cycle can be extended up to 75 jis to 
allow the most CAS-only fast cycles, 
refreshes at l6-\is intervals would dis- 
rupt operation. Instead, the arbitration 
scheme is designed so that, during an 
extended-page-mode cycle, the memory 
controller counts up to four refresh re- 
quests before performing the refresh 
cycles. 

Description oi Circuit 
And Boolean Logic 

For the sake of clarity, the circuit 
diagram shows a general-purpose arbi- 
tration interface, not a complete DRAM 
controller. It can be either integral to a 
PAL-based DRAM controller or linked to 
a VLSI DRAM controller, depending on 
the user's needs. 

For 256K DRAMs, an Am29843 with 
9-bit latches stores a 9-bit row address. 
The arbiter PAL controls the latch-enable 
signal, LE. A 9-bit Am29809 compares 
the stored row address with that of the 
current cycle directly from the processor 
on address lines A9-A17. If the address- 
es match, the Am29809 sends a MATCH 
signal to the arbiter PAL. The arbiter PAL 
also r eceives the memory-request 
(MREQ) signal from the address-decod- 
ing circuitry (not shown here). 

Of the AmPAL22V10's two counters, 
the refresh counter is free running and 
generates refresh requests (RFCLK) at 



Processor 


Cycle time (ns) 


wait 


1 wait 


2 waits 


8088/8086 (8 MHz) 


500 


625 


750 


8088/8086 (10 MHz) 


400 


500 


600 


80286 (8 MHz) 


250 


375 


500 


80286 (10MHz) 


200 


300 


400 


80386 (8 MHz) 


188 


250 


313 


68020 (16 MHz) 


188 


250 


313 


68020 (20MHz) 


150 


200 


250 



TABLE 1 . Microprocessor cycle times. 



INPUTS 

MREQ 

MATCH 

TOUT 

RFCLK 

CACK 






OUTPUTS 

CYCFUG 

SLOG 

FSTC 

RFSC 

RCTO 

RCT1 




/CYC FLAG: = 


/MREQ * CYCFLAG * RCT1 * RCTO 

+ /CYCFLAG * /TOUT * (RCT1 + RCTO) 

* (MREQ + MATCH) 


; Assert for RAW 
; Unassert based 
; on 3 conditions 


/SLOC: 


/MREQ * CYCFLAG * SLOC * FSTC 

* RFSC • RCT1 * RCTO 

+ /MREQ * /CYCFLAG * /MATCH * SLOC 

* FSTC * RFSC * RCT1 * RCTO 

+ /SLOC * CACK 


; Start extended 
; cycle 

; Row address not 
; matched new 
; extended cycle 
; Hold till complete 


/FSTC: 


/MERQ * /CYCFLAG * MATCH * SLOC 


; Match CAS only 
; fast cycle 


/RCT1 : 


(RCT1 • /RCTO + /RCT1 * RCTO) 

• RFCLK 

+ CT * RFSC 

* (/RCT1 * /RCTO + RCT1 * RCTO) 
+ /RFCLK * (/CT + /RFSC ' /RCT1 


; Count up 
; Count down 
; Hold count 


/RCTO: 


RFCLK • RCTO 

+ CT • RFSC * RCTO 

+ /RFCLK * (/CT + /RFSC) * /RCTO 


; Count up 
; Count down 
; Hold count 


CT: 


/RFSC 





TABLE 2. Listing of equations for arbiter. 



fixed 16-jJLS intervals. The counter for 
the extended-page-mode cycle is con- 
trolled by the arbiter PAL, which starts 
the counter by sending a signal (CYC- 
FLAG). On completion of the 75-|jls tim- 
ing count, the counter sends a time-out 
signal, TOUT, to the arbiter, forcing it to 
complete the current cycle and remove 
the CYCFLAG signal. If the arbiter re- 
quires a new extended-page-mode cycle 



at a different row address before this 
timeout, it resets the counter by remov- 
ing the CYCFLAG signal. So the counter 
starts when CYCFLAG is high and stops 
or lesets when CYCFLAG is low. 

The arbiter's operation is quite sim- 
ple. It uses registers SLOC, FSTC, and 
RFSC as flags to indicate a cycle request 
to the memory controller. SLOC requests 
the initiation of an extended-page-mode 
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cycle. This new cycle continues, even 
ul'ter completion of a data transfer, as 
long as CYCf-'LAG is high. FSTC requests 
the initiation of a fast CAS-only cycle at 
the same row address at which an ex- 
tended-page-mode cycle is in progress, 
meaning that CYCFLAG must be high for 
the FSTC signal to be asserted. 

The third RFSC requests the initiation 
of a refresh cycle. All cycle requests for 
new slow extended-page-niodc cycles, 
fast CAS-only extcndcd-page-mode cy- 
cles, and refresh cy cles are arbitrated on 
the basis of inputs MRFQ and MATCH, 
and status feedbacks of CYCFLAG and 
(RCTO, RCTl ), a counter to keep track of 
number of refresh requests. A new cycle 
request is arbitrated only when no cur- 
rent cycle request is active, indicated by 
SLOC. FSTC, and RFSC being low. 

When CYCFLAG is low, the DRAM 
controller should complete the existing 
extended-page-mode cycle (usually by 
deactivating both RAS and CAS signals to 
the memory). As shown in the Boolean 
equations, CYCFLAG goes low if at least 
one of the following three conditions is 
encountered: 

1 . Hxtended-page-mode cycle timeout 
(TOUT) high. 

2. Refresh count up to four (RCTO= I 
and RCTl = 1). 

3. New cycle at a different row ad- 
dress MR[-Q and MATCH. 

This allows arbitration to begin afresh 
for a new cycle. Because CYCFLAG is 
generated for each new extendcd-page- 
modc cycle, it is also used as a latch- 
enable signal for the 9-bit row-address 
latch. 

The counter made by register RCTO 
and RCTl remembers refresh requests 
made during an extended-page-mode 
cycle. It counts up by one upon receiv- 
ing each RFCLK, to the maximum of 
four; it counts down by one when a 
refresh cycle is executed. An extra regis- 
ter, CT. monitors refresh-cycle execu- 
tion by detecting the rising edge of the 
refresh-cycle request RFSC. CT and RFSC 
low indicate the refresh cycle's comple- 
tion, and this is used to decrement 
counter RCTO. RCTl. The refresh cycles 
are repeated, and no memory cycles are 
allowed, until this counter is reset to 
zero. n 
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PLDs Implement Encoder/ Decoder 
for Disk Drives 



By tisin0 sofhvare to define programmable 
logic devices as run-length -limited encode! 
decode systems, you can design disk-drive 
systems that have 50% more data-storage 
capacity than drives that implement the 
MFM code. 



Arthur Khu and Rudy Sterner, 
Advanced Micro Devices Inc 

When you're designing a disli-drive system, you can 
implement run-length-hmited (RLL) 2,7 encoding/de- 
coding circuitry in your design by using only three 
programmable-logic devices (PLDs) and two shift reg- 
isters. You design the encoder and decoder as state 
machines and use the timing diagrams to determine 
what the timing and control signals must be. 

To create a disk controller with encoding/decoding 
features, you can use three AmPAL22V10 PLDs and a 
disk controller such as the Am9580/Am9582 chip set 
(Fig 1). The Am9580 hard-disk controller and the 
Am9582 disk-data separator perform all the general 
disk-control functions. The PLDs have appropriate 
architectures for implementing the encoding and decod- 
ing state machines. Further, you can reprogram the 
PLDs to implement higher density ratios for data 
encoding, and you can increase your system's speed 
simply by using faster PLDs. 

An RLL code is a code in which the number of zeros 
between ones — the run length — is definite. The "2,7" 



designation means that the code for the binary data 
string has at least two and at most seven zeros separat- 
ing the ones. RLL codes increase the density of data 
stored on a disk by reducing the number of recorded 
pulses (ones) necessary to represent a given amount of 
data. This reduction allows the disk-drive circuitry to 
pack the ones closer together, increasing the amount of 
data on the disk. 

In comparison with the (de facto) industry-standard 
approach, MFM, the RLL 2,7 code increases by 50% 
the amount of data you can store on a disk drive (see 
box, "RLL 2,7 code vs MFM code"). In addition, RLL 
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Fig 1 — A complete disk controller requires only two VLSI ICs, a 
PLD-based encoding/decoding system, and a drive interface. The 
hard-disk controller and the disk-data separator provide generic 
disk-drive control, and the PLDs provide RLL 2,7 encoding and 
decoding, which increases disk storage capacity. 



Reprinted by permission of EDN, September 18, 1986. 



z\ 



Monolithic 



,oM 



Memories 



2-651 



PLDs Implement Encoder/ Decoder for Disk Drives 



TABLE 1 — RLL 2,7 CODING RULES 



DATA 


2,7 CODE 


10 


1000 


11 


0100 


000 


100100 


010 


001000 


011 


000100 


0010 


00001000 


0011 


00100100 



2,7 decoding circuitry recovers quickly from code-de- 
tection errors. 

As Table 1 shows, RLL 2,7 encoding circuitry trans- 
lates seven data strings into 2,7 code. You can break 
any binary non-return-to-zero (NRZ) data string into 
combinations of the seven data strings. To obtain the 



decoded data string, the circuitry matches the 2,7 code 
patterns with the seven 2,7 code strings in Table 1. 

Because 2,7 code strings have variable lengths (they 
can be 2, 3, or 4 bits long), your design will need control 
logic that controls the output from the encoder/decoder 
as translation takes place. Encoding and decoding state 
machines (Figs 2 and 3) implement this control logic 
from the code in Table 1 (see box, "Convert RLL 2,7 
code to a state machine"). The encoding state machine 
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Fig 2 — The encoder state machine (a) dencrihes the tmnslatioii of input data into RLL 2,7 code. The first two cycles in the encoding of the 
data string 0010! 100 (b) demonstrate hoic the encoder determines the correct code by examining both the bit to be encoded and the next two 
(look--ahead) bits. 
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produces two 2,7 code bits for each data bit received. 
The decoding state machine, on the other hand, pro- 
duces one data bit for every two bits of 2,7 code. The 
clocking scheme in these encoding/decoding state ma- 
chines is simpler than the familiar table-look-up meth- 



During decoding, circuitry matches the 
RLL 2,7 code patterns with the seven RLL 
2,7 code strin£is to obtain the corresponding 
decoded data strinjj. 



od, which requires suspended operation during encod- 
ing and decoding. 

You can implement both of these encoding/decoding 
2,7 state machines with one PLD device (ICi) and two 
shift registers (INREG and OUTREG) (Fig 4). To 
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Fig 3 — This decoder state machine (a) represents the translation of RLL 2, 7 code into an output data stream. To determine the correct output 
data, the decoder examines four code bits— two bits to be decoded and two look-ahead bits, as shown in the example in b. 
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Because the RLL 2,7 code contains tivo bits 
for every one bit of data, the timing cir- 
cuitry divides the clock signal for the coded 
data, producing a data clock. 



synchronize the encoder/decoder with the hard-disk 
controller, you use two other PLDs (IC2 and IC3) to 
provide clock-generation and address-mark-control 
logic. ICi is a AnnPAL22V10, which has sufficient 
capacity to implement the two state machines. IC2, also 
an AmPAL22V10, utilizes the PLD's large capacity and 
programmable output cells to implement the address- 
mark-control circuitry. An AmPAL16HD8 (IC.i), which 
is sufficient for implementing the clock circuitry and the 
random logic, completes the design. 

To understand the state-machine implementation of 
the RLL 2,7 encoder, consider the state machine in Fig 



2. The encoding state machine begins in state zero; the 
first four bits of the data to be encoded are in the shift 
register INREG. For the data stream in the figure, the 
encoder, beginning in the first cycle (To) reads the first 
bit in the stream as and sees that the next two bits 
(the look-ahead bits) in INREG are and 1, respective- 
ly. According to the state diagram, the encoder pro- 
duces a 00 output because, as Table 1 shows, input data 
starting with 001 translates to a character string that 
starts with 00. The encoder then enters state 2, shifts 
the encoded out of INREG, and shifts in the next 
(fifth) bit of the data to be encoded. 



RLL 2,7 code vs MFM code 

Although the RLL 2,7 and MFM 
coding methods can both in- 
crease a disk drive's capacity, 
RLL 2,7 code is more compact. 
A disk drive that implements 
the RLL 2,7 code can, therefore, 
store 50% more data than can a 
drive that implements the MFM 
code. 

On the magnetic medium in 
the disk drive (hard-disk or flop- 
py-disk drives), binary data ap- 
pears as a change in flux (repre- 
senting a one) or as no change in 
flux (representing a zero). Be- 
cause the disk density is limited 
by the minimum distance be- 
tween flux transitions, the maxi- 
mum data density depends on 
how the data is encoded. 

MFM is an RLL code with the 
designation 1,3;1,2;1. RLL 2,7 
code (its full designation is 
2,7;1,2;3) allows a minimum of 
two zeros between each one, and 
MFM code allows a minimum of 
one zero. RLL 2,7 code can 
store as much as 50% more data 
in a given number of flux 
changes than can MFM code; 
therefore, RLL 2,7 code can 
store as much as 50% more data 
on a given section of magnetic 
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Fig A — Data storoge that's 50% more dense is the principal advantage of RLL 2,7 code 
over the de facto standard MFM code. Because it needs fewer transitions to describe 
data, the RLL 2,7 code takes up only 67% of the disk storage space that MFM code 
occupies. 



medium. 

Consider the example in Fig 
A. The data stream 1100110011 
is represented by eight transi- 
tions in MFM code, and by five 
transitions in RLL 2,7 code. 
When the disk drive uses the 
minimum distance between tran- 
sitions to record the RLL 2,7 



code, the RLL 2,7 code takes 
67% of the space that the MFM 
code takes, so it has space avail- 
able to store 50% more data. In 
most applications, the actual 
storage increase is between 35% 
and 40%, because some disk 
space is reserved for sector and 
data-field markers. 
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In the second cycle (Ti) of the encoding process, the 
encoder sees that the data bit is and the two look- 
ahead bits are 1 and 0, respectively. According to the 
state diagram, when the encoder is in state 2 and sees 
010, its output is 00. As before, the encoder then moves 
to the next state (in this case, state 0) and shifts a new 
bit into INREG. The rest of the encoding process 
proceeds similarly. 

The decoding process is similar to the encoding 
process. The decoder, which is described as a state 
machine (Fig 3), accepts two input bits in RLL 2,7 
format and sees the next two coded bits as look-ahead 
bits. In contrast to the encoder, the decoder produces 
one output bit for every two input bits. 

To implement this encoder and decoder circuitry with 



PLDs, you can use PLD-design tools such as CUPL 
from Personal CAD Systems Inc (San Jose, CA) and 
Abel from Data I/O Corp (Redmond, WA). These 
software tools include syntaxes that you can use to 
describe state machines as well as general logic equa- 
tions. 

To control the rate at which ICi (in Fig 4) receives 
data and code, IC2 and IC:j implement the timing signals 
shown in Fig 5. Three signals (Read, Write, and 
Code_Clock) from the hard-disk controller and disk- 
data separator form the basis of the timing signals. 

First, the timing circuitry produces a clock signal, 
Rd_Clk, from the Code_Clock signal that originates at 
the disk-data separator's FDDAM output. Because the 
RLL 2,7 code contains two bits for every one bit of 
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Before the disk-drive system can decode data 
on the disk, it must synchronize itself with 
the disk's data clock to find out when the 
data bits begin. 



data, the timing circuitry divides the clock signal for 
the coded data (Code_Clock), producing a data clock 
signal, Rd_Clk. The timing circuitry then uses Code- 
Clock and Rd_Clk to control the timing of shift- 
register control signals Shift—Out and Shift_In. 

To obtain the equations you need to program the 
PLDs, examine the timing diagrams. The timing dia- 
grams show that the state-machine design requires 
only a few liming signals to implement the controller. 



Because the timing circuitry loads and shifts data 
produced by ICi at the points indicated on the timing 
diagram in Fig 5, the only clock signal that ICi requires 
to code or decode data in INREG is the Rd_Clk signal. 
To control the transfer of data and code, the Shift_In 
signal latches data or code into INREG, and the 
Shift-Out signal controls the output of OUTREG. For 
example, when encoding data, the encoder produces 
two bits of code on each rising edge of Rd-Clk. Because 



Convert RLL 2,7 code to a state machine 



You can use a simple algorithmic 
procedure to convert a code 
table to an encoding state ma- 
chine. For each row of the code 
table, you create a simple 2- 
state expression for the conver- 
sion of the data into the code. 
Then you combine the expres- 
sions into one state machine for 
the entire table. 

For the code in Table 1 of the 
accompanying article, you con- 
sider each data bit and the asso- 
ciated pair of bits of RLL 2,7 
code. For example, the first row 
of the table contains two data 
bits: one, which is associated 
with the code 10, and zero, 
which is associated with the 
code 00. The state machine for 
this row begins in state zero 
(which is arbitrarily assigned) 
and changes state when it sees 
that the first data bit is a one 
and the next bit (the look-ahead 
bit) is a zero. This change of 
state, which appears graphically 
in Fig Aa, results in the out- 
put 10. 

Note that the state machine 
must evaluate the look-ahead 
bit. If this bit is a one instead of 
a zero, the input data will be 11, 
corresponding to the second row 
of the table, so your state ma- 
chine must generate a 01. For 




1^1) \ 0OR1 



RLL 2.7 CODE 



(b) DATA ROW 10 IS 
COMPLETELY COOED 
BY THIS STATE 
MACHINE 




(d) COMBINE THE STATE 
MACHINES FOR DATA 
ROWS 10 AND 11 



Fig A — Creating a state machine from a code table is a 2-part process. First, you 
create a simple state machine for each row in the table, and then you combine the state 
Tnachines into one state machine for the whole table. 



this code table, no more than 
two look-ahead bits are neces- 
sary for encoding any data bit. 

If the state machine is in state 
one, and the input data is zero, 
the state machine produces a 00 
output and returns to state zero 
(Fig Ab.) Note that the data bit 
encoded in this state is the look- 
ahead bit from the previous 
state and that no look-ahead bit 
is necessary for encoding be- 
cause the zero is the last bit in 
data row 10. 



You use this procedure to cre- 
ate state machines for the other 
table rows as well. Fig Ac, for 
example, contains the state ma- 
chine for data row 11. Because 
all the state machines have the 
same beginning state (state 
zero), you can combine them to 
form the complete encoding 
state machine. Furthermore, the 
state machines may share other 
states, as shown in the combina- 
tion of the two data rows 
(Fig Ad). 
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the Load sij,nial is high at the same time that R(LClk is 
high, the bits are loaded into the shift register. Because 
the frecjuency of Shift_Out is twice that of RcLCii<, 
Shift_Out shifts both encoded bits out before the next 
encoded bits are loaded. Shift_In jjresents the next 
(lata bit to the encoder at the same time that the second 
encoded bit is shifted out, starting the next encode 
cycle. 



Before the disk-drive system can decode data on the 
disk, it must synchronize itself with the data clock from 
the disk and find out when the data bits begin. To assist 
the circuitry in synchronization and initialization, you 
can place synchronization signals, as well as a marker 
indicating the beginning of data, at the beginning of the 
RLL code. When the RLL circuitry reads these pat- 
terns, it's ready to decode RLL data. 
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LOGIC EQUATIONS: 

RD. CLK .= RD.CLK (A REGISTERED SIGNAL FROM ICj) 
SHIFT. IN = REAO-'CODE. CLOCK *WRITE'RD. CLK 
SHIFT. OUT = READ- RD.CLK* WRITE'CODE_ CLOCK 
LOAD = READ*WRITE'RD.CLK 

CODE . CLOCK : CODE-RATE CLOCK SIGNAL FROM FDDAM PIN 
ON Am9582 DISK-DATA SEPARATOR 
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Fig 5 — The timing signals from IC2 and IC3 control the loading and shifting of code and data in the INREG and Ol'TREG irgislcrs. Xotc 
that the cluck and shifting signals for the coded bits have twice the frequency of those for the data bits, a situation that corrcsfmnds to the J: I 
density ratio between code and data. 
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If the decoder detects any pattern before it 
detects the address marker, the circuit resets 
itself and begins the initialization sequence 
anew. 
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Fig 6 — The ability to recover from an error is a useful feature of 
RLL 2,7 code. An error at cycle Ei results in incorrectly translated 
data until the machine recovers fully. 

The synchronization field comprises a series of pat- 
terns that allow the phase-locked loop in the disk-data 
separator to synchronize to the frequency of the incom- 
ing data. These patterns represent the maximum fre- 
quency input. For RLL 2,7 code, that input is 100100, 
because this code has the fewest permissible zeros 
between ones. Using the highest possible frequencies 
minimizes the synchronization time, so the patterns are 
completed quickly. 

A marker that follows the synchronization field indi- 
cates the beginning of data. This marker, the address 
mark, must be distinct from all other code words. For 
example, you can use a pattern that violates the code 
rules, such as a string of zeros (this string is called "dc 
erase" because it removes all flux transitions from the 
recording medium). Alternatively, you can use a pat- 
tern that obeys the code rules but is not a defined code 
pattern. In the examples in this article, the pattern 
00000100 identifies the beginning of data. Although this 
pattern is not a defined code word, it doesn't violate the 
RLL 2,7 code rules as long as it's preceded by no more 
than two zeros in a row. 

When the 100100 pattern is synchronizing the circuit- 
ry, the circuitry produces the Rd_Clk signal, which is 
in phase with the pattern. In order for the decoder to 
operate properly, the Rd_Clk signal must rise with the 
first one in the pattern and fall with the second one. 
However, because of the repetitive nature of the pat- 
tern, the circuitry could produce a falling edge of 
Rd_Clk on the first one in the pattern and a rising edge 
on the second one, in which case the circuitry would not 



be in synchronization with the beginning of the data. 

The encoding circuitry resolves this synchronization 
problem by inserting the pattern "0100" eight times 
between the synchronization field and the address 
mark. The phase-locked-loop system locks onto the 
0100 pattern, and the Rd_Clk control circuitry in IC2 
sets itself to decode the data correctly. 

To put the synchronization and marker data into the 
RLL 2,7 code on the disk, you must design the RLL 2,7 
encoder to produce the signals. When the signal WG 
(from the hard-disk controller IC3) goes high, the data is 
encoded to all zeros. The encoder translates the zeros 
as the synchronization pattern 100100, which is then 
stored on disk. 

Next, the address-mark-control (AMC) signal from 
the hard-disk controller sets an internal latch in IC2, 
producing the output Am_Latch, which forces IC3 to 
put ones in the INREG register. The encoding circuitry 
codes the ones as the string 0100. After the eighth 0100 
pattern, IC2 sets the Complt signal high. On sensing 
Complt, ICi writes the address mark. 

The encoding circuitry must have the first four data 
bits in INREG by the time the address mark is written. 
ICi writes two patterns for the address mark. At the 
start of the second address-mark pattern, ICi sets AMF 
(Address Mark Found) High and resets Am_Latch low. 
The assertion of AMF signals the hard-disk controller 
to begin shifting data into INREG for encoding. Be- 
cause four Rd_Clk cycles occur while the address-mark 
pattern is being written, the first four bits are shifted 
into INREG by the time the second address mark is 
written. 

The decoder circuit has two safeguards that prevent 
it from identifying the address-mark pattern incorrect- 
ly. First, the decoder must detect at least five 0100 
patterns before it acknowledges the address-mark pat- 
terns. Second, if it detects any pattern before it detects 
the address mark, the circuit resets itself and begins 
the initialization sequence. 

IC2 implements these two safeguards by monitoring 
the RLL code that ICi decodes. When the hard-disk 
controller asserts the RG and AMC signals, IC2 sets 
Complt and Am_Latch low. IC2 sets Complt high when 
it detects the fifth consecutive 0100 pattern, enabling 
ICi to detect the address mark. If the input to ICi is 
anything other than a 0100 pattern or an address mark, 
ICz sets Complt low, and the initialization begins anew. 

If ICi successfully detects the address-mark pattern, 
it sets AMF high. The assertion of AMF causes IC2 to 
set Am_Latch high, thus enabling the output of OUT- 
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REG to send decoded data to the hard-disk controller. 
f\.m_Latch remains high as long as RG is high. 

RLL 2,7 encoding also provides for error recovery. 
Wienever a disk-drive system reads code from a disk, 
:he read/write heads or the transmission cables can 
:ause transmission errors. One of the properties of 
RLL 2,7 code is that any single-bit error (for example, a 
;oded one detected as a zero) will correct itself after a 
run of at most 16 correctly detected bits. 

In short, whenever a 2-bit pattern doesn't match any 
of the expected patterns for a particular state of the 
decoding circuitry, the decoding state machine returns 
to state zero and generates a zero as a translation for 
the erroneous code bits. Then the decoder shifts the 
next two code bits into the INREG register and contin- 
ues decoding from state zero. 

In such cases, the decoding state machine can cor- 
rectly decode coded data, but it may not recover 
immediately upon returning to state zero. As Fig 6 
shows, although the code bits that the decoder detects 
may be valid, the decoded data is incorrect because the 
error has forced that state machine into the wrong 
state. In this example, the decoder doesn't recover until 
6 code bits later (in cycle E6), when it again falls into 
the correct state and accurately decodes the data. EDN 
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Mixing Data Paths Expands 
Options in System Design 



Chip designers are creating powerful CPUs and peripherals 
with 16- and 32-bit parts. Mixing these with 8-bit parts 
overcomes limitations imposed by established designs, 
incomplete families, and software incompatibility. 



by Mark S. Young and 
James R. Williamson 



Integrating 16- and 32-bit peripherals and CPUs 
into 8-bit designs, at the simplest level, means 
separating the control and data paths from new 
peripherals and the systems. Mixing different data 
path widths and control protocols, however, makes 
possible major improvements in function, perfor- 
mance, and cost. 

The price/performance curve of VLSI chips, for 
example, allows designers to obtain more and bet- 
ter functions for the same amount of money every 
year. Alternately, the functionality of a device can 
remain constant while the price fails. 

Moreover, these new devices with wider data paths 
can extend the life of older designs. For example, 
many of the most popular personal computers today 
use the 8088 microprocessor and, therefore, are con- 
strained to an 8-bit data path. Designers of add-on 
accessories for these personal computers prefer the 
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newer 16-bit peripherals. These peripherals will let 
users preserve their software investments, improve 
performance, and stave off obsolescence. 

Mixing different data path widths can also enhance 
new designs. For example, it is less expensive to use 
an 8-bit bus in a new design because the memory 
requirements are generally cheaper. Only half as 
many dynamic RAMs are necessary for the same 
number of kilobytes of memory. In addition, an 
8-bit bus needs much less control and support logic. 
Designers can mix smaller data path peripherals with 
wider data path CPUs. This allows them to introduce 
systems based on the newer, more powerful 32-bit 
CPUs even before 32-bit peripherals are available. 

Designers can use this mixing method to obtain 
wider data paths from existing designs until a new 
system design is warranted. They can also use parts 
in unexpected applications. For example, cost- 
conscious terminal manufacturers might want to use 
the Am8052/8152A chip set (the 8052 is an advanced 
CRT controller and the 8152A is a video system con- 
troller) in new terminals based on the relatively 
inexpensive 8051 microprocessor. Mixing the 8-bit, 
single-chip microprocessor with the 16-bit CRT con- 
troller allows designers to maximize the cost/perfor- 
mance ratio of the terminal. 

Mixed data path widths can improve bus utiliza- 
tion as well. A 16-bit peripheral in a 32-bit system 
only occupies half the data bus for data transfers. 
If the designer mixes the data paths correctly, how- 
ever, the 16-bit peripheral could transfer data as 
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32-bit chunks and improve bus efficiency by 100 per- 
cent for that peripheral. 

Two central concerns stem from mixing devices 
that communicate over different-sized buses. The 
first problem results when two devices communicate 
on a "common" data bus. Consider, for example, 
a 32-bit system utilizing 8- and 16-bit peripherals. 
Overcoming the mismatched data paths requires 
some form of controlled multiplexing/demultiplexing 
of the different data paths. In addition, extra con- 
trol signals for partitioning the 32-bit word into 8-, 
16-, and 32-bit chunks may be required. 

Many 16-bit CPU-based systems that use 8-bit 
peripherals normally use just the lower 8 bits of the 
data bus to transfer data to and from the peripheral. 
This method does not work in systems using 16-bit 
peripherals and 8-bit CPUs, however, and it tends 
to break down in systems with 8-bit peripherals hav- 
ing bus master capability. 

A bus multiplexing method involves multiple 
transfers when taking data from or adding data to 
a mismatched data bus. For example, before a 16-bit 
peripheral can transfer data over an 8-bit bus, the 
16-bit data must be divided into two 8-bit chunks. 
It is then transferred sequentially. First, the lower 
8 bits are transferred out on the bus. Then, in the 
next transfer cycle, the upper 8 bits of the 16-bit 
word are sent out. The major difference in the oppo- 
site case — a bus read operation from an 8-bit bus 
to a 16-bit device — is that the first byte read from 
the system must be latched. Once the second byte 
has been fetched, the 16-bit peripheral reads in the 
assembled 16-bit (2-byte) word. Additional provi- 
sions may be needed when the 16-bit peripheral only 
wants to access a single byte. 



The other major problem in mixed data path 
transfers is the actual data read/write operation. The 
nature of the multiple transfer forces designers to 
guarantee that the stretched transfer will occur and 
that it will not be interrupted. Two aspects of stretch- 
ing the transfer cycle from or to tiie peripheral illus- 
trate the complexity of this problem. 

The first case, when the peripheral is the bus 
master, is the simplest. A 16-bit peripheral holds its 
data available for what normally would be two com- 
plete bus transfer cycles. This function can be per- 
formed when the transfer acknowledge signal to the 
peripheral is delayed. If the data was latched instead 
of holding the peripheral in a multiple word transfer, 
however, the device could try to send the next 16-bit 
data word and its "new" address. The procedure 
of latching the data and releasing the peripheral 
should not be used, therefore, because it may inter- 
fere with the addressing of the remaining (pending) 
8-bit transfer. 

Whenever a device acts as a bus slave to a CPU 
that cannot access the device's natural word width 
in a single operation, a different constraint appears. 
The sequence must be set up so the peripheral cannot 
obtain the bus while the CPU is in the middle of a 
slave read/write operation. In a typical system, the 
CPU is the last device in the interrupt queue. It is 
possible for the peripheral to become bus master 
between the first and second read operations and in- 
validate the results of the first read operation in a 
realtime system. This is because an 8-bit CPU would 
have to perform two consecutive read operations to 
examine a 16-bit peripheral control register. 

This function can be handled two different ways. 
If the CPU has a bus lock instruction, as in iheiAP.X 
family of CPUs, the programmer must use one of 
these instructions before the CPU accesses the 
peripheral. Alternately, the CPU needs to disable 
the arbitration logic while it is performing the unin- 
terruptible access with the !6-bit peripheral. 

Crucial cycle 

The uninterruptible word transfer cycle is crucial 
for maintaining Ihe integrity of the data transferred. 
When either the CPU or a peripheral on the bus 
makes an access using the 8/16-bit control logic, it 
must complete the larger device's word access before 
relinquishing the bus. If this requirement is not met, 
a transfer's integrity can be violated easily by some 
other device. This interrupts the transfer, and cor- 
rupts or aborts the multiplexing sequence. 

To illustrate this point, consider a system consist- 
ing of an 8-bit CPU and several 8- and 16-bit periph- 
erals. Assume one of the peripherals is executing a 
block transfer of 16-bit data onto the 8-bit bus. If 
the CPU interrupted the transfer in order to poll the 
peripheral during a half-word transfer, two undesir- 
able events would occur. Either the multiplexing 
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sequence would be damaged irreparably when the 
CPU polled the peripheral, or the CPU would read 
garbage from the peripheral. 

Designing the control interface to allow mixing 
of 8- and 16-bit peripherals requires attention to the 
data and control flow. During a write dperation, the 
data is written out sequentially: the lower byte comes 
before the upper byte (or vice versa). The read oper- 
ation differs only because the data bus is 8 bits and 
because it forgets the last byte transferred; it knows 
the current byte only. Hence, the interface requires 
that one Of the bytes be latched until the full 16-bit 
word has been assembled. 

The slave mode of operation works almost the 
same as the peripheral bus master mode. The single 
exception is the slave write operation. When the 
interface is defined, the designer must make a con- 
scious choice about which byte (upper or lower) to 
latch during peripheral read operations (or con- 
versely, slave peripheral write operations). Once this 
decision has been made, the CPU must always access 
the latched data byte firsj (during a slave write) and 
then access the non-latched byte to complete the 
transfer. This restriction is minor, requiring no extra 
software overhead, it could affect the ease of the 
programmer's coding if not handled properly, how- 
ever. For example, if the programmer used a com- 
piler to generate the software for the system, extra 
care may be necessary to ensure the compiler gener- 
ates the correct addressing sequence. 

An alternative solution would be to latch both the 
upper and lower data bytes, in this case, however, 
the cost of the interface would increase, as would 
the complexity, with no appreciable gain. The con- 
trol flow in these designs derives from two differ- 



ent sources: the state control flow itself and the 16-bit 
peripheral interfacing with the 8-bit bus. A state dia- 
gram can be used to specify how uninterrupted word 
transfers will occur and how the upper and lower 
byte address is generated. 

In addition, the specific bus timing of the periph- 
eral and the data bus must be examined to quantify 
the state control flow. These timing specifics also 
provide information on data latching, read/write 
control strobes, and addressing to and from the 
peripheral. The state control flow is divided into four 
operations: bus master read, bus master write, slave 
read, and slave write. 

F^or a bus master read/write operation from a 
16-bit peripheral device operating on an 8-bit bus, 
four control signals must be generated by the 
8/16-bit comrol unit: address bit 0(A0), peri pher al 
hold (WAIT), bus read (RD), and bus write (WR). 
The AO line is generated by the 8/16-bit control logic 
to indicate which byte is to be transferred in bus 
master modes only. Otherwise, the AO generated by 
the system is u^ed to indicate which byte is being 
accessed. The WAIT Ime ho lds up the peripheral 
during transfers. The RD and WR lines are required 
to indicate successive transfer cycles on the bus. 

Hidden transfers 

The peripheral's signals will only strobe active 
once because it does not know that two transfers are 
being executed. The slave transfer flows are almost 
identical, except the CPU is generating the bus sig- 
nals and the transfer directions are reversed (ie, a 
bus write goes into the peripheral). 

For this 16- to 8-bit data flow example, the data 
on the upper byte only needs to be latched when data 
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The 16-to 32-bIt conversation logic diagram indicates 
tlie complexity of the bus and tunnel logic control, it 
must convert between difTerent signal conventions 
and polarities as well as generate extra functions and 
bus arbitration control signals. 



is being read (as bus master) or written (as a bus 
slave). An interfaee to handle this operation needs 
lo lateh data eoming from the 8-bit data bus into 
the peripheral, it also needs to aet as transeeiver 
when the peripheral is sending data out to the system. 
A deviee with a eloeked, iri-state output that has an 
8-bit wide latch in one direction and a tri-state trans- 
ceiver in the other direction would be ideal for 
accomplishing such an interface. 

The Am2952 8-bit bidirectional I/O port provides 
a good enough match to the logic and allows the 
upper data bus latch and upper data transceiver chips 
to be combined on one IC. It provides two 8-bit 
clocked I/O ports, each with tri-state output con- 
trols and individual clocks and latch enables. An 
Am2949 bidirectional bus transceiver completes the 
.logic required for the data path function. 

The state flow control requires logic that can move 
sequentially from state to state, hold in a particular 
state, and be reset or initialized back to a predefined 
state. Depending on the number of states required 
(generally less than 16 distinct slates for a design of 
this complexity), a 3- or 4-bit counter should be able 
to solve the problem nicely. 

Considerable bus control logic is required to gen- 
erate the data path flow logic and the bus control 
signals. This is especially true if the peripherals and 
CPUs use diffe rent signal conventions (eg, when AS, 
D.S. and R/W use address latch enable, RD, and 
WR). Conversion from one signal convention to 



aiu)ther, changes in signal polarity, and provision 
for extra functions (such as generating AO) require 
a lot i)f logic synthesis ability. If the peripheral has 
bus master capability, such additional ii\formation 
as bus arbitration controls must be fed into the ne.\t 
state deiermination logic in order lo decide what con- 
trol sequence to follow. 

Customized interface minimizes cost 

An 8/16-bit control interface between the AmKO.'iZ 
CR r controller and an 8-bit CPU provides a good 
example o\' how customi/ing a general interface can 
reduce costs. (The CRT controller is designed with 
a 16-bil data interface.) The onboard DMA unit 
fetches data from sysiem memory and the CPU polls 
the C"R'r coniroller's internal status and control 
registers. Because the CRT controller does not 
modify system memory, however, a bus master write 
operation is unnecessary. Thus, there is no reason 
to generate a sysiem write control signal (WR). 

In addition, the control and display information 
must be aligned on word boundaries. This require- 
ment relieves the8/l6-bil control logic from funneling 
the bytes and performing odd/even byte iransfers. 
It also saves control inputs from the CR T controller 
because all iransfers are words; that is, no need 
exists for upper and lower data strobes or byte high 
enable inputs. 

The bus master read operations are standard I6-bii 
data transfers divided into two K-bit iransfers. The 
CPU's slave accesses are either pointer writes (to 
select the desired control/status register) or 16-bil 
data read/write operations. (Pointer write operations 
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arc actually 8-bit operations because only the lower 
8 bits of the data t'oriii the register address.) The bus 
master read operation can be represented by a state 
flow diagram or a timing diagram. Conceptually, 
state How diagrams are easier to understand, but tim- 
ing diagrams usually convey more information. 
Other state flow diagrams can be derived directly 
from the timing diagrams of the CRT controller to 
8-bil interface. 

Simplifications allow synthesis on one device 

Two special conditions must be met in the state 
machine implemented in the 8/16 interface. I'irst, 
before a new transfer cycle is attempted (when the 
state machine is waiting in the initial state, SO), 
memory acknowledge (MRDY) must be inactive. 
This prevents interference from the last transfer. 

The second special condition occurs when the 
CRT controller asserts the R/W line to indicate a 
write operation. Although the CRT controller does 
not write data into system memory, when it updates 
the upper 8 bits of the 24-bit address latch the R/VV 
line indicates a write operation (in conjunction with 
AS). The CRT controller is nol actually performing 
a system data write, only an address latch update. 
The state machine, therefore, must not start a bus 
sequence if the R/W line is held active low by the 
CRT controller during a bus master operation. 

These simplifications in design allow the CRT con- 
troller to 8-bit CPU control interface to be synthe- 
sized in a single Aml'A1.22Vl() programmable logic 
array device, in addition, the bus control signals are 
converted from AS, 1)S, and R/W to Rl) and WR. 
The minimum CR! controller and bus contriil sig- 
nals that must be generated are Rl), AO, l)S, and 
R/W. Although the CRT eonlroller uses l)S and 
R/W as inputs during a bus master operation, the 



PAl, device must convert the CPU RD and WR sig- 
nals to DS and t/W for slave I/O operations. 

The signals AO and RD are generated by the con- 
trol logic when the C RT controller is performing a 
read access to system. The WAIT (or not READY) 
signal to the CRT controller must also be generated 
by the control logic. The data flow controls require 
six additional controls to load and strobe the latch, 
and to enable transceivers to pass data to and Irom 
the 8-bit bus. Theoretically, 4 more bits (outputs) 
are required to represent all the control states needed 
to manipulate the 8/16-bit control logic. This means 
the design appears to need 14 output logic units in 
a PAL device to perform the required task. 

Reducing the 14 output cells to the 10 cells avail- 
able in the PAI. device requires a closer look at the 
timing and output switching functions. The .AO and 
RD control lines are in effect part of the system bus 
control and, therefore, cannot be multiplexed easily. 
The DS and R/W lines to the CRT controller are 
also fixed because they must be valid throughout the 
entire transfer cycle as well. 

This leaves 6 of the 10 output logic cells of the 
PAI, device to represent the remaining 10 identified 
control lines. This method of minimi/ation involves 
careful stale synthesis, analysis of the signal switch- 
ing functions during the transfers, and utili/'ation 
of several control pins on the CRT controller. Hv 
using the liRI-iQ, liACKI, BACKO, CS, and C D 
inputs to the PAI, device, we can reduce the nuni- 
ber of unique states required to 8 instead of 15. This 
reduces the number of logic cells required for the 
state machine from 4 to 3 bits. 

At this stage, the design requires seven control sig- 
nals to manipulaie the data transfer registers and 
WAI I line. Ihe two latch enables (Cl-s and CDr) 
on the Am2952 bidirectional I/O port can be 
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permanently enabled. By controlling the clock signal 
to the latches, the controls required for three pins 
can be reduced to one. The interface control state 
machine will only use the correct side of the dual 
latches on the bidirectional I/O port'. 

The Am8052 CRT controller helps considerably 
with its own control bus interface. Two signals 
provided by the CRT controller, TBtN and RBEN, 
switch the data transceivers in the correct direction 
regardless of the type of data transfer (as a bus 
master or bus slave). When the controller is a bus 
master performing a read operation, or when it is 
a bus slave undergoing a write operation, therefore, 
the RBEN signal is strobed to obtain the correct 
polarity. By using this line, two of the remaining 
six control lines can be eliminated (REN on the 
Am2949 and OEas on the Am2952). Although the 
TBEN line performs a similar function, it does not 
function correctly in a 16- to 8-bit multiplexed 
bus environment. 

Two of the remaining control lines (OE,\s on the 
Am2952 and 10 on the bidirectional bus transceiver) 
must be generated by individual cells in the PAL 
device. The two clock enables on the Am2952 are 
permanently enabled. The two Am2952 clocks are 
tied together to minimize the amount of logic re- 
quired in the PAL device used to generate clock 
strobes to the latches. 

This leaves the design with three logic cells and 
four output functions (the WAIT line to the CRT 
controller and the 3 state bits). Careful analvsis of 



the slate i'lows and timing diagrams indicates that 
the WAI I' line is only asserted in 4 of the 8 states. 
A clever assignment of slate numbers to the state 
flow sequence allows the WAI I line to be absorbed 
into the 3 state encoding bits. The logic equations 
for the AmPAL22VI() device can be derived directly 
from the timing diagrams. 

An unusual problem might occur when a periph- 
eral device operates as a bus slave on a smaller data 
bus, such as a 16-bit peripheral to 8-bii CPU. During 
the first slave write operation, the chip select CS is 
enabled by the bus master making the access. No 
actual data — just the data latch — is strobed into the 
peripheral, however. After the first byte of data has 
been written, the second access causes the full 16-bit 
data to be strobed into the peripheral. 

If the designer is using a common CS function to 
both the peripheral and the 8/16-bit control logic, 
the controller logic must be designed not to glitch 
or strobe any of the control lines to the peripheral 
(it must prevent DS, R/W from being enabled, for 
example). For some peripheral devices, glitches on 
the control lines might cause the register to be written 
accidentally onto a register that will be overwritten 
in the next write cycle anyway. With other periph- 
erals this might be a catastrophic event. Many 
devices acting as bus slaves have write recovery time 
requirements (ie, a certain minimum interval between 
consecutive write operations). Glitches on the con- 
trol lines might force the next (and final) write oper- 
ation to be delayed — or cause a violation of the 
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device specifications. Glitches might evade any spe- 
cial addressing/register accessing scheme used in liie 
peripheral. This might occur, tor example, if the 
slave device requires the user to write the address 
of the register that was accessed immediati.'ly before 
the register was written. In this case, glitches or use- 
less control strobes could wreck the secjuence. 

The problem can also be solved by using two lines. 
In this solution, one of the lines would go to the 
peripheral device and the other would connect to (he 
8/16-bit controller. The chip select to the peripheral 
is activated each time a slave read occurs (for both 
upper and lower byte accesses), or when a slave write 
operation occurs and the unlatched 8-bit data is 
being written. The chip select function to the 8/16-bit 
controller is chosen each time the peripheral is 
selected normally (for slave read/writes on both 
upper and lower 8-bit data transfers). This problem 
is bypassed completely when two separate chip select 
functions are used: one for loading up the Am2952 
latch during a slave write/read and one to strobe the 
Am8052 controller into action when it is needed by 
the 8-bit CPU. 

Bus conversion maximizes flexibility 

A data bus and control interface to an 8088 8-bit 
microprocessor and Am9516 16-bit DMA controller 
can be created using four devices: an AmPAL22V10 
for the control block, a 74LS161 counter for the state 
sequencer, an Am2952 bidirectional I/O port, and 
an Am2949 bidirectional transceiver. 

This design incorporates certain simplifications. 
The DMA controller '•equires word accesses only 
during command chaining and for slave register ac- 
cesses. The 8/16-bit data transfer interface for bus 
master operations (ie, DMA data transfer functions) 
is handled automatically as a programmable option. 
During slave write operations, (he first byte output 
to the DMA controller must have an odd address 
and the following second byte an even address. C"on- 
versely, during a slave read cycle, the first byte read 
from the DMA controller must be at an even address 
and the second at the next higher odd address. 

I'urthermore, for bus master operations, the sys- 
tem inust use the latched address line AO (l.AO) from 
the AmPA1.22Vl() as its sole AO. IJecausc the logic 
is already available, the system does not have to pro- 
vide this function. l.AO now becomes the system ad- 
dress bit with lull 24-mA drive capability. 

Deciding on a means for conlrolling the lunncling 
of the data stream— that is, tran'-i'orming 16-bii data 
into 8-bit data and vice versa — was (he first step in 
deriving this example. As mentioned earlier, simply 
dividing each I6-bil access into two 8-bil data trans- 
fer cycles presents one way of doing this. On out- 
going accesses (16-bit path from the DMA controller) 
during the first cycle, the upper half of the 16-bii 
path is latched while tiie lower half passes through 
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ALE 
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= 18 
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IF (RESET) THEN ARESETI |; 
This section defines the wigjies when the Ain9516 is bus master 

IF (HLDA) THEN ENA8LEI |; 
IF (/S(2] ' HLDAi THEN BEGIN 
IF (S|ll " /S[01l THEN 

LAO = /CK BW -I- /BW ' AO ■ 

ALE -t- / BW > LAO • /ALE ; 
ELSE 

UO = BW -(■ /BW AO - 

ALE -t- /BW ' LAO ' /ALE ; 
END: 

IF IHLDAI THEN 
(CASE) (S|2:01l 
BEGIN 
1) BEGIN 

RD = /RW • DS 
A = /BW • /RW " /CK 
WR = /BW ■ RW DS 
C = /BW RW 
WAIT = 1 
END: 
21 BEGIN 

RD = /RW - DS 
B = BW 
A = /BW /RW 
WR = /BW ^ RW ' DS 
C = /BW • RW 
WAIT = /BW 
END: 
31 BEGIN 

RD = /RW ' DS ' B 
B = BW ' CK 
A = /BW - RD 
WR = /BW * RW ■ OS 
C = /BW ' RW 
WAIT = BW 
END: 

5) BEGIN 

RD = /RW ' DS 
A = /BW ' /CK 
WAIT = BW 

END: 

6) BEGIN 

RD = /RW * DS : 

A = 'BW 
END: 

71 BEGIN 

RD = /RW • DS 

A = RD 
END: 
END: 

This section defines the wi^els when the 8088 is bus master 

BEGIN 

IA0= AO ■ ALE SEL * LAO ' /ALE ■ SEL 

B = LAO ' WR • SEL 

A = /UO ■ WR ' SEL 

OS = A -f- /LAO • RD ' SEL 

C = /LAO • RD ' SEL 

== UO RD SEL 

END: 
END. 
Ihis IM.PI. file implemenis an inlrrfacr belHorn the 
8-bil 8I)KK and Ihi- l6-hil Ani951h. 
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Programming the PAL and the counter 



In writing the Programming Language for Program- 
mable Logic (PLPL) file to control ttie operation of 
the AmPAL22V10 and the 74LS16i counter, the 
inputs to the PAL device from the counter are as- 
signed SO, S1, and S2, respectively. Then, it is pos- 
sible to apply a "sculptured design" technique to 
the entire timing diagram (see figure in Panel, "A 
matter of timing") by using the Case statement from 
PLPL. By assigning combinatorial equations to only 
one binary partition or column at a time (Case), the 
designer can ignore all other aspects of the design 
for the time being and generate simple equations 
directly from the timing waveforms. 

During clock time T1 of the Am9516's word read 
cycle the state of the 74LS161 (SO, SI , S2) is cleared 
to 000 by the assertion of address latch enable 
(ALE). LAO is the only output control signal from the 
CRT controller asserted during this period. This sig- 
nal is handled as a special case, however. During 
tinid T2 of t he DM A controller's word read cycle, 
the RD and WAIT outputs from the CRT controller 
must be asserted. This time partition corresponds 
to the state inputs S2, SI, S0 = 001. Therefore, the 
first Case equations are 

CASE (S(2;01) 

BEGIN 
1) BEGIN 

RD= /BW'DS ; Transform Control 
; Signals /RW and DS 
; into Intel /RD 



WAIT=1 



Assert Wait 
unconditionally 



Duriri^g time T2 of the DMA controller's byte read 
cycle, A is the only additional output not already 



accounted for in the Case statement. This signal 
allows a byte of data to flow through the bidirec- 
tional bus transceiver into the DMA controller 
during byte read operations. Some additional con- 
straints are placed on this signal, however: it must 
only be asserted in time T2 on byte read operations 
(the BAA/ input) and it must be delayed^by a half 
clock period from the rising edge of T2 (CK signal). 
Thus the Case statement becomes 

CASE (S[2:01) 
BEGIN 
1) BEGIN 

RD= /RWDS ; 

A= /BW/RW/CK ; enable ttie 

; receiver 



END; 

Finally, by examining the last time T2 elements 
(Wf^ and C) during the DMA controller's byte write 
cycle, the remaining terms in Case 1 are derived. 
With the exception of LAO, the remaining equations 
were developed in the same fashion. Clearly, this 
"sculptured" technique is a very simple and 
methodical means for arriving at the Boolean re- 
quirements for a logic block. 

As the PLPL listing shows, the signal LAO was 
handled slightly differently from the previously dis- 
cussed method. The number of product terms gen- 
erated via the Case statement made this approach 
necessary. The number exceeded the upper limit 
(16 terms) for a programmable logic array. As a prac- 
tical matter, therefore, it was necessary to optimize 
this signal manually. However, it should be noted 
that this step will not be necessary once the fully 
optimized version of PLPL becomes available. 



a tri-state buffer onto the 8-bit bus. During the sec- 
ond cycle, the tri-state buffer is turned off and the 
previously latched half of the data is driven onto the 
bus. On incoming accesses (8-bit path to 16-bit path), 
the process is reversed. 

The control mechanisms that perform this cycling 
depend on the WAIT and R/W signals passing to 
and from the DMA controller, and on the ability 
to enable or di.sable the latches and transceivers selec- 
tively. The Am2952 bidirectional I/O port was 
chosen because of its dual registers and its flexible 
control. The AmPAL22V10 device was chosen to 
match the required number of control pins and func- 
tions. Since the complexity of this design requires 
the use of all of the PAL's I/O pins for control func- 
tions, however, it was necessary to use a 74LSI61 
counter to provide the state sequencer function. 

Programming with PLPL 

It has long been the logic designer's "art" to merge 
the often very different concepts and notations of tim- 
ing information with Boolean logic. Yet, the evolu- 



tion of a syntax to fully express this art has taken 
a long time. AMD recently developed such a language 
for programming the AmPAL22VI0, however. 

"Programming language for Programmable 
Logic," or PLPL, allows the designer to specify a 
design using multiple input formats. This specifica- 
tion flexibility supports the variety of design 
approaches necessary lo express differciil design 
problems efficiently. These formats range from sim- 
ple sum-of-products Boolean equations to high level 
constructs. PLPL also supports the input specifica- 
tions for many types of AND/OR based devices, in- 
cluding all of the current AMD programmable logic 
array and PROM devices. 

PLPL is block structured, and includes the high 
level language constructs If-Thcn-lilse. Case, and 
I'or; all familiar to many programmers of the C" and 
Pascal languages. Macros, functions, constants, and 
variables may also be used in PLPL. The language 
also facilitates use, clarity, and self-docunienlation. 

Such current programmable logic technology and 
associated programming langua^""- as PLPL allow 
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A matter of timing 



The complex AmPAL22V10 design used the accom- 
panying timing diagram to correspond to the 
desired waveforms. They are partitioned by the 
respective binary state (or count) from the counter. 
The desired timing requirements during the 
period when the DMA controller is bus master 
appears below. During time T1, address latch 
enable (ALE) is asserted by the DMA controller to 
denote the beginning of the cycle; a short time later, 
an address is driven onto the bus. This address is 
valid at the falling edge of ALE. The control signal 
LAO (latched AO), therefore, must be valid at this 
time, as well. In this phase of the cycle, it must also 
be high to enable the odd byte from memory to be 
loaded into the bidirectional I/O port. In addition, 
the assertion of ALE performs the function of reset- 
ting the 74LS161 counter to 0000 in order to syn- 
chronize the cycle. 

^During time T2, the DMA controller will assert its 

DS signal. The timing for this signal, in conjunction 
with the RAA/ signal (asserted in T1) must be trans- 



formed into an 8088-equivalent RD signal. During 
a word read cycle, this RD signal also must be arti- 
ficially negated and then reasserted to accomplish 
a double byte read. At the same time, the DMA con- 
troller must be "parked" in order to multiplex or 
assemble a word. Thus, the WAIT signal is also 
asserted at time T2. During time TW (S2, SI, 
SO = 010), the receiver clock enable control signal 
B must be asserted in order to allow the next sys- 
tem clock's rising edge to strobe the upper byte into 
the bidirectional I/O port. This is accomplished dur- 
ing the next TW period (82, SI, SO = 011). 

During the remainder of the word read cycle, RD 
is negated and then reasserted after LAOhas been 
forced low to address the even byte. A is then 
asserted to allow both the previously latched upper 
byte and the current lower byte to be driyen_onto 
the DMA controller's pins. And finally, the WAIT sig- 
nal is negated, allowing the DMA controller to finish 
its read cycle by strobing in the 16 bits of command 
data on its data pins. 
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highly organized application-oriented control blocks 
to be formed easily. These tools can conceptually 
raise the designer above the details of the design at 
the logic level and directly translate the necessary 
response characteristics from a timing diagram. 

This approach can be referred to as a "sculptured 
design" technique because it is analogous to the way 
solid stone is formed according to an artist's image. 
Raw logic can be transformed directly into useful con- 
trol functions from the desired timing information. 



The AmPAL22V10 is, in essence, a fuse-program- 
mable gate containing up to 22 inputs and 10 out- 
puts. It can define and program that architecture of 
each output on a pin by pin basis. Thus, the designer 
is free to optimize the design mix between registered 
and combinatorial functions as needed. 

The AmPAL22V10 is programmed by opening fus- 
able links in any or all of its 10 output macrocells, 
as well as in its AND gate array. The AND gate struc- 
ture is very similar to other PAL devices; therefore 
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It allows the same powertul, yet familiar teatures. 
However, it is the AmPAL22V10's 10 output logic 
niacrocclls that give the designer substantial new 
design freedom. Moreover, at each macrocell out- 
put is a tri-statc output buffer controlled by a 
separate output-enable AND gate. 

These macrocells provide the AmPAL22V10's key 
features. They can be configured to make any or all 
of the I/O pins act either in sequence or in combi- 
nation and have either active-high or active-low char- 
acteristics. Furthermore, the output enables can 
individually control the direction of the pins so they 
act as outputs, inputs, or bidirectional ports. 

A number of trade-offs and limitations are appar- 
ent in a design that so dramatically affects the input 
and output of the system. The most obvious limita- 
tion stems from under utilization of 16-bit periph- 
erals on an 8-bit bus— the speed of all I/O operations 
are cut in half. As a result, bus utilization will 
increase if the 16-bit peripheral represents a signifi- 
cant factor of the bus use. A CRT controller such 
as the .^m8052 might use 5 to 10 percent of the bus 
bandwidth for display information when using 16-bit 
I/O. Converting to 8-bit I/O would double bus use 
to 10 to 20 percent. Another factor that might affect 
the bus usage is the efficiency of the 8- to 16-bit con- 



version control logic. If the state machine designed 
to perform the 8/16-bit (or 16/32-bit) conversion is 
improperly designed, extra transfer overhead might 
be introduced. This might mean a sequential transfer 
of two 8-bit values would take twice as long a single 
16-bit transfer. 

The design constraints might limit the use of the 
peripheral to byte-only operations during data trans- 
fers (as in the design using the DMA Am95I6 con- 
troller), and slow it down by a factor of two during 
command operations. For such a DMA device as the 
Am9516, the extra time required for command fetch- 
ing is not usually a significant portion of bus time. 

System designers will have to weigh the cost of 
the extra overhead on a case-by-case basis. The ben- 
fits may well justify these limitations — particularly 
when the bus is self-limiting, but the device charac- 
teristics allow for value-added designs, in addition 
to bus degradation for certain configurations, extra 
logic and design effort are involved. Most interfaces 
outside a system's immediate family require some 
kind of extra interface logic, however. By manipulat- 
ing the signals and incorporating them into program- 
mable logic devices such as the AniPAL22V10 device,, 
therefore, most of this logic is free. 



Monolithic 



m 



Memories 



£1 



2-669 



Programmable Logic Chip 
Rivals Gate Arrays in Flexibility 



Hanging a user-customized macrocell at eacti output of an LSI-level 
fuse-programmable array logic gives it the flexibility of a gate array. 
The result: Less hardware is needed for high-level functions. 

Rsprint with parmlMlon from ELECTRONIC DESIGN 



Designers of semicustom ICs have long appre- 
ciated the in-house programming capabilities of 
programmable array logic devices. But gate arrays, 
with their LSI densities and wider opportunities for 
customization, are an inviting alternative, even if 
they must be sent out for metal-mask processing. 
Which approach should 
the designer use? 

An agonizing choice be- 
tween convenience and 
versatility is no longer 
necessary. The first fuse- 
programmable chip capa- 
ble of implementing LSI 
circuits blends the advan- 
tages of both types. The 
result is more cost- 
effective designs with 
higher-level functions 
than previous pro- 
grammable logic devices 
offered and a shorter 
turn-around time than is 
possible with gate arrays. 

The new chip, the 
AmPAL22V10, can be 
thought of as a fuse- 
programmable gate array, 

since at twice the density of previous programmable 
array logic devices, it replaces the equivalent of log- 
ic circuits having 500 to 1000 gates (see "Blending 
Programmable Logic and Gate Arrays," p. 97). In 

Brad Kitson, Section Manager 
David Laws, Managing Director 
Warren Miller, Section Manager 
Advanced Micro Devices Inc. 
901 Thompson Place, PO Box 3453 
Sunnyvale, Calif. 94088 




fact, because of its flexible, programmable architec- 
ture, the chip spans the breadth of standard SSI, 
MSI, and present-generation PAL devices, often ac- 
complishing what could not be practically done with 
several such chips. 
Designers program the 22V10 by opening fusable 
links in any or all of its 10 
output macrocells, as well 
as in its AND-gate array, 
developing the needed log- 
ic functions. Though the 
AND-gate structure 
makes the unit similar to 
other PAL devices, the 
chip's 10 output logic mac- 
rocells afford a substan- 
tial new degree of design 
freedom (Fig. 1). 

What's more, at the 
chip's output pins are 10 
three-state output buff- 
ers, each fed by a mac- 
rocell and controlled 
through a separate out- 
put-enable AND gate. 

The macrocells provide 
the 22V10's key features: 
They can be configured to 
make any or all of the I/O pins act either sequen- 
tially or combinatorially and have either active 
highs or active lows. Additionally the Output En- 
able can individually control the I/O terminals to 
act as output pins, input pins, or bidirectional ports. 
Thus the designer can readily change the chip's ar- 
chitecture. 

Within the array, each AND gate is fed by the 
true and complementary levels of the chip's 12 input 
lines, one of which doubles as a clock line. Also, each 
AND gate is fed by the true and complementary 
levels of the 10 feedback outputs from the output 
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logic macrocells. 

Each of the AND gates therefore has 44 inputs, 
and there are a total of 132 AND gates in the array. 
The result is over 5800 fuses that can be pro- 
grammed to perform the desired function. 

A total of 120 of the input AND gates are distrib- 
uted to drive 10 OR gates, forming the logical part 
of the AND array. In addition, one AND gate is 
assigned to control each of the 10 three-state out- 
puts, and the remaining two gates activate synchro- 
nous register presets and asynchronous register 
resets. 

The array itself consists of 10 AND-gate macro- 
cells (five similar pairs), each logically ORing the 
outputs of anywhere from 8 to 16 AND gates. The 
AND gates are distributed so that the two out- 
ermost macrocells in the array logically OR to- 
gether eight AND gates. The ne.xt two outermost 
macrocells logically OR together 10 AND gates; the 
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1. A (use-programmable LSI device combines the efficiency 
ol programmable logic arrays vvilti the flexibility of gate 
arrays. Ten user-definable output macrocells allow the 
designer to set the logic outputs individually (or one of (our 
condgurations. They can even be used as inputs or 
dynamically controlled I/O. 



next, 12 AND gates; and so on, traveling inward 
along the array. In other words, moving toward the 
array's center, two AND gates are added to succes- 
sive pairs of macrocells, ending with the two inner- 
most macrocells, each ORing 16 AND gates apiece. 

The variable distribution of AND gates is partic- 
ularly useful when implementing counting, ex- 
clusive-OR functions, and complex state machines. 
For example, a 10-bit binary counter has only one 
AND gate on the LSB but requires an additional 
gate for each succeeding bit, for a total of 10 AND 
gates to implement the MSB. Should a hold, parallel 
load, or specialized function also be desired, the 
MSB requires even more AND gates. 

To program the array into its familiar sum-of- 
products Boolean function, a designer blows the cor- 
responding fuse links. Those links initially con- 
nected the chip's pins and their complements to all 
132 of the 44-input AND gates. As mentioned, the 
designer can also program any OR-gate output to be 
a dedicated input or a dynamically controlled I/O 
pin. This is possible because the connection from an 
output macrocell to its output pin is through a 
three-state output buffer and is controlled by an 
output-enable AND gate. Turning off the connec- 
tion turns the output pin into an input. Otherwise 
the pin remains connected to the output macrocell, 
fed by the OR gate. 

An output witti a difference 

The chip's output section is where it appears more 
like a gate array than a PAL device. The output 
macrocells allow the designer, by individually set- 
ting the outputs to be either sequentially or combi- 
natorially and either active high or active low, to 
program either the true or complementary version 
of a logical term— whichever makes the most effi- 
cient use of the chip's AND array and fits the chip 
to the application. Furthermore, independent con- 
trol over the active state helps the designer conform 
to common logic conventions, thus avoiding con- 
fusion and error. 

For example, certain functions, such as chip se- 
lects and resets, tend to be active low. The output 
macrocells avoid contradictions with convention by 
inverting outputs, individually changing them when 
necessary to conform. Finally, by individually 
selecting either sequential or combinatorial out- 
puts, the macrocells impart to the chip the capacity 
for implementing more functions than is possible 
with existing PALs. 

The macrocell (Fig. 2a) contains three main logic 
blocks: a rising-edge-triggered D-type flip-flop 
with asynchronous reset and synchronous preset in- 
puts, a four-to-one output-path selection multiplex- 
er, and a two-to-one feedback-path selection multi- 
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plexer. Two fuses are imbcddi'd in each macrocTll to 
control the four possible output confijiurations: li„ 
selects sequential or combinatorial operation; I*„ se- 
lects active low or active hif^h. 

With both fuses intact, the output is se(iuential 
and active low (Fi^. 2b). The Q output of the flip- 
flop passes to an inverting output buffer, and the Q 
output feeds back to the AND array internally. This 
configuration is particularly suitable for state- 
machine designs. Alternatively, by pro^rammint^ 
(blowing;) P„, the designer can make the output ac- 
tive hifjh. 

On the other hand, by projjrammin^i only R,„ the 
desi(;ner makes the output combinatorial and active 
low. For this scheme, the OH-jjate output bypasses 
the flip-flop and feeds the inverting output buffer 
directly. At the same time, the feedback multiplexer 
passes the output back into the array (Fig. 2c). 

By programminn both fuses, the designer makes 
the output combinatorial and active high, and, 
again, the feedback multiplexer passes the output . 
back to the array. This configuration serves best for 
"glue" logic and for generating complex control 
functions. 

Finally, when the device is programmed in the 
combinatorial configuration, the programmable 
Output Enable can be used to transform an output 
pin into a dedicated input or a dynamically con- 
trolled I/O terminal. Once enabled as a dedicated 



input, however, a pin's output function is sacrificed. 
This cajiability is extremely valuable when par- 
titioning functions into the device. The limitation is 
no longer the numbers of input devices, but the total 
number of device pins. Lastly, as a dynamically con- 
trolled I/O, the pin serves both functions, meeting 
most bus requirements. 

Intelligent use 

One of the most challenging jobs in designing 
microprocessor systems is installing intelligent pe- 
ripheral controllers— that is, controllers that can 
transfer data themselves and so unburden the pro- 
cessor from inefficient polling operations. One solu- 
tion is to build a custom intelligent controller from 
scratch usingthe22V10(Fig. 3). This controller gov- 
erns data flowing between a microprocessor and up 
to eight peripheral devices, all conforming to the 
Small Computer Systems Interface standard (SCSI, 
the ANSI X3T9.2/82-2). 

The interface consists of an 8-bit bidirectional 
data port (DB t-DB q); three control outputs- 
Acknowledge (ACK), Select (SEL), and Rese t 
(RST); and five control input s-Re quest ( REQ ), 
Control/Data (C/D), Message (MSG), Busy (BSY), 
and Input/Output (I/O). There are additional 
signals, unrelated to the SCSI standard. They in- 
clude several microprocessor controls, such as ad- 
dress lines A] and A2 for setting the data's desti- 



Blending programmable logic and gate arrays 



Centered on a fuse-programmable AND array, the 
architecture of a PAL (programmable array logic) 
device offers designers a powerful, instantly mod- 
ifiable logic building block. As a result, PAL devices 
are used in a wide variety of applications, ranging 
from microprocessor-based systems to supermini- 
computers. 

Since gate arrays are not programmed in house, 
they have a much slower turnaround time and higher 
cost. Yet gate arrays' simple logic structure makes 
them easier to customize than PAL devices, and their 
larger size makes for denser logic structures. 

A gate array's simpler structure can be a draw- 
back, however. For example, typical gate arrays are 
based on just two- or three-mput gates; therefore, to 
implement complex functions, they often require 
many gate levels, which results in slow logic struc- 
tures. In contrast, PAL devices, like memories, make 
more efficient use of their silicon area and can be 
configured into fast logic structures. Consequently, 
even with a slight overhead for programming cir- 
cuitry, PAL densities are beginning to rival those of 
gate arrays. 

However, innovative circuit techniques and an 
oxide-isolated bipolar process team up in the 
AmPAL22V10 and combine the advantages of PAL 



devices and gate arrays. For example, the pro- 
grammable output macrocells are designed to add 
only one Schottky diode in the delay path. As a 
result, the chips selected for high speed have a worst- 
case input-to-output propagation delay and setup 
time of only 25 ns, as well as a worst-case clock-to- 
output delay of a speedy 15 ns, over the commercial 
temperature range. Also, the oxide-isolated process 
fits the chip's ability to implement complex func- 
tions into a 24-pin, 0.3-in.-wide package. 

For convenience, a feature called Preload loads the 
chip's internal registers with an arbitrary value, so 
that the designer can perform a complete post- 
programming functional test. In fact. Preload is a 
necessity for testing state machines, where the suc- 
cessive states depend on past and present values, as 
well as on the inputs. Without Preload, complex and 
sometimes impossible sequences would have to be 
entered before the first test could be performed. 

What's more, platinum silicide fuse technology 
makes programming faster and increases reliability, 
and a well-controlled melting rate yields large, sta- 
ble, nonconductive gaps. Address circuitry, much like 
that of a PROM, blows one fuse at a time, and pro- 
gramming is done with an algorithm that is easily 
adapted to a wide variety of available machines. 
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nation, read and write co ntrols (lO RD and lOWR), 
I nterru pt Request (I NTREQ ), Chip Select 
(CSSCSI), System ClocMSYSCLK). and buffer and 
latch enables (BE and LE) for passing bus data be- 
tween the processor and its peripherals. 

In operation, the controller assigns one of eight 
peripheral devices to be a master by activating SEL 
and at the same time sends the master's address to 
the SCSI data port.. From then on, when a device 
notifies the controller that it needs to transfer ei- 
ther a command or data, the controller interrupts 
the host processor. In turn, the processor reads the 
controller's status register to determine which de- 
vice is requesting service and the type of request it 
is making. After its status register is read, the con- 
troller removes the interrupt request and carries 
out the required transfer. 

Meanwhile the selected device issues a BSY 
signal, letting the other devices Know that the bus is 
in use. The d ata tr ansf er is m anaged by the hand- 
shake s igna ls ACK and REQ, and the proce ssor ac- 
tivates RST as needed by pulling both WR and RD 
low at the same time. 



Matching the chip to its application starts with 
an assessment of the available resources against 
those needed to do the job. In general, that means 
establishing that the chip has enough input and 
output pins and product terms to accommodate the 
logic functions required by the application. 

As to the pinouts, the 10 outputs required by the 
application do, in fact, match the 10 available on the 
chip. Four of the pins are used for the bidirectional 
data bus and the rest are dedicated outputs. As for 
inputs, the SCSI controller requires 11, which the 
chip more than satisfies with its total of 12. 

Next the designer develops the product terms 
needed to execute the controller's job and compares 
the number and size of the required terms with the 
number available from the chip. The resulting con- 
troller equations (see Table 1) number 10, and these 
nestle neatly into the chip's 10 available AND-gate 
macrocells. Also, the maximum number of terms in 
any one equation— in this case, those making up the 
acknowledgment function — is only 6, compared 
with the chip's capacity of 8 to 16. Thus the control- 
ler function fits easily into a single programmable 
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2. Each of 10 output macrocells (a) are Independently fuae-programmed (or one of (our 
configurationa. Fuse Rn controla whether the outputs are sequential or combinatorial, and 
(use P„ determines whether they are active high or active low. With no fuses blown, the 
output is sequential and active low (b). Blowing only Rn gives a combinatorial active-low 
output, and blowing only Pn gives a sequential active-high output. Finally, blowing both 
fuses changes the output to combinatorial and active high (c). 
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maa 



logic array, as can more complicated micro- 
processor peripheral interfaces. 

The 22V10's real power, however, becomes appar- 
ent when it is needed to fill an application that 
appears to demand more than the chip's resources 
can deliver. Insufficient input or output pins or 
product terms are frequent shortcomings of many 
devices when the job is very formidable. With its 
highly versatile I/O architecture, however, the 
22V10 can accommodate conditions that would 
make chips even larger than itself inadequate. 



Table 1. Peripheral controller equations 



Combinatorial oqutliont 

BE - CS • RD • A2 • A, 
LE - CS • WR • Aj 
D3 =■ A, • C + A, • ACK 
D2 - A, • M + A, • SEL 
Di - A, • B + A, • RST 
Do = Ai • I + A, • R 

Soquontial oquillont 

INTREQ "R'(I"C'M + J'C'M + I-C'M + ICM) 

ACK"CS • WR • A2 • A, • D3-I-R • ACK ■ ( CS " WR ' *2 * ^i ) + 
R • CS • RD • A2 • Ai 

SEL = CS • WR • A2 • A, • Dj + B • SEL ■ ( CS " W^ ' ^2 ' Ai ) 

RST = RD • WR + CS • WR ■ A2 • A, • Di + 
RST • ( CS • WR ■ A2 • A, ) 




Data 0.«_ 



ACK 
SEL 
RST 



■ ACK 
■SEL 



■REO 
•C/D 
•MSG 
■ Busy 
• I/O 



■f*- DBj-DBo 



3. The problem of designing an intelligent peripheral 
controller it simplified by the 22V10's flexible I/O structure. 
With the freedom to assign different architectures to 
different pins, the designer can make better use of the 
chip's input and output pins and product terms. 



For example, one of the most complicated func- 
tions to perform with a digital computer is floating- 
point arithmetic, or more specifically the nor- 
malizing of numbers in the process. As a result, no 
standard MSI or LSI logic for normalization exists. 
Yet the programmable array is well-suited to inte- 
grating at least part of this complicated logic func- 
tion. 

Normalization is necessary because a floating- 
point number is represented in two parts: a man- 
tissa and an exponent. The mantissa is a fractional 
number that is adjusted, or normalized, so that a 
binary 1 always occupies the most significant bit. To 
normalize a floating-point number without chang- 
ing its value, a compensating adjustment must be 
made to the exponent. In effect, the exponent, a 
power of two, is incremented for each bit position 
that the mantissa is shifted to the left. 

Because the process of normalization can be very 
costly if done wholly in parallel, it is more efficient 
to process a number one byte at a time. Further- 
more, because of the function's complexity, it is rea- 
sonable to partition the normalization unit into two 
units: an execution unit, which acts on the incoming 
data, and a control unit, which directs the execution 
unit. Thus the two units together form a byte-wide 
floating-point normalization (BFN) unit, with the 
programmable logic array making up the execution 
unit (Fig. 4). 

The design of the execution unit breaks up log- 
ically into the four main instruction categories; De- 
tect, Normalize, Merge, and Shift. Detect tests the 
incoming data for a binary one in any bit position. 
When such a nonzero byte appears, a flag, F, 
informs the control unit and tells it to begin the 
normalization process. Next, the Normalize com- 
mand determines the number of bit positions to 
shift the incoming byte, in order to put its first bit 
in the most significant position. Then the Shift com- 
mand adjusts the input data to the left by the spec- 
ified number of bit positions, filling the lower bits 
with Os. Finally, the Merge command adds in the 
full numbers' remaining bits, adjusting them so 
that the entire number is intact and shifted. 

From the beginning, even before the equations of 
the execution unit have been fully developed, it ap- 
pears that the resources of the programmable array 
will be insufficient for the job (see Table 2). Four- 
teen input pins are needed, af>ainst the chip's 12, 
and 12 output pins are needed, whereas the chip has 
only 10. Nevertheless, a closer look at the execution 
unit equations, with an eye toward consolidating 
I/O pins and functions, turns up paths around the 
seeming shortages. 

One "problem" that proves not to be one is the 
number of outputs. Although they number 13, not 
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all are active at the same time. For example, the 
outputs V„-V^, which specify the number of shift 
positions as a three-bit binary number, are active 
only when no other outputs are. Thus'they can share 
their output circuitry with other signals, relieving 
the output shortage. 

However, combining the shift outputs with other 
terms creates a product-term deficiency. Yet, again, 
careful observatic-n reveals a solution. One way to 
reduce the product terms of a function is to invert it, 
using the programmable array's output macrocell. 
The most likely candidate for such a reduction is the 
function with the most product terms. Although 
there are several, the best, because it is relatively 
removed functionally from the others, is the detec- 
tion flag, F. Its function is given by: 

where D7 through D,, are input data bits, which, if 
any are high, indicate a nonzero byte. 



When De Morgan's theorem is applied, the long 
OR string of active-high data reduces to a single 
AND term, saving a total of seven product terms. 
Still, two input pins must be found for the pro- 
grammable array to fully incorporate the logic of 
the execution unit. Since De Morgan's theorem can- 
not minimize the logic any further, the only answer 
is to locate any product terms that underuse the 
chip's resources, combine them, and lend their out- 
put pins to the cause by making them I/O pins. 

Examination of the instruction input specifica- 
tion (see Table 3) reveals that during Detect only 
three of the five instruction inputs are necessary, 
freeing the two others to be used as outputs. There- 
fore instruction bits lo and I, can share the outputs 
that were previously dedicated to shift bits Vq and 
V,, thus meeting the execution unit's final require- 
ments in a single chip.D 



Table 2. Allocation of chip resources 




Input pins 


Output pins 


Product terms 




Number 


Names 


Numlicr 


Names 


Initial 
require- 
ments 


14 


CLK. 

D7-D0. 

I4-I0 


12 


F (detection flag), 

V2-V0, 

R7-R0 


114 required 


Resources 


12 


None 


10 


None 


120 available 


Final 
require- 
ments 


12 


CLK, 

D7-O0. 

I4-I2. 

I1/V1, 

lo/Vo 


12 


Iq/Vo. 
R7/V2. 
Ro/F. 
Rfl-Rl 


IOB required 



Table 3. Instruction definitions 




Instruction 
code (I4-I0) 


Detection 
Hag 


Detect 
Normalize 
Merge 
Shift 


OOXXX 

ooxxx 
01N2N2NO 
10N2N1NO 



1 

X 
X 



Function unit 

cloclt (50 ns) " 

(CLK) 



Data Input (100 ns), 
D,-D„ 



Byte-wide floating-point 

normalization (BFN) 

control unit 



instructions, 
14- lo 



Status 

(detection 

(lag and 

others) 



BFN 

execution 

unit 



f 



BFN functional unit 



Values, 
V2-V„ 



Result output, 
R,-Ro 



Exponent 

adjustment 

functional 

unit 



Exponent output 



4. Floating-point arithmetic is a hardware-intensive process that, lor the most part, has 
defied integration. Yet by partitioning the tasks into smaller pieces, a designer can 
apply the power of the programmable array to the job. The chip's flexibility 
contributes to its successful implementation as a floating-point execution unit, even 
though its resources may seem insufficient. 
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XOR PLDs^ unique architecture is better 
suited than that of conventional AND-OR 
PLDs to certain applications. Once you un- 
derstand the basic theory behind their oper- 
ation, you can easily configure them to per- 
form the tasks of such devices as binary 
counters and video shift registers. 

Chris Jay, Monolithic Memories Inc 

Many designs — notably high-speed counters for ac- 
cessing memories, for counting events, or for sequential 
control in high-speed processor systems — require state 



machines that step through their sequence of operation 
by a binary count. XOR PLDs are ideal devices for 
these types of jobs, but they are rarely considered for 
such appHcations — perhaps because design engineers 
don't understand their useful, but unusual, architec- 
ture. 

Like the more sophisticated of today's conventional 
devices (Fig la), exclusive-OR (XOR) versions have a 
registered architecture (Fig lb). The distinguishing 
features of the latter are two pairs of product terms 
(multiple-input AND gates), which drive two summing 
terms (OR gates), which in turn feed two inputs of an 
XOR gate. The combinational XOR gate's output con- 
nects directly to the D input of the register. (For a 
theoretical discussion of the advantages of XOR PLDs 
over AND-OR PLDs, see box, "Principles of XOR PLD 
operation.") 



INOUTS, FEEDBACK AND I/O 



(a) 

■-fe: 




■=^^ 



INOUTS, FEEDBACK AND I/O 



CLOCK OC 



(b) : 




)E>— 



Sj 



-=^^ 



Fig 1 — You can easily distinguish between a conventional AND-OR PLD (a) awrf an XOR PLD (b). The latter has an XOR gate preceding the 
register. 



Reprinted from EDN, May 28, 1987 
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The distinguishing features of an XOR 
PLD are two pairs of product terms, which 
drive two summing terms, which in turn 
feed two inputs of an XOR^ate. 



The applications described here make use of MMI's 
20X8A and 20X10A PAL devices. These 24-pin XOR 
PLDs have an improved 22.2-MHz clock and worst-case 
propagation delay of 30 nsec compared with older, 
12.5-MHz, 50-nsec parts. 

XOR PLDs bring benefits to counters 

The prime use of XOR PLDs is in counters. Fig 2a 
shows the pinouts and Fig 2b the Palasm logic equa- 
tions of a 10-bit, loadable, up/down counter. You can 
design such a counter into a single 20X10A PAL device. 
The counter has ten registers, QO through Q9, capable 
of binary, up/down counting. The up control is high for 
an up count and low for a down count. Registers Ql 
through Q8 load synchronously through data inputs Dl 
through D8; the loading is enabled by an active-low /LD 
input (see box, "Palasm notational conventions"). Be- 
cause of a pin limitation of the package, Q9 loads via the 



up/down-control pin. This pin (the D9 input during 
loading) controls the counter's up- or down-counting 
mode. QO is always loaded with a logic low. 

A counter larger than 10 bits requires two or more 
XOR PLDs. Unlike MSI counters, which come with 
carry-in and carry-out pins, you must incorporate in 
your design provisions for cascading XOR PLDs. When 
cascading two XOR PLDs, you must provide a carry- 
out-enable signal from the least significant counter to 
the count-enable input of the most significant counter. 

When the least significant counter has reached its 
maximum count, it should enable the most significant 
counter to increment its count after the next rising 
edge of the clock pulse. However, time delays arising 
from the time needed to decode and propagate the 
Carry signal could slow the system down. To avoid this 
delay, your design should generate a look-ahead-carry 
output. 











Fig 2b 




10-BIT 


EVEN BOUNDARY 


LOADABLE 


/QO 


:= QO*/LD 
+ LD 


! COUNT 
;LOAD 




UP/DOWN COUNTER 




:+: GND + GND 




CLKr" 








/Ql 


:= /QIVLD 
+ /D1*LD 
:+: QO*/LD*UP 
+ /QOVLD*/UP 


;HOLD 

; LOAD 

; COUNT UP 

; COUNT DOWN 


\^ 




^VCC 


~°n. 






^QO 


/Q2 


:- /Q2VI.D 
+ /D2»LD 
:+! Q0*Q1*/LD»UP 
+ /Q0VQ1*/LD*/UP 


;HOLD 

; LOAD 

; COUNT UP 

; COUNT DOWN 


DlC 






:]Qi 


/Q3 


:- /Q3VLD 
+ /D3*LD 


;HOLD 

; LOAD 


D2[^ 






^^^ 




:+: Q0*Q1*Q2*/LD*UP 


; COUNT UP 






~1q2 




+ /Q0VQ1*/Q2*/LD*/UP 


; COUNT DOWN 










/Q4 


:=> /Q4*/LD 


/HOLD 


DSF" 






~1q3 




+ /D4*LD 

:+: Q0*Q1»Q2*Q3*/LD*UP 


;LOAD 

; COUNT UP 








+ /Q0VQ1*/Q2*/Q3*/LD*/UP 


; COUNT DOWN 


D4r" 


PAL 




^04 


/Q5 


••- /Q5*/LD 
+ /D5*LD 


;HOLD 
;LOAD 


DsP" 


20X10A 




I]Q5 


/Q6 


: + : Q0*Q1*Q2*Q3»Q4VI.D*UP 

+ /Q0*/Q1*/Q2«/Q3*/Q4*/I'DVUP 

!- /Q6*/LD 


; COUNT UP 
; COUNT DOWN 

;HOLD 


DsF" 






^06 




+ /D6»LD 

: + : Q0*Q1«Q2«Q3*Q4*Q5VI.D»UP 

+ /Q0«/Q1»/Q2VQ3VQ4*/Q5»/LDVUP 


; LOAD 

; COUNT UP 

; COUNT DOWN 


D7|- 






1107 


/Q7 


:- /QTVLD 
+ /D7*LD 


;HOLD 
;LOAD 


Dar* 






"Hqs 




:+: Q0*Q1«Q2«Q3*Q4*Q5*Q6*/LD«UP 


; COUNT UP 








+ /Q0VQ1VQ2VQ3VQ4VQ5VQ6VLDVUP 


; COUNT DOWN 










/QS 


:- /Q8VI.D 


;HOLD 


UP (D9)r" 






^09 




+ /D8*LD 

: + : Q0*Q1»Q2*Q3*Q4*Q5*Q6*Q7»/I'D*UP 


; LOAD 

; COUNT UP 








+ /Q0VQ1*/Q2VQ3VQ4VQ5VQ6*/Q7*/LDVUP 


; COUNT DOWN 


GNpr" 






I]0E 


/Q9 


:- /Q9VI.D 
+ /UP*LD 
:+: Q0*Q1«Q2*Q3*Q4*Q5«Q6»Q7«Q3*/LD*UP 


;HOLD 
; LOAD 
; COUNT UP 








+ /Q0«/Q1VQ2VQ3VQ4VQ5*/Q6*/Q7*/Q3*/I-D*/UP 


; COUNT DOWN 



Fig 2a — This diagram shows the pin designations for a 20X10A XOR PLD configured as a 10-bit, loadable, upldoivn counter. This design is 
not cascadable. 
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Principles of XOR PLD operation 




The configuration of an cxclu- 


terms. From a generalized counter 


term comprising all the less-sig- 


sivc-OR (XOR) PLD simplifies 


with N registers, the Nth regis- 


nificant digits. The XOR gate 


counters and other state ma- 


ter, QN, requires N-i-1 product 


will hold the register in its 


chines that progress through a 


terms: 


existing state until the product 


binary-encoded counting se- 




terms of the less-significant 


quence. You do not, of course, 


/QN := QN*Q(N-1)* ... Q1*Q0 


digits go True, whereupon the 


necessarily need a PLD with 


+ /QN*/Q(N-1) 


XOR gate will toggle the output 


XOR gates to design a binary 




of the register. 


counter; you can use the simpler, 


+ /QN*/QN1 


The following is a generalized 


sum-of-products (SOP) architec- 


+ /QN*/QO 


equation for a register (QN) in a 


ture of a conventional PLD (see 




binary-count sequence using the 


Fig la in the main text) to 


For example, the most signifi- 


XOR PLD: 


implement the logic equations 


cant register is a 10-bit counter. 




that define a binary-count 


Q9, would require the summing 


QN := QN :+: Q(N-l)*Q(N-2)* 


sequence. 


of 10 product terms, using 10 


... Q2*Q1*Q0 


However, the conventional 


AND gates, in a convential 




PLD requires a larger number 


AND-OR PLD. Consequendy, 


You can consider the term 


and AND gates, or product 


although you can write abstract 


Q(N-l)*Q(N-2)*... 


terms, for a counter's higher- 


counter equations for AND-OR 


Q2*Q1*Q0 as a Carry input to 


order registers than does an XOR 


PLDs, you may not be able to 


register QN. 


PLD. The Boolean equations that 


find a real PLD with enough 


WhenQ(N-l)*Q(N-2)*... 


follow prove this point. They are 


product terms to embody those 


Q2*Q1*Q0 = 0, the register QN 


required for a generalized 


equations. 


is in a hold condition, and QN's 


binary-count sequence and are 


An XOR PLD, on the other 


output feeds back through the 


rendered in Palasm notation (sec 


hand, uses its product terms 


fuse array and the XOR gate and 


box, "Palasm, notation conven- 


more economically; no matter 


loads into itself Only when 


tions"). 


how significant the weighting of 




If QO is the least significant 


the register in a binary counter. 


Q(N-l)*Q(N-2)*... 


register, then 


each register requires only two 
product terms. 


Q2*Q1*Q0=1 


/QO := QO 


Before delving into the de- 


will the XOR function invert the 




tailed Boolean equations that 


polarity of QN and strobe the 


This is a straightforward binary 


support the preceding contention. 


inverted state into the register in 


function — divide by 2. If Ql is 


you should get a feel for the 


synchronicity with the rising 


the nest significant register, it 


logic underlying the XOR PLD. 


edge of the next clock pulse. 


requires, the Boolean sum of two 


Consider that, in a binary-count 




product terms: 


sequence, the polarity of a 
register's output toggles one 




/Ql :=Q0*Q1+/Q0*/Q1 


clock period after the state in 
which all of the less significant 




For register Q2 the equation 


registers were a logic one. 




becomes 


Next, imagine that you have a 
2-input XOR gate driving a reg- 




/Q2 ;= Q2*Q1*Q0 + /Q2*/Q1 -i- 


ister. The output of the register 




/Q2*/Q0 


is fed back to one XOR input; 
the other input sees a product 




which requires three product 
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Unlike MSI counters, which come with 
carry-in and carry-out pins, you must in- 
corporate in your design provisions for cas- 
cading XOR PLDs. 



The logic in the least significant counter can decode 
the penultimate count and feed the D input of a 
carry-output-enable register. When the counter incre- 
ments to the final count, the carry-output-enable signal 



is then clocked into the carry-output-enable register. 
The propagation of the carry-output-enable signal to 
the next counter stage is therefore coincident with the 
generation of the output of the ultimate count from the 



u/Po- 



01 D2 D3 D4 D5 D6 D7 D8 

nnnn 



UP 01 D2 D3 04 DS D6 07 08 
> OE 

UJ PAL20X10A CEBEOP- 



QO 01 Q2 Q3 Q4 Q5 06 07 Q8 



iiiiiilii 

GO 01 02 03 04 05 06 07 08 



D9 010 D11 012 013 014 015 

nnni 



UP DO 01 02 03 04 D5 06 
> CEDEOp' 

-0 CEI PAL20X10A OE 

LD 

OO 01 Q2 03 04 05 05 Q7 03 



iiiiiiiii 

09 Q10 Oil 012 Q13 Q14 Q15 Q16 Q17 



Fig 3a— Unlike the counter illustrated in Fig 2a, this counter uses a look-ahead-carry-out technique. Provision of this signal renders 
the counter cascadable. 



Fig 3b 



/QO 


'■' 


QO*/LD 
LD 


/Ql 


•- 


/QIVLD 
/D1>LD 




;♦ = 


O0«/LD»UP 
/QO*/LD*/UP 


/Q2 


1* 


/C32VLD 
/D2«LD 

00«Q1«/LD'UP 
/QO«/Q1«/LO«/UP 


/Q3 


^ 


/Q3VI.D 

/D3«LD 

QCI*Ql*Q2«/L0'UP 

/Q0VQ1'/Q2VLDVUP 


/Q'l 


;* 


/Q4«/LD 

/D4«LD 

Q0«Ol*Q2'O3«/LD«UP 

/QO VQl VQ2*/Q3 VLD VUP 


/Q5 


^ 


/QSVLD 

/D5»LD 

Q0«Q1«Q2*Q3«Q4»/LD«UP 

/QOVQ1VQ2VQ3VQ4VLDVUP 


/Q6 


j: 


/Q6VLD 

/DS«LD 

00»Q1«Q2«Q:«Q4«Q5«/LD*UP 

/OOVQl*/Q2VQ3«/Q4«/Q5VtDVUP 


/Q7 


I* 


/Q7VLD 

/D7«L0 

Q0»Q1»Q2»Q3»Q4«05«Q6'/LD«UP 

/OO VQl */Q2 * /Q3 • /Q4 « /OS « /Q6 ' /LD VUP 


/Qa 


I* 


/QSVLD 

/Da«LD 

Q0«Ql«Q2*O3«O4«Q5«QS«O7«/LD'UP 

/Q0«/Ql*/02«/Q3VQ4VQ5VQSVQ7VLDVUP 



Fig3c 



•.=■ UP*Q0*Ql"O2'Q3<Q4'Q5'Q6'O7'QaVLD'CEI 
* /UPVQ0VQ1VQ2VQ3VQ-1VQ5V06VQ7 
*/Q8*/I.D«CEI 



/QO*/LD«/CEI 

QO«/LD»CEI 

LD«/DO 

/QIVLD 
/Dl'LD 

QO>/LD'UP>CEI 
/QO'/LD»/UP'CEI 

/Q2VLD 

/D2«LD 

Q0*01*/LD«UP'CEI 

/Q0«/Q1VLDVUP*CEI 

/Q3VLD 

/D3«LD 

QQ«01*02*/LD«UP»CEI 

/QO"/Q1*/02«/LDVUP"CEI 

/Q4VLD 

/D4«LD 

QO«Q1«Q2«03VLD«UP*CEI 

/Q0«/Q1«/Q2VQ3«/LD«/UP'CEI 

/Q5VLD 

/D5«LD 

Q0«Q1'Q2«Q3'Q4«/LD'UP*CEI 

/Q0«/QIVQ2V03VQ4VLD*/UP'CEI 

/Q6*/LD 

/D6«LD 

Q0'Q1«02«Q3*Q4«Q5*/LD«UP«CEI 

/Q0«/Ql«/Q2>/Q3«/Q-1VQ5VLDVUP'CEI 
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registers in the least significant counter. By anticipat- 
ing the ultimate count in this fashion, this scheme 
avoids the propagation delay involved in decoding the 
final count. 

The counter designs specified by the sets of logic 
equations presented in Figs 3b and 3c use this 
look-ahead-carry-out technique. The designs are cas- 
cadable, 9-bit up/down counters; Fig 3c's specification 
generates both Carry and Borrow signals on a look- 
ahead basis. You can use these specifications to create 
an 18-bit counter by routing the Carry/Borrow output 
(CEBEO) of the first up/down counter to the count- 
enable input (CEI) of the second up/down counter (Fig 
3a). These counters are capable of clocking at rates as 
high as 22.2 MHz. 

To cascade more than two counters for high-speed 
counting, your design would require a composite carry- 
enable output to the more significant counters in the 
chain. You would need to generate a look-ahead-carry 
output from only the least significant counter; higher- 
order counters in the chain would require a count- 
enable output generated from the ultimate, or final, 
count rather than the penultimate count. Fig 4a shows 
a technique for cascading more than two counters in a 
synchronous-count chain. Depending upon the availa- 



Palasm notation conventions 

For consistency with the design spercifications, 
the text of this article uses the Palasm notational 
conventions for Boolean operators instead of the 
classical symbols. Palasm is MMI's PAL assem- 
bler, which converts Boolean-logic equations into 
fiise maps for PLDs. (Ed Note: Readers familiar 
with other PLD compilers may be more used to 
the C-language conventions used in Cupl and 
Abel. One compiler, loglCfrom Kontron, allows 
you, thankfully, to choose your own favorite sym- 
bol set.) 

• / represents a logical inversion. 

• * represents a logical AND. 

• -I- represents a logical OR. 

• = represents a logical equality. 

• :+: represents an exclusive- OR (XOR. 

• := represents the state-machine operation 
"Becomes equal after the rising edge of a 
clock pulse." 



bility of input pins, you could incorporate the required 
gates into the PLD itself, as Fig 4b shows. 

XOR PLDs have many applications outside of binary 
counting. The example of a video shift register shows 
how video applications can benefit from the 
programmable-polarity feature of the XOR gate. 

Fig 5a shows a typical video system using the 
PAL20X8A. Fig 5b gives the PLD design specification. 
An XOR PLD, when functioning as a video-shift regis- 
ter, loads row data, in parallel, from a character ROM 
or RAM. A control input to the XOR PLD (reverse 
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Fig 4 — You can cascade two or more XOR-PLD counters in a synchronous-count chain with either external (a) or internal (b) logic. 
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The XOR PLD series has many applica- 
tions outside of binary counting. 



DOT 
CLOCK- 



PAL VIDEO 
CONTROL 



PAL VIDEO 
REFRESH 
COUNTER 




REV V DO 

PAL20X8A . 



VIDEO 



VIDEO RAM • 1^ 11 ' 
W A10 N |aio 



74LS245 D|R 



7\ 



MPU 
DATA D0-D7 




LINE 
SYNC 




MPU 
ADDRESS 



Fig 5a — XOR PLDs suit other applications besides counters. The 

device's programmable output polarity is a vital part of this design, 
which uses an XOR PLD as a video shift register that incorporates a 
cursor control. 





Fig 5b 




/QO 


:=■ /D[)*LD*/REV*/CUES*/RST 


;LOAD DO 




+ DO*LD«REV*/CURS*/RST 


;LOAD INVERSE OF DO 




:+: /Q1*/LD*/CURS*/RST 


J SHIFT FROM Ql 




+ RST 


.■RESET QO LOW 

.•CURSOR LOW CLEARS /QO 


/Ql 


:- /D1*LD»/REV*/CURS»/RST 


;LOAD Dl 




+ D1*LD«REV*/CJRS*/RST 


JLOAD INVERSE OF Dl 




!+! /Q2*/LD*/CURS*/RST 


;SHIFT FROM Q2 




+ RST 


.•CURSOR LOW CLEARS /Ql 
; RESET CLEARS Ql 


/C2 


!- /D2*LD«/REV«/CUKS*/RST 


.•LOAD D2 




+ D2*LD»REVVCURS*/RST 


.•LOAD INVERSE OF D2 




: + : /QJVLDVCURSVRST 


; SHIFT FROM Q3 




+ RST 


! CURSOR LOW CLEARS /Q2 
.•RESET CLEARS Q2 


/Q3 


!- /D3*LD*/REV*/CURSVRST 


.•LOAD D3 




+ D3«LD*REVVCURS«/RST 


.•LOAD INVERSE OF D3 




: + : /Q4VLDVCURSVRST 


.•SHIFT FROM Q4 




+ RST 


.•RESET CLEARS Q3 
.•CURSOR LOW CLEARS /Q3 


/Q4 


:- /D4«LD*/REV«/CURSVRST 


.•LOAD D4 




+ D4«1,D*REV*/CURSVRST 


.•LOAD INVERSE OF D4 




■■*■• /Q5VLD*/CURSVRST 


.•SHIFT FROM Q5 




+ RST 


.•CURSOR LOW CLEARS /Q4 
.•RESET CLEARS Q4 


/Q5 


:- /D5*LD»/REV*/CURS«/RST 


.•LOAD D5 




+ D5*LD«REVVCURSVRST 


;LOAD INVERSE OF D5 




! + : /Q6*/I.D*/CURSVRST 


.•SHIFT FROM QS 




+ RST 


.•CURSOR LOW CLEARS /Q5 
.•RESET CLEARS Q5 


/Q6 


:- /D6«LD«/EEV*/CURSVRST 


;LOAD 06 




+ D6*LD*REV*/CURS*/RST 


.•LOAD INVERSE OF D6 




!+! /Q7*/LD*/CURS»/RST 


! SHIFT FROM Q7 




+ RST 


.•CURSOR LOW CLEARS /Q6 
.•RESET CLEARS Q6 


/Q7 


:- /D7*LD«/REV»/CURS*/RST 


;LOAD D7 




+ D7«LD«REV*/CURS«/RST 


.•LOAD INVERSE OF D7 




:+: /LD*/CURS*/RST 


.•SHIFT HIGH INTO /Q7 




+ RST 


.•RESET CLEARS Q7 



video, or REV) determines whether or not the data 
loaded into the register is to be inverted. Therefore, 
the XOR PLD can supply normal-video or reverse-video 
information. 

This design also incorporates a cursor control, which, 
when active, sets all of the register's outputs high. The 
design also has a reset line to clear all the registers. 
When you release the load input, the clock input will 
serially clock the data out of register QO as a video-data 
stream. The clock input to the PLD determines the dot 
rate of the video information and must come from an 
external dot-rate generator. 

The Palasm logic-equation files presented in this 
article omit comments, pin definitions, and device- 
simulator specifications because of space limitations. If 
you want the entire Palasm file, you may call the author 
at (408) 970-9700. EDFJ 
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Among the greatest challenges one faces when design- 
ing with the new generation of 32-bit, high- 
performance microprocessors is maintaining both 
byte and word addressability within the microprocessor's 
32-bit-wide word format and accommodating both high- 
and moderate-speed memory components and peripherals in 
the same system. Consider this example: Motorola's 32-bit 
microprocessor, the MC68020, has a feature known as dy- 
namic bus sizing. This feature helps maintain both byte and 
word addressability within the microprocessor's 32-bit word 
format. The microprocessor has internal logic that manipu- 
lates and directs words smaller 
than 32 bits. However, the 
MC68020 requires additional ex- 
ternal logic to handle 8- and 
16-bit memory arrays and pe- 
ripherals and to maintain byte 
addressability of 32-bit memory 
arrays. Consider this as 

well: The MC68020 is available in both 12-MHz and 
16.7-MHz speed grades. The 16.7-MHz speed translates into 
60-ns machine cycles and, consequently, into 90-ns memory 
cycles. Thus, the MC68020 can keep up with many of 
today's fastest memories. Moreover, in systems that use 
both fast memories and slower, low-cost bulk ones, the 
16.7-MHz version can interface with the slower memories 
without holding back the faster ones. But to do this, the 
MC68020 must insert wait states to extend the memory cycle 
for the slower memories. 

Both design problems call for additional circuitry — that 
is, some "glue" logic. And because the MC68020 employs 
asynchronous control signals and the memories need syn- 
chronous control, the additional circuitry must be capable 



A new, registered, asynchronous 

PAL device originated with suggestions 

from the manufacturer's 

customers. 



of both asynchronous and synchronous operation. A prod- 
uct from Monolithic Memories, Inc., the PAL20RA10, pro- 
vides what we believe is an elegant, single-chip solution to 
these problems. 

The PAL20RA10 is one of the most recently developed 
members of Monolithic's programmable array logic 
product family. As with all other PAL devices, it contains 
an array of programmable links that the user can configure 
to implement a variety of logic functions. The PAL20RA10 
is unusual in that the customer not only can configure the 
internal logic transfer function but also can implement logic 
functions to control the clock, 
the output enables, and the 
set and reset operations for each 
register. In addition, the 
user can configure the outputs 
to be registered or nonregistered, 
and active high or active 
low. We will discuss 
these features, as well as a universal port interface adapter 
application, in greater detail later in this article. 



PLDs and the semicustom market 

Programmable logic devices, or PLDs, fit between stan- 
dard logic and customer-designed circuits and thus are con- 
sidered semicustom circuits, along with gate arrays and 
standard cells. PAL products are a type of PLD. Discus- 
sions of semicustom integrated circuits often center on 
technological issues: process capabilities, device speed and 
power dissipation, number of available gates, and so on. 
But the customization of a device involves more than just 
meeting the customer's functional requirements. The device 
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Table 1. 

Logic devices rated by various selection 

criteria. 



Criterion 



Standard Gate Standard Full 

logic PLDs arrays cells custom 



Sourcing and 
availability 


1 


1 


3 


4 


4 


Performance 


5 


3 


3 


1 


1 


Space 
efficiency 


5 


4 


2 


2 


1 


Ease of 

design* 


2 


1 


2 


4 


4 


Ease of design 
changes 


3 


1 


2 


4 


4 


Design 

cost** 


1 


I 


3 


4 


5 


Time to 
market 


2 


I 


3 


4 


5 


System-level 
mfg. cost 


5 


3 


3 


2 


1 



* With the exception of standard logic, this means ease of use of the design 
tools, i.e., ease of understanding the design tool concepts, ease of doing the re- 
quired input, etc. 

' * Design cost is the sum of the NRE (non-recurring engineering) costs and such 
non-recurring manufacturing costs as masking, test vector generation, etc., 
that are part of the product development cycle. 



should also meet the customer's requirements for price, 
availability, risk, ease of use, and support. Only when all of 
these requirements have been met has a device been truly 
customized. 

Within this definition, even standard 7400 series logic 
might be considered customized. It meets most customers' 
requirements for low cost, availability, low risk, and high 
ease of use, and it provides just about any desired function. 

PLDs and gate arrays— semicustom 
alternatives and trade-offs 

The PLD was first conceived as a replacement for dis- 
crete TTL logic designs. Customers wanted higher integra- 
tion than that possible with standard SSI/MSI logic, but 
they could not find the functions they wanted in standard 
LSI circuits and did not want to pay for fully customized 
ones. The PLD provided a new option — semicustom 
LSI — that had the advantages of SSI/MSI availability and 
many of the advantages of fully customized LSI without 
such drawbacks as high cost and long development time. 



Today, gate arrays are rapidly becoming competitive in 
the same applications as PLDs. However, PLDs provide 
several advantages over gate arrays that make them much 
more amenable to customer needs for many applications. 

The primary advantages of PLDs are their immediate 
availability, the lower risk they entail, and their lower over- 
all cost per application. A design engineer, upon completing 
his design, can immediately customize a PLD to his specifi- 
cations. If he discovers that he has made a mistake in his 
design, he can correct it and program another device. Since 
he customizes a part just before it is used, he undertakes no 
risk in committing to a particular design. If he needs to 
make a change, he can program a part from inventory to 
the new specification — no parts are wasted. He can program 
the new part in minutes. 

According to a survey that appeared in the January 12, 
1984, issue oi EDN, the two greatest disadvantages of gate 
arrays are their high development cost and the lack of sec- 
ond sources for them. PLDs, in contrast, have an extremely 
low development cost and offer multiple sources. The short, 
simple development cycle for PLDs translates into easier 
use, lower costs, shorter time to market, and greater flex- 
ibility. Table 1 rates standard, semicustom, and full-custom 
integrated circuits according to various criteria used in 
choosing an IC for a design. A "1" indicates the best 
choice; a "4" indicates the worst. 

Dataquest, a market research firm, lists in the December 
28, 1984, issue of its Nielsen Dataquest Research Newsletter 
("Gate Array Impact on the ASIC Marketplace," by Katy 
Guill and Andy Prophet) five needs of the gate array user, 
all of which are met efficiently by PLDs. The first is for 
quick delivery of prototype units. With PLDs, prototypes 
are available in minutes once the design is complete. The 
second need if. for easy-to-use design automation tools that 
require a minimum amount of time to learn. PLD software 
tools include assemblers and higher-level languages that en- 
able a design engineer to automatically convert whatever de- 
scription he has created into a fuse plot. The PLD can then 
be programmed as easily as a PROM. 

The third need is for prototype units that work the first 
time. PLDs, being standard parts, are fully tested during 
production and can be quickly verified after customization. 
If a design error has been made, the design can be quickly 
modified and another prototype can be programmed in 
minutes. Using personal-computer-based software and a 
standard programmer, the design engineer can slay at his 
own desk and proceed to alter his design on the PC, and 
then instantly download the new design to the programmer 
and the new device. The device can then be dropped into 
the target application and the design verified. The ease with 
which PLDs can be verified encourages experimentation 
with designs. 

A fourth need is for low development and production 
costs. PLDs have extremely low development costs — only 
the up-front costs of the software and programmer. Their 
design cost consists simply of the design engineer's time. 
The production costs for PLDs are driven down by the mul- 
tiple sourcing and high-volume production characteristic of 
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standard products. Gate arrays provide neither of these 
advantages. 

The fifth need is for convenient design centers and train- 
ing. The PLD design center is very convenient indeed — the 
engineer's personal computer. Our company, Monolithic 
Memories, offers tutorial aids in several formats. Moreover, 
the standard nature of the PLD makes it easier to under- 
stand and use than the more customized and thus less 
"friendly' gate array. 

There are many other factors that influence the choice of 
semicustom device technology, of course, such as architec- 
tural efficiency. The ease of use of the PLD unfortunately 
leaves it much less efficient architecturally than other forms 
of semicustom logic. In a gate array, the user constructs log- 
ic functions using two-input NAND gates. In a PLD, the 
user constructs logic functions using higher-level macrocells. 
Naturally, there will be more wasted silicon in the PLD, al- 
though its optimized macrocell and efficient fuse array will 
still provide an effective structure. It is the gate array's very 
architectural efficiency that is the source of its high develop- 
ment cost, since that efficiency derives from computerized 
trial-and-error routing and a custom mask, both of which 
are expensive. 

Figure 1 gives the 1985 sales volume for various types of 
ICs and estimates the 1990 volume for them. All types show 
healthy growth. Because standard cells are starting at a low 
level and because they are expected to become very popular 
as the IC market matures, they are expected to show the 
highest growth rate. Linear arrays, though relatively new 
and technologically difficult, should also show impressive 
growth. PLDs and gate arrays should also show very 
healthy growth, with the growth in PLDs (367 percent) ex- 
ceeding that in gate arrays (234 percent). Part of the greater 
growth of PLDs is attributable to their ease of use, but part 
is also attributable to the ease with which they can be repro- 
grammed — for circuit designs that are likely to change, pro- 
grammable logic is the only practical solution. 

It is very difficult to directly compare PLD cost and gate 
array cost — one is comparing apples and oranges. PLDs 
have a minimal development cost and a standard unit cost. 
Gate arrays, on the other hand, have a large development 
cost and a small unit cost. Moreover, it is very difficult to 
determine the PLD architecture that is equivalent to a par- 
ticular gate array and vice versa. Currently available PLDs 
such as Monolithic's MegaPAL devices have as many as 
5000 four-input NAND-gate equivalents. However, a 
5000-gate gate array probably cannot replace a 5000-gate- 
equivalent PLD exactly, and such a PLD cannot replace a 
5000-gate gate array exactly. Such replacement is very 
application-dependent. A MegaPAL array, for example, is 
limited to 64 inputs and 32 outputs. If a gate array has more 
inputs and outputs, the MegaPAL device cannot replace it. 
And if the gate array cannot duplicate a feature of the 
MegaPAL array — such as its large fan-in on its AND gates, 
for example — then the gate array cannot replace MegaPAL 
array. Even at lower gate densities, comparisons remain 
application-dependent . 



Linear array: 




Linear arrays 
$350 



Figure 1. Growth of the worldwide application-specific 
IC market from 1985 to 1990. Dollars are in millions. 

(Source: Integrated Circuits Engineering Corp.) 



For purposes of providing a general idea of the cost effi- 
ciencies over volume of gate arrays and PLDs, we can com- 
pare them as if they did have one-to-one equivalency, how- 
ever. The nonrecurring engineering, or NRE, charge for a 
gate array can vary from $15,000 to $75,000, depending on 
complexity and design methodology. If a gate array has a 
$35,000 NRE charge and a $1.00 unit cost, its cost per unit 
drops almost inversely with volume, as shown in Figure 2. 
Now let us examine a specific type of PLD, the PAL device. 
If a PAL device costs $5.00 for a single unit and drops in 
price with volume according to standard pricing curves, it 
shows a more steady decline in price with volume (the PAL 
plot in Figure 2). The crossover point in this comparison is 
almost at 100,000 units. The graph shows quantatively the 
reasons why gate arrays become cost-effective over larger 
volumes, but it leaves out several other factors, including 
gate equivalencies, risk, and time to production. 

Let us compare the design and production costs for PLDs 
and gate arrays. For a PLD, design cost means the design 
engineer's time, as we mentioned above. This is already a 
committed cost. However, for a gate array, design cost can 
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Figure 2. PLD (PAL and HAL device) cost per unit 
vs. gate array cost per unit. The break-even point 
for PAL devices and gate arrays occurs at about 
100,000 units. 



be a sum of several factors. The time a designer spends on a 
computer to create a design can be very expensive, and the 
software he needs to create and test the design can also be 
expensive. Production costs for both types of device include 
not just the cost of the silicon but also the cost of masks, 
fabrication, and labor. And if a device does not work as ex- 
pected, the entire process must be repeated. 

With a PLD, the only purchase cost is that for the silicon 
itself. The PLD is mass-produced, so all the supporting 
design efforts— fabrication, test generation, and reliability 
enhancements — are spread over time and many devices. The 
result is much lower cost to the user. 

For high-volume, fixed applications, a PAL device can be 
replaced with a mask-programmed hardwired array logic, or 
HAL, device. The HAL device replaces the fuses of the 
PAL circuit with a final mask step that permanently 
metallizes the logic connections. This alternative is an ex- 
cellent match for the volume applications that are attractive 
to gate arrays. The HAL circuit combines the lower unit 
cost of the mask-programmed logic device with the pro- 
totyping capabilities of the PAL circuit. It offers advantages 
unmatched by any other form of semicustom logic. Assum- 



Figure 3. The basic PAL consists of two 

arrays— a fuse-programmable AND array 

and a fixed OR array. This arrangement 

follows the sum-of-products format 

familiar to the design engineer who uses 

Boolean logic. 
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Figure 4. Two-input AND array with four AND 
gates. An input (in or /in) is disconnected from an 
AND gate when the fuse associated with that input 
is blown. 



y u 



AND array 



^H^ 



^h: — ^HF 



^h; — ^Hc 



^E^f — ^h; — T)- 



{> 



o- 



£> 
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Figure 5. An "x" indicates an intact fuse. The two- 
input AND array with four AND gates (product 
lines) from Figure 4 is shown along with a two- 
output OR array. Note that the AND array is user- 
programmable, whereas the OR array is hardwired. 
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ing an NRE charge for a HAL device of $5000 and a unit 
cost of $2.00, we can plot the relationship of the HAL 
device to the PAL and gate array (see Figure 2 again). 

PAL devices— historical background 

PAL devices were introduced by Monolithic Memories in 
1978. They consist of two arrays of logic gates. The first 
array consists of AND gates that form Boolean products of 
the inputs, and the second array consists of OR gates that 
sum these products (see Figure 3). Hypothelically, there are 
n inputs to the AND array and m outputs from the OR 
array. The AND array consists of p AND gates (and p prod- 
uct terms), and all n device inputs, both inverted and non- 
inverted, are potential inputs to any of the p AND gates. In 
an unprogrammed PAL device, every device input is actu- 
ally connected to every AND gate. A device input ceases to 
be an input to a particular AND gate if the fuse that con- 
nects that input to the gate is blown. Fuses are blown ac- 
cording to a fuse map that is generated by the user via soft- 
ware. Figure 4 shows a simple two-input array with four 
AND gates. Figure 5 shows a simplified notation for this 
circuit — an " x " is used to indicate that a fuse is intact, and 
a single line is used to represent all lines of connection with 
an AND gate. (Note that the use of the single line does not 
imply the connection of those lines to each other.) Figure 5 



also shows a two-output OR array that uses the same con- 
ventions for intact fuses and lines of connection. Most PAL 
devices have a considerably greater number of inputs, prod- 
uct terms, and outputs than shown here, but this example 
illustrates the general PAL structure. 

The arrangement of the AND and OR gates follows the 
sum-of-products format familiar to the design engineer who 
uses Boolean logic to express his logic requirements. As 
implied above, the engineer can program the AND array in 
a PAL device by using links similar to those used in a 
PROM circuit. He opens the particular links needed to con- 
figure the logic to the user's specification. 

Before discussing the PAL20RA10 in particular, we 
should note the differences between PAL devices and other 
PLDs. Most PLDs consist of an AND array followed by an 
OR array. This is also the architecture of memory devices, 
including bipolar PROMs. In a PROM, the AND array is 
fixed and completely decodes the inputs. The OR array is 
programmable and holds the information at each decoded 
location, or the word at every address (Figure 6a). 

The first PLD was developed by Signetics and is called a 
field-programmable logic array, or FPLA. The FPLA is an 
implementation of the general PLD logic combination — an 
AND array followed by an OR array— in which fuses are 
used for interconnections in each array. The FPLA allows 
both arrays to be programmed (Figure 6b) and thus pro- 
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(a) 



(b) 



(c) 



Figure 6. PROM AND-OR array (a) — the AND array is fixed to decode the address, whereas the OR array 
is programmable for the stored data; field-programmable logic array, or FPLA (b)— both the AND array and 
the OR array are programmable; PAL device (c)— the AND array is programmable whereas the OR array is 
fixed. The PAL device offers much of the flexibility of the FPLA along with the low cost and easy program- 
mability of the PROM. 
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Figure 9. PAL16R4 logic diagram. This device has four registered outputs with a single (synchronous) 
clock. 
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Figure 10. History of the speed im- 
provements and power consumption 
reductions for devices in the Medium 
PAL family. 
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New product development 

New product ideas in semiconductor companies come 
from a variety of sources— from customers and those in 
direct contact with them to design engineers who have 
ingenious ideas about how to add features without 
adding cost, power, or timing delays. Since customers 
are usually willing to describe the product that would be 
ideal for them, they are often a good source of ideas. 
Product ideas usually pass through the following steps. 

First, the idea is generated. Typically this is in response 
to a limitation in a current product and is an evolutionary 
step. Revolutionary ideas such as the original PAL 
concept are rare and usually come from approaching 
old problems in a new way. If the idea is a customer's 
suggestion, it is usually reported to the factory in a 
trip report. 

The idea, whether generated internally or by a cus- 
tomer, is next submitted to a group typically consisting of 
representatives from product planning (including some- 
one from the product area into which the idea falls, 
whose job is to examine such ideas), marketing, design, 
and product engineering. The group discusses the new 
idea and responds to it; it either drops the idea, giving 
reasons for doing so, approves it for further considera- 
tion, or asks for more details. 

If a product idea is approved for further consideration, 
a member of the product planning group carries out 
market surveys and analyses. He contacts customers. 



often in person, and locates third-party market studies 
and research. He analyzes internal capabilities and seeks 
out information about products that could compete with 
the proposed product. At most companies, the result of 
his efforts is a new product approval document consisting 
of a description of the product and its applications, alter- 
natives to the product, a discussion of the market for the 
product, and a marketing strategy for it. 

If the product is approved by all the representatives in 
the group, it is assigned to design engineering. The 
design engineer is free to use whatever tools he chooses 
within the definition set out for him. This definition in- 
cludes such items as performance requirements but rarely 
specifies the process technology. It is often modified as 
new requirements arise. 

Design engineering is finished with the product only 
after it has been successfully produced on the production 
line. It is then released to product engineering, which 
takes on full responsibility for the product for its lifetime. 
Product engineering is typically involved before this point, 
however, making sure the product is progressing well and 
problems are being eliminated quickly. 

The final step is release to the marketplace. For most 
companies, release to market means sending out data 
sheets and samples to the field once production quan- 
tities have become available. For programmable products 
such as PAL devices, software and programmer support 
must also be in place. 
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Figure 7. Some of the 

first PAL devices 

from Monolithic 

Memories. 



Figure 8. The first 

registered PAL 

devices. 





vides a great deal of flexibility. However, its programming 
circuitry and fuses lead to a large, slow, power-consuming 
device and make programming expensive and difficult. 

In 1977, John Birkner of Monolithic Memories got the 
idea of taking the FPLA architecture and hardwiring the 
OR array, leaving only the AND array programmable (Fig- 
ure 6c). The result was called a PAL circuit, PAL being the 
acronym for programmable array logic. (Monolithic Memo- 
ries has since registered PAL as a trademark describing its 
products.) A PAL device offers higher speed and consumes 
less power than an FPLA. The first PAL devices were de- 
signed so that engineers could program them just like 512 x 
4 PROMs, using phantom fuses to fill in beyond the 512 
fuses in the actual devices. Thus, engineers could program 
them on widely available, inexpensive programmers. 

The first family of PAL devices included both combina- 
torial and registered devices. Combinatorial devices consist 
of just a programmable AND array followed by an OR 
array, with no registers or other macrocells. They are fairly 
simple and provide little more than a varying number of 
inputs and outputs (see Figure 7). Registered devices, on the 
other hand, provide clocked registers for the outputs and 
allow the previous output state to be stored and used for 
determining the next state. These types of devices are much 
more challenging. A register allows the device to be used as 
a state machine, one in which the outputs depend not only 
on the present inputs but also on the previous state. 



Since a state machine can be implemented in a variety of 
forms, the registered PAL architecture is difficult to define. 
To do so, one must specify the total number of registers, 
whether a register can feed its output back into the logic 
array, the nature of the control of the output enable follow- 
ing the register, and common register features such as preset 
and clear functions. These specifications have a major influ- 
ence on the potential applications of the device. 

Monolithic 's original registered PAL product family, 
whose members were designed to be used as state machines, 
is the Medium 20 family, where Medium describes the logic 
density and 20 is the number of pins. The family consists of 
four devices, all with 16 complemented inputs to the AND 
array and eight outputs from the OR array. They have vary- 
ing numbers of registers — zero, four, six, or eight — which is 
reflected in the part numbers, PAL16L8, PAL16R4, 
PAL16R6, and PAL16R8 (see Figure 8). The 16R4 directly 
implements Mealy state machines, whereas the 16R8 directly 
implements Moore state machines. In the I6L8, 16R4, and 
16R6, combinatorial outputs are enabled by individual 
product terms from the AND array. The product term 
enables allow the combinatorial outputs to function as in- 
puts. These inputs, along with feedback from the registers 
and eight dedicated inputs, provide the 16 inputs to the 
AND array. The logic diagram of the PAL16R4 is shown in 
Figure 9. 

The architecture of this family has proven capable of 
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Figure 11. 7474 dual D-type flip-flop. Note the two 
separate clocks that allow asynchronous operation of 
the registers. 



handling a remarkable percentage of what customers re- 
quire in programmable logic. The improvements customers 
typically request are higher speed and lower power con- 
sumption. We have made progressive improvements in 
speed, to a 10-ns propagation delay in the Medium 20D 
Series and a 6-ns one in the ECL PAL IC. We have also re- 
duced power consumption without sacrificing speed. The 
20B-4 Series requires one fourth the power of the original 20 
Series yet provides the same speed. With a propagation de- 
lay of 35 ns, the 20B-4 devices have a maximum supply cur- 
rent of 55 mA, compared to 180 mA for the original parts 
at the same speed. CMOS mask-programmed versions re- 
quiring zero standby power are also available. Figure 10 
shows the history of speed and power consumption im- 
provements in the Medium 20 family. 

While we have been making these improvements, we have 
also introduced new families with higher logic densities and 
the 40- and 84-pin MegaPAL families. We have added new 
features to these families, including additional logic capabil- 
ities such as exclusive-OR gates, arithmetic feedback, pro- 
grammable output polarity, product term sharing, register 
bypassing, and register preloading. We made these 
enhancements in response to specific customer needs, and 
they have opened new applications for programmable logic. 

Most of these new features were simply extensions of the 
original PAL concept. The original PAL family was a 
breakthrough, a new alternative offering completely new 
advantages in semicustom logic. In building on this family, 
we concentrated on speed and power improvements and not 
on finding revolutionary new architectures. However, the 
PAL20RA10, which we will discuss shortly, is another 
breakthrough concept. It did not derive from extensions to 



the original idea but arose as a new response to the needs of 
customers using PAL devices. 



The PAL20RA10 

The PAL20RA10 was first proposed at an internal 
Monolithic Memories conference in 1981. The original con- 
cept was described as an "unstructured PAL" having inde- 
pendent programmable clock, reset, and set controls on 
each flip-flop. Until then, all PAL devices had been struc- 
tured — or synchronous — in architecture; all transitions on 
register outputs occurred simultaneously in response to the 
rising edge of the clock signal. The proposed ability to con- 
trol the clock independently for each flip-flop would enable 
the engineer to create asynchronous logic within a PAL 
chip. 

The idea was proposed by Harry Hughes, who was then a 
field applications engineer, or FAE, in Great Britain. FAEs 
are the company representatives who have primary contact 
with customers; they have responsibilities ranging from 
helping existing customers with complex design problems to 
introducing Monolithic Memories to potential customers. 
As the primary customer contact, they are quite familiar 
with real user needs. 

The concept of the unstructured PAL circuit had origi- 
nated with several European customers. They had been 
using the Medium 20 PAL Series in large quantities. 
However, the application logic they had been integrating 
had not always fitted well in PAL devices. They had often 
had sections of individual 7474 flip-flops mixed among ran- 
dom logic. A 7474 device (see Figure 1 1) consists of two in- 
dependent D-type flip-flops, each with its own data input 
and complementary outputs, and clock, preset, and clear. A 
standard PALI6R8 has eight of these D-type flip-flops but 
only one clock and no preset and clear functions. Thus, in- 
tegrating the collection of 7474s and the glue logic had re- 
quired the customer to convert a multiclock design into a 
single-clock one. This had been difficult or impossible to do 
with standard products. 

The customer requested an independent clock, set 
(preset), and reset (clear) for each flip-flop. Since the FAE 
had found other customers requesting similar architectures, 
he proposed the device. He took the 16R8 logic diagram 
and added the four control terms, as shown in Figure 12. 
He described the concept at Monolithic's FAE conference in 
July of 1981. 

FAE conferences are held to foster technical exchange be- 
tween the factory and the field. Problems are identified and 
ideas are discussed. Now held every six months, these con- 
ferences are critical for planning future product directions. 

After it had been presented at the FAE conference, the 
unstructured PAL circuit was discussed within the factory. 
For an idea to go somewhere, factory discussion must pro- 
duce a business plan for the product that describes it com>i5 
pletely and estimates the profits that can be expected from 
it. New product ideas must pass several stages of analysis 
and approval. (See "New product development," at left.) 
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Figure 12. Harry Hughes' modification of the PAL16R8 logic diagram. To illustrate his proposal for a new device, 
he added four product terms (heavy lines) to control the enable, set, reset, and clock. 



The major problem with the unstructured PAL circuit 
was its inherent lack of testability. The power its architec- 
ture gives the customer also demands that the customer use 
it carefully. For example, the minimum setup time required 
before the register can be clocked must be carefully con- 
sidered when the clock signal is a complex product term. 
Data must be made available to a flip-flop early enough so 
that even the minimum time to activation of the clock satis- 
fies the setup time required. This required delay can be very 
difficult to calculate, especially if the clock term is under the 
control of another asynchronous register. All of the internal 
asynchronous signals must be handled with care to prevent 
invalid combinations. 

These and other testability issues caused great concern 
and led to several unique solutions. Internal test features 
were added, including the ability to test both DC and AC 
characteristics on unprogrammed devices and even the abili- 
ty to test programming characteristics without affecting the 
fuses in the actual array. Two external pins were given dedi- 
cated functions: one is an output enable that allows the 
device to be disabled during board-level testing; the other is 
for register preloading, which allows the user to gain access 
to the nip-flops without going through the fuse array. 

The final, approved product was called the 20FF10 at 
first. It had 20 array inputs and 10 outputs, and enough 
new features to earn it the name "Feature Freak"— the 
source of the "FF" in the designation. However^ a name 
must uniquely identify a product and must do so in terms 
the customer can recognize. The "FF" for "feature freak" 
was unique but not very meaningful to the customer, so this 
part of the name was changed to RA, for "registered asyn- 
"chronous," and the new device became the 20RA10. 

The approval document for the device described its archi- 
tecture, which is shown in Figure 13. The clock, set, and 
reset are controlled by product terms, as shown in Hughes' 



original sketch of the unstructured PAL circuit (see Figure 
12 again). 

A compromise was made for the output enable. An exter- 
nal output enable is required for most applications, especial- 
ly for disabling the device during board-level testing. But an 
internal (asynchronous) output enable for independent 
enabling of the flip-fiops is needed, too. One solution 
would have been an external enable pin for each output, but 
that would have been a highly inefficient use of external 
pins. The solution that was used is an enable signal that is 
the product of the external common enable pin and an in- 
ternal independent product term. Thus, the enable can be 
activated either externally (in common) or internally (asyn- 




Figure 13. The PAL20RA10 output cell. Note the 
separate product terms that control the set, reset, 
enable, and clock. Note also the output enable that 
can be controlled either externally (synchronously) 
or internally (asynchronously). 
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Figure 14. Logic diagram 
for the complete 
PAL20RA10 cir- 
cuit— AND array, OR ar- 
ray, and asynchronous 
registered outputs. 



chronously), satisfying both the design engineer, who wants 
the internal enable, and the test engineer, who wants the ex- 
ternal control. 

Another relatively new feature was implemented in this 
circuit— programmable polarity. This allows the output 
polarity to be either active low or active high. The design 
engineer can then write logic equations either way, allowing 
his software to match the sense of an equation to the actual 
device. This enables the designer to fit more logic into a 
single PAL circuit. 

The problem of how many registered and combinatorial 
outputs to provide had been solved in previous devices by 
offering a family of combinations. The solution to this 



problem for the PAL20RA10 came somewhat by accident. 
The situation of set and reset going active simultaneously 
had to be dealt with. We decided that allowing this situation 
to result in bypassing of the register provided a beneficial 
new feature. If both the set and reset product terms are 
activated, the register becomes transparent and the output 
becomes combinatorial. Because of this feature, the logic 
designer can decide at programming time how many 
registers he wants (from zero to ten) and on which outputs 
he would like them. This architectural flexibility is unique 
within programmable logic. 

These features are shown in the logic diagram in Figure 
14. However, though they describe exactly what the user en- 
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counters, they do not actually exist within the device itself. 
The logic diagram is only a logical equivalent to the actual 
layout of the circuit. 

The first thing to note about the actual design is that the 
programmable AND array and following OR array com- 
mon to all PAL circuits are actually implemented as an OR 
array and a following NAND array! This type of logic is 
■ required because of the architecture of the circuitry that 
senses the ones and zeroes in the programmable array. Opti- 
mized for bipolar PROMs, this circuitry senses current 
flowing from one or more of the transistors in a column of 
the array. Since it senses current if at least one of the tran- 
sistors is on, the logic is equivalent to an OR gate. Thus, the 
programmable array must be an OR array, as it is in a 
PROM. To create the logic of a PAL device, we placed a 
logical NAND array after the programmable OR array, 
creating an OR-NAND structure, that is, an inverted prod- 
uct of sums, which by Boolean arithmetic is equivalent to a 
sum of products. 

To maintain the flexibility of the 20RA10, we had to 
make various trade-offs. One example is the output drive. 
The original proposal specified an output drive of 32 mA. 
We lowered this to 8 mA to reduce the power requirements. 

Although standard 24-pin products were running as fast 
as 25 ns, the highest speed we could guarantee on the 
PAL20RA10 was 30 ns. This slight decrease in speed was 
due to the additional features of the device— a combina- 



torial output signal travels through the polarity control, 
register bypass multiplexer, and output buffer before it 
reaches an output pin. Monolithic Memories has been pro- 
gressively improving its process capabilities, and the part is 
being transferred to a faster process to increase its speed. 

Beyond speed, the fabrication process used for the 
PAL20RA10 created no specific limitations. The process 
was the workhorse of the bipolar efforts at Monolithic 
Memories at the time. It is a standard junction-isolated 
bipolar process, optimized for high speed (see "Process 
technology," at right). Extensive experience with the process 
results in high manufacturing yields and high reliability and, 
as a result of these, lower costs. In fact, the greatest effect 
of the process on the product is probably in the cost. The 
PAL20RA10 process is well understood and provides excel- 
lent results for what it costs. 

A die photograph of the 20RA10 is shown in Figure 15. 
The various blocks are outlined. The die had to meet a cus- 
tomer requirement — that it fit into a 24-pin, 300-mil-wide 
SKINNYDIP package. The additional features had in- 
creased the die's width, yet to fit into this package it had to 
be less than 140 mils wide. Some squeezing of the layout 
during the final stages of design allowed this requirement to 
be met. 

The PAL20RA10 is much more than a die, however. It 
also requires programming and software support. Program- 
ming support is the one critical service Monolithic Memories 



Figure 15. PAL20RA10 
die photo. 
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does not provide, and thus it depends on the major PLD 
programer manufacturers for it. But to be able to support a 
product at market release, the programmer manufacturers 
must be provided with the programming specification for 
the product months before it is released. Monolithic there- 
fore keeps in close contact with these manufacturers. 

Software support is provided by Monolithic Memories 
through its PAL assembler software package, PAL ASM 2. 
PALASM 2 takes Boolean logic equations, tests them 
against expected outputs, and converts them to a fuse map 
for a PAL device. A description of this fuse map can then 



be downloaded to a programmer in a standard JEDEC 
format for customization of the chip. 

Software support is a critical part of the product because 
it is through software that the customer implements his de- 
sign. A designer wants the conversion from what he knows 
to be the required logic to what a PAL programmer can 
understand to be as simple and easy as possible. PALASM 2 
allows the designer to express his function with Boolean 
equations. It then uses that input to configure output 
polarity, register bypassing, and all the other configuration 
features. 



Process technology 

The process used for the PAL20RA10 is a junction- 
isolated one. A CTOss section of the 20RA10 is shown in 
Figure A. NPN transistors are separated from one an- 
other by junctions. 

The process is a washed-emitter one. In this process, a 
layer of nitride (SijN,) is used as a mask. This mask layer 
defines the size of the emitter and the emitter contact and 
allows the emitter and the emitter contact to be the same 
size. In other processes, the emitter is usually made larger 
than the emitter contact to guarantee good contact. 
When formed at the same time, however, they can be 
the same size and still maintain proper contact. This 



allows the emitter to become the minimum feature size of 
the design, providing a smaller overall die size than the 
die size that could be provided by a standard process. 

A programming fuse is located between an emitter and 
a product line that in turn leads to a collector (Figure B) ; 
driving a large current through this fuse opens its narrow 
neck and causes it to change from a very low resistance 
to a very high one. The process does not cause the fuse 
to explode, as some claim; it merely severs the greater 
part of the connection. This change opens the link be- 
tween the emitter and the product line. 



Base metallization 




Emitter metallization 

Collector metallization ,ai 
^^>TiW 
■SiO, 



Figure A. Cross section of the 
PAL20RA10— NPN transistors 
are separated from one another 
by junctions. 




Fuse (cross section) 



Fuse' 
(top view) 



Figure B. Location of the pro* 
gramming fuse in the 
PAL20RA10. 
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The new features of the PAL20RA10 required that some 
new syntactical conventions be added to the PALASM 2 
software package. In addition to the standard logic transfer 
functions, the designer has to be able to specify several nov- 
el architectural situations. For example, he has to be able to 
specify register bypassing in either of two ways: implicitly or 
explicitly. He specifies bypassing implicitly by writing a 
combinatorial logic equation using an " = " sign instead of 
the ": = " sign, which is reserved for registered outputs. He 
specifies bypassing explicitly by activating the set and the 
reset on the flip-flop simultaneously, causing the register to 
be bypassed and the output to be combinatorial. 

The control terms are described by dot operators such as 
"term.set." We selected dot operators for the same reason 
they are used in popular word processing programs— they 



rarely occur in natural language and can therefore be given 
special significance. 

Third-party software is available that allows other forms 
of input such as schematic entry. ABEL from Data I/O 
Corporation and CUPL from Assisted Technology both 
provide high-level support for PLDs. 

As we asserted earlier, true customization of the 
PAL20RA10 involves much more than matching the func- 
tion to the customer's requirements. Speed and power con- 
sumption must be acceptable, design support for the device 
must be available, pricing must be reasonable, product 
availability must be high, and quality must be high. The 
PAL20RA10 has been in full production for well over a 
year, and to the good fortune of Monolithic Memories all of 
these requirements have been met. Moreover, FAEs are ex- 
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cited about teaching customers about a product that 
originated with one of their own members. In fact, the 
20RA10 was the first product proposed by an FAE to go 
into production at Monolithic Memories. Market accep- 
tance has been beyond expectations, and as a result FAEs are 
being brought even further into the product planning process. 

The PAL20RA10 is the first member of a new family of 
devices; new products are in various stages of proposal, de- 
sign, and production. A 20-pin spin-off, the PAL16RA8, is 
now in full production. A higher-speed version is in design. 
A version in a 40-pin DIP was considered, but since 
20RA10 outputs operate independently, the higher density 
and resulting larger package were difficult to justify. Two 
24-pin, 300-mil-wide parts can do more in less board space 
than one 40-pin, 600-mil-wide device. 
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Figure 16. A 
PAL20RA10 used 
as a universal port 
interface adapter 
for a Motorola 
MC68020 32-bit 
microprocessor. It 
generates logic that 
maintains both 
byte and word ad- 
dressability within 
the 68020's 32-bit- 
wide word format 
and that makes 
possible the use of 
both high- and 
moderate-speed 
components and 
peripherals in a 
single system. 



The PAL20RA10— an 
application example 

To demonstrate the capabilities of the PAL20RA10 and 
the PALASM-based design process, we present an applica- 
tion that uses the PAL20RA10 as a universal port interface 
adapter for a Motorola MC68020 microprocessor (Figure 
16). This example should help demonstrate the features we 
described above and point out how easily the 20RA10 can 
be customized to a particular user's application. 

The design engineer began this design by considering the 
large blocks: the microprocessor, the memory, and the I/O. 
Now, near the end of the design, he needs to place some 
glue logic between the microprocessor and the I/O ports. 
He is open to anything from standard to full-custom logic, 
and he considers each alternative. 

Because he is very unsure of the finality of the design, he 
does not want to commit himself to a major development 
effort. Therefore, he rules out full-custom logic and stan- 
dard cells. He may be able to make a gate array work, but 
he does not have time to work with a foundry in specifying 
the design nor time to wait for the foundry to deliver the 
part. In fact, he would like to have the part immediately. 
Moreover, since his design is not yet proven, he cannot jus- 
tify the high cost of a gate array development effort. 

He could use off-the-shelf ICs easily enough, but he does 
not want to take up that much space on his board. He has a 
limited amount of room with which to work. He could use 
standard LSI devices, but none provide the functions he re- 
quires. Thus, he arrives at the programmable logic device as 
the logical alternative. 

As we said above, in this design the PAL20RA10 func- 
tions as an asynchronous universal port interface adapter 
for a 32-bit microprocessor, the MC68020. Among the 
greatest challenges the engineer faces when designing with 
the new generation of 32-bit, high-performance micropro- 
cessors is maintaining both byte and word addressability 
within the microprocessor's 32-bit-wide word format and 
accommodating both high-speed and moderate-speed mem- 
ory components and peripherals in a single system. 

The MC68020 has a dynamic bus sizing feature that helps 
solve the first of these two problems— it has internal logic 
that manipulates and directs words smaller than 32 bits. 
However, it needs external logic to handle 8- and 16-bit 
memory arrays and peripherals. It even needs such logic to 
maintain byte addressability of 32-bit memory arrays. This 
external logic must be able to decode the transfer size con- 
trol signals from the MC68020 so it can generate memory 
chip selects and handshake back to the microprocessor with 
the port size acknowledge signal. The PAL20RA10 is well 
suited to this task. 

External logic also helps the MC68020 solve the second of 
the two problems. The 16.7-MHz speed of the MC68020 
version used here translates into 60-ns machine cycles and 
90-ns memory cycles, and therefore the MC68020 can keep 
up with many of today's fastest memories. But it may have 
to work with slower bulk memories as well. To do this with- 
out slowing down the high-performance memories, the 
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Table 2. 
DIP switch settings used in the universal port 
interface adapter to choose the data bus port 



Table 3. 

DIP switch settings used in the universal port 

interface adapter to choose the number of wait 

states. 



Position 


Data bus 


PSl PSO 


port size 


L L 


No port present 


L H 


8-bit 


H L 


16-bit 


H H 


32-bit 



Position 
WSl WSO 

L L 

L H 

H L 

H H 



Number of 
wait states 



1 
2 
3 



68020 must be able to insert wait states into the memory cy- 
cles of the slower memories. The 20RA10 can help the 
68020 do this by intercepting microprocessor memory access 
requests and, when needed, generating control signals to the 
microprocessor requesting wait states. 

In this design the PAL20RA10 functions as a universal 
port interface adapter. Both blocks of external logic we al- 
luded to in the previous two paragraphs are implemented in 
a single PAL20RA10. The asynchronous capabilities of this 
PAL circuit are required by this design because the 
MC68020 provides asynchronous control signals that must 
be processed by the PAL device between clock signals. On 
the other hand, the synchronous capabilities of the 20RA10 
are needed for generating memory control signals. An inter- 
nal PAL state machine must be synchronous with the sys- 
tem clock. Therefore, neither a combinatorial (i.e., asyn- 
chronous only) nor registered (i.e., synchronous only) PAL 
device will fill the bill. The PAL20RA10's unique combina- 
tion of features allows it to provide a single-chip solution. 

The PAL20RA10 used here generates the byte data select 
signals (UUD, UMD, LMD, and LLD) and the current port 
size select signals (DSACKO and DSACKl) and performs 
wait state generation. The designer selects the data bus port 
size and the number of wait states by setting DIP switches 
(Tables 2 and 3). 

The PAL20RA10 is a breakthrough product. It does 
not merely extend existing capabilities; it adds novel 
and highly useful features. It is also the first product 
produced by our company that originated with a customer. 
Its technical and market performance has exceeded our ex- 
pectations and has proven to us the wisdom of listening to 
customers. As a resiilt, we have devised ways of listening to 
them even better, sisi 

Questions about this article can be directed to Marc A. 
Baker, Strategic Marketing, Programmable Products Divi- 
sion, Monolithic Memories, Inc., 2175 Mission College 
Blvd., Santa Clara, CA 95054. 



Marc Baker is a strategic marketing engineer at Monolithic 
Memories, Inc., where he is responsible for the merchandis- 
ing and document literature for programmable products. 
He holds a BSEE from Stanford. 

Vince Coli is Monolithic Memories' strategic marketing 
manager in the Programmable Products Division. He was 
one of the original authors of MMI's handbook on pro- 
grammable devices. He holds a MSEE from Santa Clara 
University. 
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CMOS Logic Cell Arrays, only recently available from more than one source, are almost 
as useful as small gate arrays, hut don % carry the risk ofconuniting a design to silicon. 



By Marc A. Baker 

and Chris Jay 

Product Applications 

Monolithic Memories Inc. 

Santa Clara, Calif. 

The rise of advanced CMOS tech- 
nology has dramatically affected 
the PLD market. Although not 
as fast as bipolar devices, CMOS prod- 



the circuit by deselecting its chip se- 
lect input. Large EPROM devices 
may store three or four configura- 
tions, and tying the higher-order ad- 
dresses to DIP switches allows the 
user to select one of a given number 
of logic functions. Thus one card may 
carry a large EPROM and an LCA, 
allowing for some degree of de- 
ferred design. 



The LCA is actually a PLD, but its dense CMOS RAM 

enables designers to conGgure logic cells within 

the device and to specify I/O blocks. 



ucts are gradually approaching the per- 
formance that was expected of bipolar 
about two years ago. 

The logic designer gains two signifi- 
cant benefits from the latest in 
CMOS: greater densities for more 
complex logic, and reduced power con- 
sumption at low clocking rates. Since 
cutting power consumption means 
smaller power supplies and heat 
sinks, both factors reduce the size of 
electronic equipment. 

Most CMOS parts can be backed up 
by batteries, making them useful for 
RAM-based devices such as the Logic 
Cell Array (LCA). 

Although the LCA is based on a dense 
RAM architecture, the array itself is not a 
conventional RAM-based device. Actual- 
ly, it's a PLD, but the dense CMOS RAM 
enables the designer to configure logic 
cells within the device and to specify ^O 
blocks. The interconnected logic cells, 
called CLBs (configurable logic blocks) 
and lOBs (input/output blocks) allow com- 
plex circuits to be constructed on-chip. 
And the LCA's battery backup powers 
down to save data. 

Its dense architecture enables 
complex VLSI functions to be pro- 
grammed in the cell array. Also, the 
random glue logic that would other- 
wise ride on system cards is incorpo- 
rated into the array. Because the 
LCA is RAM based, it must be config- 
ured on power-up from a non-volatile 
source. Configuration data can .be 
read from a CMOS EPROM on a 
byte-wide basis. When the reading is 
done, the EPROM is switched out of 



Dual-layer metal, advanced twin- 
well CMOS has produced a program- 
mable logic device that can extend the 
PLD into the area of low-end gate 
arrays. 

Any custom or semicustom device 
must be tested, and PLDs are no ex- 
ception. But since they are only func- 
tionally configured, functional testing 
offers a high degree of reliability. 
Such testing can be done with the pro- 
grammer, immediately after pro- 
gramming. Production volumes can 
be taken care of simply by using an 
automatic handler. 

An alternative to user program- 
ming is factory programming: Manu- 
facturers can furnish factory pro- 



gramming or can create custom masks 
for hard-wired versions of the pro- 
grammed pattern. 

PLDs also are less expensive than 
other alternatives. Examining chip 
price alone, both mass-produced SSI 
devices and custom parts look very 
good. But other costs, such as inven- 
tory handling and non-recurring engi- 
neering (NRE) charges must be taken 
into account. 

One way to estimate the cost of 
such items as inventory, testing and 
board space is to add about $1 per 
chip. Thus, if five SSI chips cost only 
25 cents each, they actually cost the 
user $1.25 each, or $6.25. Those five 
devices can therefore be economical- 
ly replaced by a $5 PLD. Today, 
many PLDs cost less than the devices 
they'll replace, and prices will con- 
tinue to drop. 

A gate array can replace multiple 
PLDs, but creating a gate array de- 
mands a large NRE charge for, say, 
producing masks and generating test 
programs. The NRE charges for a 
large gate array, including design 
costs, masks, a possible design fix 
and test programs, can be tens of 
thousands of dollars. A PLD has no 
NRE charges. 

With the LCA, testability becomes 
an interesting issue — since the de- 
vice is RAM based, test programs can 
be loaded into it. Once configured for 
testing, test vectors may be applied 



IMPLEMENTATION COST COMPARISON 


Cost 


PLD 


SSI 


Gate An'ay 


Price 


Medium 


Very low 


Low 


Inventory & 
Handling 


Medium 


Medium 


Medium 


NRE 


None 


None 


Very High 


Design Time/ 
Equipment 


Low 


Low 


Medium-High 


Design 
Changes 


Medium 


Medium 


High 


Test 


Very Low 


Very Low 


High 



Quantity Required Low-Medium 
Per Application 



High 



Low 



«i!'{^wm^.^;vaiHi'u.'Wjaw»M«m" 



SSI chips still cost the least per unit. But since more are needed, circuit size can Iw limiting. 
Gate arrays cost less than PLDs, but their high NRE can make the difference in the long run. 



Copyright* 1987 by CMP Publications, Inc., 600 Community Drive, Manhasset, NY 11030. 
Reprinted with permission from Electronic Engineering Times. 
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to determine if the logic cells and I/O 
blocks are functioning properly. For 
example, when configuring an LCA 
with buried diagnostic shift regis- 
ters, serial diagnostic data may be 
shifted through the system to a seri- 
al output port, where the data would 
be available for analysis. When a 
complete system is configured for di- 
agnostic testing, internal nodes that 
were previously inaccessible become 
available via a serial diagnostic 
scan loop. 

When the LCA serves as part of a 
larger system, it can be initialized 
with buried shift registers perform- 
ing functional tests on the system 
itself. Once testing is complete, the 
LCA can be set up for the required 
function. 

Suppliers 

In 1978, only two suppliers were 
producing PLDs. Now there are more 
than a dozen, with several sources 
for the most popular parts. Higher- 
volume production and second sourc- 
ing alleviate concerns with factory- 
programmed devices, like gate 
arrays. Control of the design is re- 



er PALs are in the works for line 
encoding/decoding, terminal adapter 
interfaces, CRC error correction, 
frame synchronization and other ap- 
plications. 

Since the LCA is reconfigurable, it's 
possible to modify the logic after de- 
sign. In telecommunications, that 
means that functions can be changed to 
accommodate different communica- 
tions standards and protocols, or both, 
without resorting to separate logic cir- 
cuits for each. 

Tomorrow's Market 

Tomorrow's market will be driven 
more by users rather than suppliers. 
Applications beyond computers will 
be found, and engineers will push for 
the required technology. Moreover, 
users will drive product definitions. 
Software will be an integral part of 
the design solution and will require 
more adaptation to a user's design 
environment. An important factor in 
the increasing role of the user in the 
PLD market is the fact that PLDs are 
becoming part of up-front system de- 
sign, not just last-minute replace- 
ments of discrete logic. 

New settings for PLDs in high-speed 



Since the device is RAM based, test programs can be 

loaded into it, and vectors can be applied to see if the 

logic cells and I/O blocks are functioning properly. 



turned to the customer. 

In addition, the customer has 
choices not only in silicon but in ser- 
vice and support. Manufacturers have 
varying types of quality control, test 
procedures, reliability verification, 
software and technical support, price 
ranges and delivery capabilities. As a 
recent survey in EE Times showed, 
these issues are very important. Qual- 
ity control was the primary thing want- 
ed in a supplier, and software support 
was the second most important issue. 

Telecom Applications 

Because CMOS devices are not power 
hungry, they are welcomed in telecom- 
munications. 

Low power consumption in modems, 
fax machines and the like means more 
features and more functions without an 
increase in size. 

Further, in central switching sta- 
tions, units must be kept compact 
because many line interface cards 
are likely to placed in one rack, and 
many racks might share one cabinet. 

The LCA can put its talents to good 
use in telecommunications, includ- 
ing time-division multiplexing, line 
encoding/decoding and corner bend- 
ing circuits. Also, zero-standby-pow- 



processors, superminis and military 
equipment are fueling advances in bi- 
polar speeds. In the past, the fastest 
digital systems could not employ pro- 
grammable logic, but today's PLDs are 
matching and surpassing their speed- 
iest rivals. 

CMOS technology is being forced 
to meet the demands of telecom- 
munications, as well as adapt to in- 
dustry and instrumentation. Such 
equipment calls for low power and 
immunity to temperature, power and 
noise fluctuations. CMOS promises 
to meet those needs, although most 
CMOS PLDs today simply 



Specific Parts 

Users are also demanding more ap- 
plication-specific PLDs. At first ap- 
pearing to be redundant, applica- 
tion-specific PLDs are devices whose 
architecture is aimed at a particular 
application. Such architectures may 
boast bus interface controllers, sig- 
nal generators and decoders. True, 
these devices are limited to specific 
tasks, but will serve better than 
their generic kin. Suppliers are tak- 
ing risks in pursuing limited mar- 
kets, but the clear winner is the user 
who will soon have devices tailored 
to his needs. 

Recently, there's been an explosion 
in the number of PLD architectures 
and the amount of support software. 
The first PLDs were programmed by 
manually encoding the individual 
fuses, which then had to be blown. 
Later, software was supplied that 
automatically converted Boolean 
equations into programming pat- 
terns. Now, software can use a num- 
ber of design descriptions (equations, 
schematic capture, truth tables, etc.) 
and can simulate the design and cre- 
ate test vectors in addition to the pro- 
gramming pattern. 

A number of support packages are 
on hand to give the engineer the 
CAD facilities that ensure a success- 
ful design. Most of these packages 
play on an IBM-PC and start with 
assemblers that convert Boolean 
equations into fuse plots suitable for 
programming into the PLD. 

More complex devices require 
greater levels of software support, so 
state machine entry is made possible 
by many PLD languages. PLD assem- 
blers usually need not only Boolean 
literacy from system designers but also a 
detailed imderstanding of device architec- 
ture. State machine entry does not require 
such a high degree of understanding, so 
for these applications the logic designer 
can often use a descriptive rather than a 
mathematical language. 

User-friendly software tends to make 
an architecture transparent, and state 
machine entry ensures rapid turnaround. 
Small PLD designs can be turned around 



Users are also demanding more application-specific 
PLDs. At first appearing to be redundant, application- 
specific PLDs are devices whose architecture is 
aimed at a particular application. 



match low-power bipolar specs. To- 
morrow's CMOS PLDs will offer the 
zero standby power, wider operating 
conditions and dc characteristics of 
standard CMOS products. 



in minutes, and more complex devices in 
several hours. 

The software of tomorrow will be 
more responsive to users' needs, allow- 
ing for a multitude of design descrip- 
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tions while not limiting the implemen- 
tation to a particular device. Along 
with device independence will come the 
ability to handle multiple devices, with 
workstations offering the highest level 
of support. 

Programming equipment must also do 
more than just generate waveforms. Ad- 



en by manufacturers, tomorrow's will 
be guided more by the needs of the 
logic designer. As new applications 
arise, pressure will be put on both 
technology and architecture. Commu- 
nication with the customer will as- 
sume new importance, since it defines 
the right combinations. In addition, 



Just as today's market is driven by manufacturers, 

tomorrow's will be guided by designers. As needs arise, 

pressure will be on technology and architecture. 



vanced programmers will provide many 
test features, including complete func- 
tional testing. Or, programmers will be 
linked to more sophisticated testers. 

PLD suppliers are increasing the 
quality of their products, while adding 
test features to verify that quality. 
Power-up, reset and register preload- 
ing simplify testing, but the program- 
mer must be able to take advantage of 
them. 

Just as today's PLD market is driv- 



software from third-party vendors 
will make programmable logic more 
at home across the board. 

PLDs will continue to be used along- 
side discrete logic, fixed LSI functions, 
and gate arrays. But as the market 
matures, the benefits of the PLD over 
other options will become greater. In 
time, programmable logic will continue 
to enjoy a higher growth rate than the 
overall IC industry. n 
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Programmable Array Logic 



A look at the architectural differences 
between PALs and other programmable logic devices 



Vincent J. Coli 



PROGRAMMABLE LOGIC devices are 
integrated circuits that hardware designers 
can program to perform specific logic 
functions. Most PLD functions are avail- 
able from many vendors and in several 
technologies with different speed, power, 
and cost options. As with standard 7400 
chips, PLDs are available off your local 
distributor's shelf PLDs offer one distinct 
advantage over standard 7400 discrete 
logic: They are user-programmable. 

Most PLDs consist of two arrays of 
logic gates— an AND array followed by an 
OR array. The input signals to a PLD must 
first pass through an array of AND gates 
where combinations of the input signals 
are formed. Each group of AND com- 
binations is called a minterm in Boolean 
algebra or a product line in PLD nomen- 
clature. Then the product lines are 
summed in an array of OR gates. The in- 
put buffers generate both the true and 
complement of the input signals. 

Three basic types of AND/OR array- 
based PLDs exist: programmable read- 
only memories (PROMs), programmable 
logic arrays (PLAs), and programmable 
array logic (PAL) devices. The types are 
distinguished by the programmability of 
their arrays. 

In a PROM, the AND array is fixed and 
the OR array is programmable. In a PLA, 
both arrays are programmable. PAL 
devices have a programmable AND array 
and a fixed OR array. I will compare the 
PAL device to the PLA and PROM and 
then examine the architecture of some 
commonly used PAL devices. For a brief 
history of the PAL device, see the text box 



"Evolution of PALs" by John Martin 
Birkner on page 208. 

PROMs 

While most people think of PROMs as 
devices for storing fixed programs and 
memory, the PROM is also ideal for logic 
applications requiring less than 10 
inputs— especially when many product 
lines are required. PROMs designed as 
logic devices are usually referred to as 
PLEs (programmable logic elements). 

Figure 1 shows the PROM's fixed- 
AND/programmabie-OR arrays. For a 
discussion of notation used to describe 
PLD devices, see the text box "PLD 
Notation Panel" on page 210. Every input 
combination is available in the AND ar- 
ray, whether that combination of inputs is 
required or not. Since the AND array is 
hard-wired, it is not possible to perform 
logic minimization between input com- 
binations. 

The OR array is programmed to select 
the AND gate combinations (or product 
lines). Since every OR gate is connected 
to each product line, outputs may share 
product lines. For those familiar with 
memory design, the fixed-AND array is 
often called the address decoder, while the 
programmable-OR array stores the mem- 
ory bits. Another way of looking at this 
is that PROMs store the logic transfer 
function as a lookup table in memory. 

The advantage of PROMs is that every 
input combination can be decoded. The 
disadvantage is that the number of input 
pins available is restricted because the ar- 
ray size must be doubled for each addi- 



tional input. The arithmetic works like 
this: A PROM with n inputs and m out- 
puts requires an OR array of 2" lines deep 
by m lines wide. For example, a PROM 
with 10 inputs and 8 outputs requires an 
OR array of 2'° by 8 or 8192 fuse loca- 
tions. An 11th input would require that the 
array size be doubled to 16,384. Cost and 
performance constraints limit PROMs to 
13 inputs and 8 outputs. PROMs designed 
specifically for logic applications feature 
either 5 or 6 inputs and 16 outputs. 

PLAs 

The PLA structure offers the highest level 
of flexibility because both arrays are pro- 
grammable. Figure 2 shows the PLA's 
programmable-AND/programmable-OR 
structure. Because their OR arrays are 
programmable, PLAs, like PROMs, can 
share product terms among outputs. For 
example, one product line would be saved 
if two outputs required the same input 
combination (i.e., product line). 

Programmability in the AND array 
removes the restriction found in PROMs 
that the AND array must be large enough 
to provide all possible input combinations. 
This works because, statistically, only a 
conliiuicd 

Vincent J. Coli is a strategic marketing 
manager for Monolithic Memories Inc. 
(2175 Mission College Blvd. , Santa Clara, 
CA 95054). He has worked with the PAL 
products for the past si.x years with MMl. 
Vincent holds a B.S. in chemical engi- 
neering and an M.S. in electrical engi- 
neering. 



Reprinted with permission from the January 1987 issue of BYTE magazine. 
Copyright © by McGraw-Hill, Inc., New York 10020. All rights reserved. 
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The Evolution of PALs 



Computers used to be constructed 
from SSI, MSI, PROM, and RAM 
chips connected in jigsaw-puzzle fashion 
on many printed circuit boards plugged 
into a connector backplane. The com- 
puter designer's task was to build a func- 
tional unit such as a processor, disk con- 
troller, I/O controller, or memory board. 
If the design overflowed onto another 
board, connectors and ribbon cables had 
to be added. This made the design more 
expensive and sometimes risky due to 
noise coupling. The name of the game 
was to get it all on one board. 

Mixing and Matching TTL Chips 

Designers who had studied switching 
theory, information theory, Boolean 
algebra, and Quine-McCluskey mini- 
mization at college soon found that their 
textbooks would not be of much use. 
They learned that the practical art of 
computer design did not consist of op- 
timizing an architecture with an or- 
thogonal instruction set. It consisted of 
mixing and matching the collage of exist- 
ing TTL chips onto a single board until 
an approximation of the design goal was 
reached. They did not design state- 
control sequence logic from top-down 
state-graph theory, but rather, slapped 
down a 74174 hexadecimal register and 
some 7400 NAND gates. Control-logic 
design theories usually consisted of 
following signal lines around the logic 
schematic until, through superhuman 
powers of concentration, designers 
achieved clarity. 

Designers found the information they 
needed in the catalogs of young semi- 
conductor companies in California, 
Arizona, and Texas. A favorite was The 
TTL Databook by Texas Instruments. 
Most logic designers believed that 
74-series TTL parts found in this book 
would be second-sourced and could be 
"designed in." 

A processor design would begin with 
the block diagram consisting of an ALU, 
data path and register file, microprogram 
memory and sequencer, and then a small 
and obscure block called "control logic." 
It might have been a small block on the 
diagram, but the control logic usually 
represented the majority of the chip 
count. 

The control logic consisted of SSI/MSI 
gates and flip-flops connected together in 
random fashion, and there seemed to be 
no way to reduce it. The control logic 
also represented the area of highest 
design errors and was easily recognizable 



John Martin Birkner 

on the printed circuit board as the area 
with all the "cuts and jumpers." The 
engineering change notice (ECN) was the 
standard remedy for such errors and was 
a constant source of agony between 
manufacturing and engineering. Manu- 
facturing would use yellow wires to stand 
out on the green PC board. Engineering 
would use green jumper wires to camou- 
flage embarrassing mistakes. 

The engineering manager would "pilot 
release" the current revision PC board 
as soon as the green wire count was low 
enough to pacify manufacturing. The 
design engineers would then flee to the 
next design, where they were expected 
to cram even more functions onto the 
single PC board to beat the competition's 
new threat. 

I was convinced that there must be a 
better way to build computers. So, in 
1975, I packed my bags and headed for 
Silicon Valley. I remember seeing the first 
single-chip microprocessor systems on 
the market. They had one microcomputer 
chip surrounded by a sea of over 100 
SSI/MSI chips. The new LSI chips 
needed either some good planning so that 
they could talk to each other or some 
good "glue chips" to hook them up. 

The PROM, pioneered by Harris and 
Monolithic Memories, showed some 
promise as a universal and general- 
purpose glue logic element. Applications 
like memory-address decoding began 
showing up for the 32-word by 8-bit 
PROM. National Semiconductor pio- 
neered the programmable logic array 
(PLA) in a 14-in. 8-out. 96-product-term, 
24-pin fat (0.6 inch wide) DIP, bench- 
marked for 96-character EBCDlC-to- 
ASCII conversion. 

Intersil made a field programmable 
logic array, or FPLA, in the National pin- 
out, but with about half the product terms 
at 48. Signetics increased the package 
pins to 28, making the 16-in, 8-out. 
48-product-term 82S100 FPLA. These 
first attempts at providing the computer 
designer with LSI glue were met with 
mild enthusiasm. The new glue chips 
were too big (fat DIPs) and were slow, 
expensive, and hard to use. 

Monolithic Memories was the first 
company to take advantage of the bipolar 
fuse-link PROM technology to make 
some fast little FPLAs, as we first called 
them. We put them in industry-standard 
20-pin skinny (0.3 inch) DIPs, for 
minimum PC board area. We also re- 
duced the two programmable arrays down 
to one for 35 -ns high-speed operation and 



lower cost. We mimicked the TTL data- 
sheet specs down to the same ter- 
minology, graphics, and printing style to 
make the computer design engineer 
secure in replacing old 74-series TTL 
chips. We added programmable three- 
state output enable for I/O pin allocation. 
We added output registers with feedback 
for direct implementation of state- 
machine control logic from state graphs. 
We designed the programming algo- 
rithm to be compatible with existing 
PROM programmers, making low-cost 
programming possible. The first PAL 
programming module had a PAL in it. 
This presented a chicken-and-egg prob- 
lem that we solved by emulating with 
some PROM and SSI chips. The first 
PAL to be programmed was, of course, 
the pattern for the PAL programmer 
module. I headed the project, specified 
the design, and sold the customers. H. T. 
Chua provided a clever and ingenious cir- 
cuit design. 

New Design Methodology 

The new chips required a new design 
methodology. Actually, it was the same 
method that we learned in school, so we 
had to drag out our old textbooks and 
releam Boolean logic and top-down state- 
machine design. We showed the designer 
how to "design your own chip" using 
Boolean logic equations. We wrote the 
first silicon compiler, PALASM (PAL 
Assembler), and published the FOR- 
TRAN source in the PAL Handbook 
(available from McGraw-Hill), along 
with numerous design examples. 

The PAL chips replaced SSI/MSI chips 
at a chip-count reduction of 5 to 1. Data 
General gambled on the new single- 
sourced chips by designing them into the 
MV8(X)0 computer (see The Soul of a 
New Machine by Tracy Kidder). 

Apple put six PAL chips in the Macin- 
tosh. Soon the PAL chips were no longer 
single-sourced, as National Semiconduc- 
tor, Texas Instruments. Advanced Micro 
Devices, and others joined in licensing 
the now-patented PAL chips from MMI. 
Now you can find these chips every- 
where. Look at the PC expansion boards 
in this magazine; you can recognize PAL 
chips by their easy-to-read part-number 
system (e.g.. PAL16L8 and PAL16R8). 

John Martin Birkner is president of Struc- 
tured Design Inc. and coinventor of the 
programmable array logic (PAL) device. 
He can be contacted at 988 Bryant Way, 
Sunnyvale, CA 94087. 
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PLD Notation Panel 
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Figure A: Differences in the logic Figure B: The partially programmed 
notation for a TTL AND gate and a product line to implement A*B. 
PLD AND gate. 
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Figure C: Portion of the PLD array programmed to implement A*B. 
Having all fuses intact in the second product line causes a logic zero to be 
input to the second AND gate, which does not contribute to the sum at the 
OR gate. 
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Figure D: Tlie PAL device implementation of the function /Output = A*B 
+ /A*B. The standard combinatorial logic diagram of the function /Output 
= A*/B + /A*B is shown in the inset to figure D above. 



Because PLD structures are much 
different from ordinary TTL gates, 
new logic notations were developed for 
them. Figure A shows the logic 
convention adopted for a three-input 
AND gate. The PLD representation for 
an AND gate is called a "product line." 
Note that the three vertical lines are the 
inputs (A, B, and C), which are con- 
nected to the AND gate inputs through 
fuse links. An unprogrammed (or closed) 
fuse link is represented by an X at the 
intersection of an input line with a 
product line. If you wanted to disconnect 
one of those inputs from AND gate C, 
for example, you would remove the 
appropriate X from the point of inter- 
section for the C input line with the 
product line to signify a programmed (or 
open) fuse link. This product line, which 
now implements the A*B function, is 
shown in figure B. 

Since every input is available to every 
product line in a PLD, it is convenient 
to show the input lines as long lines 
running vertically through the array. 
Also, two input lines are associated with 
each input pin because both input 
polarities are available in a PLD. There- 
fore, the input buffer is shown with both 
a noninverted (true) and inverted (com- 
plement) output path; each path is hard- 
wire connected (shown as a dot) to an 
input line. 

Figure C shows a portion of a PLD 
array illustrating the input lines and 
buffers. Notice tha' an OR gate is added 
to the structure. All the fuse links in the 
lower product line are left intact, leaving 
the product line in a logic low (since true 
inputs are ANDed with complements), 
while appropriate fuse links in the upper 
product line are programmed to imple- 
ment the A*B function from figure B. 

It is common to implement two or 
more levels of logic gates such as an 
AND/OR/invert circuit in a PLD. For 
example, consider the following function 
implemented in a PAL device: 

/Output = A*/B + /A*B. 

Shown in figure D are the standard com- 
binatorial logic diagram (see inset) and 
the PAL logic equivalent for this 
function. 

Notice the details added to figure D 
that magnify the programmed fuse link 
for B in the upper product line. This 
magnification details each fuse link and 
its associated diode for a bipolar PAL 
device. A CMOS PAL device is similar, 
except an ultraviolet cell would substitute 
for the fuse link. 
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limited number of product terms is re- 
quired in any equation. Eliminating 
redundant combinations with logic 
minimization techniques, such as Kar- 
naugh maps, can reduce the required 
number of product terms even more. 
Therefore, almost any combination of in- 
puts can be decoded in a PLA. 

PLAs were the first products offered 
specifically for logic applications. Due to 
programming limitations, early PLAs 
were available only in mask-programmed 
versions. Just like on a ROM, a logic 
designer would indicate on the vendor's 
PLA AND/OR logic map where the de- 
sired connections were to be made. The 
vendor would then tool up a custom metal 
mask for the PLA to implement the 
customer's logic. Today, most PLAs are 
user-programmable. However, mask-pro- 
grammed PLA structures are used often 
in the control section of LSI/VLSI stan- 
dard logic chips, such as microprocessors, 
and offered in standard-cell libraries. 

In the world of engineering, there are 
always compromises. The facts reveal that 
a performance and silicon-die size penalty 
must be realized to provide the flexibility 
of programming both arrays. PAL devices 
are generally 5 to 10 nanoseconds faster 
than PLAs at the same power level and 
save the silicon area required to program 
and verify the second array. It turns out 
that the flexibility of a programmable-OR 
array is not required for most PLD ap- 
plications, but it can be useful for com- 
plex state-machine and sequencer applica- 
tions. 

Because of the long history of PLAs, 
their nomenclature can be a little confus- 
ing. Early vendors of user-programmable 
PLAs called their products FPLAs to 
highlight their "field programmability" 
and to distinguish FPLAs from factory 
mask-programmed PLAs. Just as ROMs 
and PROMs could be easily distinguished, 
so could PLAs and FPLAs. However, 
since most of the PLAs offered today are 
programmed by the customer, many ven- 
dors have dropped the F prefix and sim- 
ply call them PLAs. Furthermore, PLAs 
designed for sequencer applications are 
called PLSs (programmable logic se- 
quencers). 

PAL Devices 

Figure 3 shows the programmable- 
AND/fixed-OR array structure of a PAL. 
As with the PLA, having the AND array 
programmable lets the user program only 
the desired input combinations. But fix- 
ing the OR array requires that certain 
product lines be tied to specific outputs— 
typically, eight product lines per output. 
Many people use PAL and PLD synony- 
mously. Several PLD vendors add an E 
prefix to PLD, to come up with EPLD, 



which signifies ultraviolet erasable PLDs. 
Just as there are PROMs and EPROMs. 
now there are PLDs and EPLDs. 

The name HAL (hard array logic), 
refers to mask-programmed, or ROM. 
versions of PAL devices. If the volume of 
devices needed were large, converting a 
design to a HAL might be appropriate 
once the design is thoroughly debugged 
with the PAL. 

While all PAL devices are characterized 
by a programmable-AND/fixed-OR array 
structure, there is a whole line of PAL 
devices with different options. They come 
with varying numbers of inputs and out- 
puts. They might have feedback paths 
from the output back to the array. Some 
of these pins can be programmable I/O 



pins. They can have active-high or active- 
low outputs, or the output polarity might 
be programmable via an XOR gate and a 
fuse. Some come with registers at their 
outputs and are good for making sequen- 
tial circuits. Let's look at two commonly 
used PAL devices: the 16L8 and the I6R8. 

The PAL16L8 

One popular combinatorial PAL is the 
PALI6L8 (figure 4). Notice how the pins 
on the left side and bottom of the logic 
diagram (pins 1 to 9 and pin 11) are used 
for inputs and the pins on the right (pins 
12 to 19) are available as outputs. Pins 12 
and 19 can be used only as outputs, but 
six of the outputs (pins 13 to 18) are also 
conlinued 
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Figure 1: A simplified diagram of the fixed-AND/programmable-OR array 
structure of the PROM. Figures in this article are reprinted from Monolithic 
Memories' PAL Handbook (3rd ed.) with permission from Monolithic Memories. 
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Figure 2: A simplified diagram of a PLA showing that both arrays are 
programmable. 



available as inputs via the feedback line 
connection after the inverting output buf- 
fer. This feature, called programmable 
I/O, lets the user program each of these 
six pins to be either an input or output. 
I'll discuss programmable I/O in more 
detail later on. Now the PAL16L8 part- 
numbering scheme should be a little more 
obvious; 16 signifies the maximum 
number of potential inputs (10 dedicated 
inputs and 6 programmable I/O), while 8 
signifies the number of outputs and L 
signifies the output type, which is active 
low for this PAL part type. 

Refer to the logic diagram in figure 4 
and you'll see that the vertical lines run- 
ning through the array, numbered 
through 31, are the input lines. Notice that 
each input or I/O pin is associated with 



two input lines; one input line is connected 
to the true (or noninverted) sense of the 
input buffer, while the other input line is 
connected to the complement (or inverted) 
sense. This allows availability of both 
input-signal polarities to the array. 

The horizontal lines running through 
the array, numbered through 63, are the 
product lines. You can think of each of 
these product lines as an AND gate with 
32 inputs, which corresponds to the total 
number of input lines. Actually, both the 
true and complement of every input signal 
are connected via fuses to each product 
line before the device is programmed. 
This is the programmable-AND array in 
the PAL structure. To program the array, 
the user selects different combinations of 
conlinueJ 
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Figure 3: A simplified diagram showing the programmable-AND/fixed-OR array 
structure of a PAL device. 



input signals by disconnecting, via the 
blown fuse, the unwanted input signals in 
a product line. In total, 2048 fuses are 
available in this PAL device (64 product 
lines by 32 input lines). 

Notice that each output pin has eight 
product lines associated with it. The lower 
seven product lines of each group are 
summed at the OR gate, while the upper 
product line is connected to the inverting 
output buffer. The lower seven product 
lines and the OR gate provide the sum- 
of-products logic power for the R\L 
device. The OR gate determines whether 
any of the product lines are active, or true, 
and then the output buffer inverts the 
signal from the OR gate for output. Note 
that a product line with all fuses left in- 
tact will not affect the sum at the OR gate. 



since the logical result of each input 
ANDed with its complement is false. 

Programmable I/O 

This upper product line associated with 
each output controls the three-state logic 
in the output buffer. When this product 
line is active, or true, the output is enabled 
and the sum-of-products logic determines 
the output state. However, when this prod- 
uct line is inactive, or false, the output is 
disabled with the three-state buffer in the 
high-impedance state. This lets the out- 
put pin drive a three-state bus just like a 
74S240 octal buffer. Furthermore, since 
most PAL devices feature an output drive 
capability of 24 milliamperes, they are 
quite handy for bus interfacing. 
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Figure 4: 77;? actual logic diagram of the combinatorial PAL16L8. The fuse links are not shown so that a logic designer 
may place an X at those points where a fuse should he left intact. 
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The three-state product line, along with 
the feedback path on six of the outputs, 
makes the programmable I/O feature 
work. The pin is an input to the AND ar- 
ray when all the fuses in the three-state 



enable product line are left intact, while 
the pin is an output when all the fuses are 
programmed. Note that a product line will 
always be true, regardless of input com- 
binations, when all fuses are pro- 



grammed. The programmable I/O feature 
lets the user allocate pins for input or out- 
put as required by the application. 

An even higher level of flexibility is 
possible if you let the logic in the product 
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Figure 5: Tlie actual logic diagram of the PAU6R8. Notice the D-type flip-flops at the outputs. 
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line determine the pin's direction. This is 
done by programming a condition in the 
product line for which the pin will be an 
output. You can use this feature to allocate 
available pins for I/O functions or to pro- 
vide bidirectional transfer for operations 
such as shifting and rotating data. 

The PAL16L8 is used in applications 
such as complex decoders, encoders, 
multiplexers, comparators, and replace- 
ment of SSI/MSI random logic. Another 
way of viewing this is that the PAL16L8 
programmable AND array contains 2048 
fuses. You can program these fuses to 
create almost any configuration of up to 
250 AND, OR, and inverter gates, which 
is roughly 250 equivalent gates. 

PALs with Registered Outputs 
and Feedback 

The structure of registered PALs is similar 
to that of the PAL16L8 except for the ad- 
dition of the registered outputs. In the 
PAL16R8, each of the eight registers is ac- 
tually a D (data) flip-flop that is clocked 
on the rising edge (see figure 5). The 
clock signal (pin 1) is shared by all eight 
flip-flops. Each OR gate sums eight prod- 
uct lines and is the D input to the flip-flop. 
The Q output from the flip-flop is avail- 
able both for feedback into the PAL array 
and for output from the device. Either 
polarity of the feedback signal is available. 



This feedback lets the PAL device 
"remember" the previous state, and it can 
alter its function based upon that state. 
Thus, registered PAL devices are ideal for 
implementing single-chip state sequencers 
and state machines. 

Conclusion 

PROMs are limited in the number of in- 
puts they can handle, since every input 
combination is made available. They are 
useful for applications that require a large 
number of product terms but few inputs. 

PLAs are the most flexible of the 
AND/OR array PLDs with both arrays 
programmable. This flexibility makes 
them slower, since the signal has to prop- 
agate through two programmable arrays. 
PAL devices with their programmable- 
AND/fixed-OR array structure can ac- 
commodate more inputs than PROMs 
because, statistically, not every input com- 
bination is required. With only one array 
programmable, they are faster than PLAs. 

PAL devices with registered outputs are 
particularly useful for building sequential 
circuits. PAL devices also can provide 
feedback, altering the function of a given 
state based on the condition of the im- 
mediately prior state. The overriding ad- 
vantage of all PLDs, however, is the in- 
tegration of multiple functions onto a 
single programmed circuit to save board 
space and reduce chip count and cost. ■ 



2-710 



^ 



Monollthlo 



m 



Memories 



ri 



Logical Alternatives in 



Becoming the technology of choice in superminicomputer 
lesign, programmable array logic mixes with alternative 
devices to meet design constraints. 





1 — < 

1 


I r-IH- 


■ 1 1 


, ^ 












" 
























































* 
























e 




' 




































JZ" 




::::!z:: 















by Bradford S. Kitson and 
B. Joshua Rosen 



Programmable array logic devices have been a 
driving force behind the latest generation of 32-bit 
superminicomputer designs. These supermini- 
computers range from redesigns of existing archi- 
tectures that reduce cost or overcome packaging 
limitations, to designs requiring ultrafast turn- 
around, to high performance architectures speci- 
fied to take full advantage of such devices. 

Superminicomputers designed with program- 
mable array logic (pal®) include the VAX®-1 1/730 



Bradford S. Kitson is section manager in product 
planning and applications for programmable logic 
devices at Advanced Micro Devices, 901 Thompson 
PI, Sunnyvale, CA 94088. He holds a BS in electrical 
engineering and computer science from the University 
of California at Berkeley. 

B. Joshua Rosen is manager of processor design at 
Dataflow Systems Corp, 42 Nagog Park, Acton, MA 
01720. He was manager of processor development at 
Computervision Corp, Bedford, MA, when this article 
was written. He holds a BA in physics from Lawrence 
University and an MSEE from Northwestern 
University. 



from Digital Equipment Corp, the MV/8000 and 
Mv/ioooo from Data General, and Computervision's 
APU "(analytic processing unit). A reimplementa- 
tion of the original VAX-l 1/780, the VAX-ll/730 sup- 
plies 25% of the performance in 10% of the board 
space. Programmable array logic was chosen by 
the .MV/sooo designers to allow the shortest possible 
design cycle and to catch up with their competitors. 
The MV/IOOOO upgrades performance of the 
MV/8000. The Computervision APU was designed 
for high performance with PAL devices in mind, 
and provides excellent examples of how to use such 
devices to their fullest. 

Logic Design Alternatives 

Although by no means the only option, pro- 
grammable array logic has become the technology 
of choice in superminicomputer design. Logic 
design alternatives (Fig 1) include standard prod- 
ucts (fixed-function devices), semi-custom 
(programmable logic, gate arrays, and standard 
cells), and fully-custom logic devices. In supermini- 
computer design, the primary alternatives are stan- 
dard products, gate arrays, and PAL devices. The 
best choice for any given function depends upon 
the design alternative capabilities, the design con- 
straints, and the function actually being 
implemented. 

Reprinted with pormiaaion of COMPUTER DESIGN 
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Fig 1 Basic categories of digital logic present designers with 
trade-off opportunities. Standard products fit where cost is a 
major concern; custom and semi-custom products can be used 
where high diversity is desirable. 

Via fuse programming, a pal device allows the 
designer to construct a custom device or group of 
devices that precisely implement a desired func- 
tion. In contrast, fixed-function transistor- 
transistor logic (TTL) small scale integration/ 
medium scale integration (SSI/MSI) alternatives 
seldom seem to fit any application in the desired 
way. Thus, the SSI/MSI designer usually pays 
penalties via extra logic levels in the critical path 
and an increased package count. Gate arrays allow 
custom devices to be created via mask program- 
ming. However, designers using a gate array must 
finalize architecture early in the design cycle. Any 
errors will require a mask change, which can take 
months. 

In most cases, the best choice is a combination of 
the alternatives and probably includes some 
memory as well. Typically, the six basic design con- 
straints are performance (speed), cost, density 
(packaging), power dissipation, reliability, and 
design turnaround. Assigning a priority to these 
constraints will usually define the logic alternative 
that a machine is based on. Actual implementation 
trade-offs are made at the function level. The three 
basic functional portions of a design are data path, 
control path, and interface. 

Standard Products Have Their Place 

Standard products are defined as devices created 
for a wide market. Examples of standard products 
are TTL SSI/msi, fixed instruction set metal oxide 
semiconductor (MOS) microprocessors, and micro- 
programmable large scale integration (LSI) building 
blocks. These devices are usually multiple sourced 
and produced in high volume, resulting in lower 
individual device costs. In a design where cost is the 
main concern, and performance, power dissipation, 
density, and design turnaround are of little or no 
importance, standard products are probably the 
best choice. In a design such as a superminicom- 



puter, where these other considerations have a high 
priority, inherent disadvantages limit standard 
product use. 

While SSI/MSI devices offer fast individual gates, 
on a system level their density, power dissipation, 
and reliability characteristics are not as good as 
those of the alternatives. In addition, design turn- 
around characteristics are inadequate because 
changes usually require printed circuit (PC) boards 
to be laid out again. In most superminicomputers, 
therefore, SSl/MSl gates are used only in selected 
critical path functions that require one or two gate 
levels (at the expense of density and power dissipa- 
tion), and in interface applications, such as bus 
buffers, latches, registers, and transceivers. 

Standard LSI products (mostly bipolar) offer 
exceptional performance, power dissipation, density, 
and reliability characteristics, but their inflexible 
architectures limit their application range. Super- 
minicomputer designers rely on proprietary, highly 
complex architectures to differentiate their designs 
from those of their competitors, and LSI imposes 
an architecture. Therefore, applications are limited 
to general purpose, well-defined functions in the 
data path, such as parallel multipliers and arith- 
metic logic units (ALUs) that can benefit from their 
high performance characteristics. 

Gate Arrays Are "Cast in Concrete" 

Semi-custom gate arrays are defined by inte- 
grated circuit (IC) manufacturers as large arrays of 
unconnected gates. End users specify how gates are 
interconnected with actual interconnection occurring 
at the metal-mask layer of the IC process. The main 
advantages of gate arrays are high density and the 
ability to customize a design, while primary dis- 
advantages are cost and design turnaround. Each 
custom device is single sourced and low volume; 
therefore they are not cost-effective unless the 
application is density limited or the volume is very 
high. Gate arrays can adversely affect design turn- 
around because the system designer must design 
both the IC and the system in which the ic is used. 
In addition, any change in the gate array requires 
new masks and a delay for each mask iteration. 

By using gate arrays only where the design can be 
defined early, designers minimize these turnaround 
disadvantages. They then hedge their bets by sur- 
rounding the gate arrays with logic that can correct 
any design bug(s) discovered Ijiter on. As in LSI, 
what is "cast in concrete" is usually the data path. 
However, gate arrays allow more of the data path 
to be integrated because the device can be optimized 
for specific design requirements. A proprietary 
16-bit ALU slice might be a typical gate array. 

Gate arrays are also used to interface multiple 
onboard buses together as data path "glue." 
Control-path and interface applications, however, 
tend to be too likely to change. Therefore, control 
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path functions are based on implementation tech- 
niques such as writable-control-store (wcs). Inter- 
face applications frequently require changes 
because of the need to interface one designer's 
board to another's. Consider the critical timing 
between a cache, an instruction fetching unit, and 
multiple register-files. The exception handling 
capability required for typical operations, such as a 
cache miss, can be an indeterminant problem 
affecting all of the above-mentioned functional 
units in the system. Should a bug occur in the cache 
unit, the interface section of all other units will have 
to be changed to accommodate the correction. 

PAL Structure Paves the Data Path 

Combining simplicity and flexibility, the basic 
PAL structure is a fuse-programmable AND gate 
array that drives fixed connection or gates, allow- 
ing logic to be implemented in sum-of-products 
(AND-OR) Boolean form. 

By selectively blowing the appropriate fuses, a 
PAL device can implement any logic function as 
long as the number of inputs or AND gates required 
does not exceed the number provided in the device 



chosen. In the AmPAL16R4, for example, the true 
and complement version of each of the 16 inputs is 
connected via fuses to each of the 64 AND gates in 
the device (Fig 2). PAL devices provide additional 
features, such as programmable input/output (I/O) 
pins and registered outputs with internal feedback 
that further enhance their ability to implement 
logic functions efficiently. Programmable I/Os are. 
especially useful for trading off the number of 
device outputs for inputs to fit the exact number 
required by the logic functions being implemented. 
Internal registered feedback is desirable when 
implementing complex state machine designs. 

Programmable logic devices, like gate array, are 
semi-custom devices. Created by the IC manufac- 
turer, they are alterable by fuse programming for a 
specific application. Designed specifically for logic- 
oriented applications, PAL devices enable designers 
to create custom devices with fast turnaround time. 
PAL devices compare favorably to alternative 
devices in terms of performance, cost, density, 
power dissipation, and reliability. They fall behind 
gate arrays and LSI in density and power dissipa- 
tion, and behind SSI/MSI in individual device cost. 
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(Note however, that pal devices are cheaper on a 
system basis.) 

Performance and density characteristics of pal 
devices have led to significant applications in the 
data paths of superminis where they are used along 
with LSI and/or gate arrays. These devices serve to 
"glue" the LSI and the "cast in concrete" gate 
arrays into the system. If necessary, the design turn 
around capability that they display can be used to 
optimize the data path architecture and, in some 
cases, fix a bug in a gate array by reprogramming the 
PAL devices around it. Typical data path functions 
for the devices include barrel shifters, masking, 
code conversion, and multiple bus interface. 

Design turnaround becomes especially beneficial 
in the control path and interface portions of a 
design. Most control path functions are highly ran- 
dom and are prone to change and/or error While 
wcs allows design changes to be made by rewriting 
microcode, pal devices permit changes that cannot 
be made in microcode, or would adversely affect 
system performance. For instance, one supermini- 
computer manufacturer has established the rule 
that a gate array can be used in the control path 
only if eight or more PAL devices are necessary for 
the same function. In interface design, from a den- 
sity standpoint, PAL devices allow the interface to 
merge with the data path "glue" function. Since 
the interface is just as likely to change as the 
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Fig 3 Multiplier calculates 56-bit x 56-bit product. Partial 
product generation logic uses seven 8x8 slices to form 8 x 
56-bit multiplication array. Carry-save adders implement 
3-to-2 counting techniques to form two operands that are 
summed in a lookahead alu. 



control path, but without the benefit of vvcs, design 
turnaround becomes a factor. If one board's 
designer needs to change the interface, the designs 
of many other boards are impacted. Updates on all 
boards can be easily made by reprogramming one 
or more PAL devices. These devices can also pro- 
vide the drive capability required by this appli- 
cation area. 

Designing with PAL Devices in Mind 

Trade-offs among PAL devices and other digital 
logic alternatives were key factors in Computervi- 
sion's APU design. In conjunction with standard 
products, such as ALUs, LSI multipliers, and 
memory devices, PAL devices were used to create a 
patented architecture not feasible in standard TTL 
SSI/MSI. 

Designed as a very high speed 32-bit supermini- 
computer for engineering applications, the Com- 
putervision APU is twice as fast as competitive 
designs, yet occupies the same board space. Its 
designers, instead of merely replacing TTL SSl/MSl 
with PAL devices, used PAL as customizable logic 
building blocks. This allowed them to implement 
powerful logic functions in a minimum of space. 

The APU processor board set is divided into four 
modules. The parser/sequencer contains an instruc- 
tion processor that fetches and decodes instruc- 
tions in parallel with the execution unit. The 
control processor performs address and integer 
computations and the floating point pipe (fpp) per- 
forms both scalar and vector floating point opera- 
tions. The cache/address translation unit contains 
a 256-slot area page table entry cache and a 
16K-byte memory cache. 

Approximately 25% of the chips in the APU board 
set are in pal. Since the APU is the first implemen- 
tation of the new CPU architecture, many design 
aspects are subject to change as the architecture 
evolves. PAL devices permit designers to rapidly 
modify hardware to fit the architecture needs, and 
to implement feature and performance enhance- 
ments with minimal impact on the development 
schedule. 

Ability to generate a very large number of custom 
l.Cs (over 200 different PAL codes are used in the 
APU), significantly reduces the processor's size 
while increasing overall performance. This means 
that, although the apu and Digital Equipment 
Corp's VAX-1 1/750, a gate array-based machine, 
consume exactly the same amount of board space, 
the APU is more than twice as fast. In fact, the 
APU's Fortran performance is substantially faster 
than that of the vaX-1 1/780, a machine that con- 
sumes 5.2 times as much board space as the APU. 

Designed as a high speed arithmetic extension to 
the APU execution engine, the APU FPP, unlike 
comparable machines, is an integral part of the 
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internal architecture and not an optional add-on. 
As a result, the FPP not only accelerates scalar and 
vector floating point arithmetic, but also performs 
byte, word, double-word, and quad-word string 
operations. In addition, it serves to enhance the 
performance of important nonfloating point 
instructions such as Procedure Call and Return. A 
total of 79 PAL devices are used for both control- 
and data-path applications on the FPP board. 

Counter Solves Multiplication 
Problem 

The heart of the FPP is the multiplier section 
(Fig 3). Double-precision floating point multiplica- 
tion requires the calculation of a 56-bit x 56-bit 
product. Unfortunately, 56 x 56 parallel multipliers 
do not exist on silicon. From a cost/performance 
standpoint, the best solution is to use a number of 
small multipliers to build an intermediate sized par- 
allel multiplier and then produce a large product 
(56 X 56) in multiple cycles. 

Partial product generator FPP logic uses seven 
Am25S558 8x8 multiplier slices to implement an 8 x 
56-bit multiplication array. Each multiplier chip 
produces a 16-bil product. In general, the 8 most 
significant bits (MSBs) of each partial product gen- 
erator must be added to the 8 least significant bits 



(I.SBs) of the next higher slice to generate the full 
64-bit partial product. Exceptions are the 8 MSBs 
and LSBs. 

This technique also requires the ability to accu- 
mulate partial products with the partial products 
from previous cycles. Thus, each cycle must be 
accompanied by two additions: the partial product 
summation and the intermediate product accumu- 
lation. While this can be done by following the 
multipliers with two levels of lookahead adders, 
usually 74S181S, the resulting nanocycle time is 
approximately three times longer than the partial 
product generation time of the 8 x 8 multipliers. 
Modifying this scheme, however, by adding reg- 
isters between each level of logic, the pipeline 
multiplier reduces the nanocycle time to near the 
propagation delay time of the multiplier chips plus 
the clock to output time of the multiplier register 
plus the setup time of the intermediate result reg- 
ister. This scheme has two disadvantages: increased 
pipe latency, caused by the two extra levels of 
pipelining, and a high parts count. 

Still another technique involves replacing one 
level of the pipe and one level of lookahead adders 
with carry-save adders between the partial product 
generators and the pipeline registers. Carry-save 
adders are used to implement a technique that is 
called 3-to-2 counting. As seen in Fig 4(a), any 



Fig 4 Carry-save adders use 3-10-2 
counting techniques (u recode any 
three equally weighted bits into a 
2-bit field (a). Sixteen pai. units 
form triple 3-lo-2 counters (b) to 
reduce the three multiplication 
operands to two intermediate 
results. 



CP-|> 




ENABLE^>[]>- 



Monoliihio 



m 



Memories 



2-715 



Logical Alternatives in Supermini Design 



Without adding pipe latency, the APU 
is able to accumulate partial products 
at a rate of 8 x 56 bits every 112 ns. 



combination of 3 equally weighted bits can be 
receded into a 2-bit field. 

This makes it possible to reduce the three oper- 
ands generated by the multiplication process (high 
and low partial products and 64-bit intermediate 
product) into only two operands which may then 
be summed together in a single lookahead ALU. 
The 3-to-2 recoding requires no-carry propagate 
logic and is therefore very fast. Only one level of 
pipelining is required because of the 3-to-2 
counter's speed, resulting in both a reduced parts 
count and a reduced pipe latency. 

In the APU floating point engine, 16 AmPALl6R6s, 
programmed as triple 3-to-2 counters, are used to 
reduce the three multiplication operands to two 
intermediate results [Fig 4(b)]. The registered PAL 
outputs are connected to the input buses of the 
Mantissa ALU that is also used for floating point 
addition and subtraction. The Mantissa ALU then 
calculates the next intermediate product in parallel 
with the partial products calculations occurring in 
the 8 X 8 multipliers. This intermediate product and 
the new partial products are recoded by the 3-to-2 
counter PAL devices to form the next pair of inter- 
mediate results. This process continues until the 
complete 56 x 56 product is generated. Thus, with- 
out adding pipe latency, the APU is able to accumu- 
late partial products at a rate of 8 x 56 bits every 
112 ns, which coincides with the basic nanocycle 
machine time. 

Barrel Shift, a 3-Level 
Implementation 

The APU's barrel shifter performs left shift, right 
shift, and rotate operations of to 63 bits in a 
single microcycle. Used mainly for floating point 
prescale and normalize operations, the barrel 
shifter (Fig 5) is implemented in three stages: the 
word rotater, nibble shifter, and bit shift and mask 
logic. It is controlled by associated prescale, 
leading zero detect, and mask control logic. 

Prescale logic converts the signed difference pro- 
duced by the exponent arithmetic units based on 
the comparisons of the two operand exponents, 
into an absolute shift distance. This shift distance 
is then used to right shift (prescale) the smaller 
operand Mantissa of a floating point add or sub- 
tract operation. The leading zero detect logic deter- 
mines the left shift distance required to produce a 
left -justified (normalized) result. Mask control 
logic converts rotated data to shifted data by mask- 
ing off the appropriate leading or trailing bits to 



implement right or left shifts. These three sections 
are implemented in pal. 

Implementing the 3-level, 64-bit barrel shifter in 
MSI requires the use of Am25S10 4-bit shifters. The 
first level is the word rotater which performs a cir- 
cular rotate of 0, 16, 32 or 48 bits. Although imple- 
mentation is simple, the MSI solution requires 16 
packages. The second level is the nibble shifter, 
essentially identical to the word rotater but wired 
to rotate 0, 4, 8, or 12 bits. The final barrel shifter 
stage requires not only bit rotate but also leading 
and trailing bit masking and sticky bit computation 
(ie, the logical OR of the masked-out bits). An MSI 
solution requires not only the 16 packages of 
AM25S10S, used in each preceding level, but also 16 
AND gate packages for masking, with another 16 
AND packages for the sticky bit computation. Con- 
trol logic for the mask operation requires as much 
logic as the entire shift path. A more practical solu- 
tion consists of building separate left and right 
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Fig 5 Implementation of 64-blt "Nearest Neighbor Shifter" 
is in three stages: word rotater, nibble shifter, and bit shift and 
masit logic. Control derives from associated prescale, leading 
zero detect, and mask control logic. 
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Fig 6 Word ro(ali(>n, first level of 3-level shifter, consists 
of eight PAL units, each programmed as two 4-bit rotators. 
It performs a circular rotate of 0, 16, 32, or 48 bits. 



Fig 7 Nibble shifter makes up second level of barrel 
shifter. It is wired to rotate 0, 4, 8. or 12 hits. 

shit'tcrs, 48 packages apiece, and not implementing 
a sticky bit at ail. 

Implemented in PAI., a 64-bii rotatcr and shifter 
with sticky bit computation requires considerably 
fewer packages than a unidirectional MSI shifter. 
The word rotater consists of eight identical PAI. 
devices programmed as two 4-bit rotaters/package 
(Fig 6). The nibble shifter requires four Aml'^sios 
arid eight PAI. devices progran\med as 6-bit wide, 
4-place shifters (Fig 7). The bit shift and mask logic 
requires 16 P.AI. devices in the data path and two 
PAI. devices in the control path. (Fig 8). 

To implement the masking function required for 
shifting, a technique called "Nearest Neighbor 
Shifting" (U.S. patent pending, Computervision 
Corp) is used. Each shift and mask PAl. device has 
an enable input from one mask control P.M. unit. 
In addition, each shift and mask pal device is also 
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Fig 8 Bit shift and mask logic for barrel shifter requires 16 
PAi. devices in the data path and two pai devices in the 
control path, tjjch shift and mask pai device has an enable 
input from one mask control pai, device and is connected to 
enable inputs of its left and right neighbors. The 64-bit 
masking operation requires only 16 control lines. 
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Fig 9 "Nearest Neighbor" interconnection is used to 
implement masking function for shifting. A mask control p\i 
device determines if all 4 bits are to be masked; enables from 
the adjacent slice determine if the pai device is at shift 
boundary. 

connected to the enable inputs from its left and 
right-hand neighbors (Fig 9). The mask control 
PAL input determines if all 4 bits from the slice 
should be masked off. Enables from the adjacent 
slice determine if a PAI. unit is at a shift boundary. 
If only one of the neighboring slices is disabled, the 
shift and mask PAL unit masks off from to 3 of 
the bits adjacent to the disabled slice, depending on 
the bit-rotation distance. In this way, the 64-bit 
masking operation can be implemented with only 
16 control lines as opposed to at least 64 for an 
SSI/MSI solution. 

in addition to performing the final shift and 
mask operation, the bit-shifter PAL unit also com- 
putes the logical OR of the masked-out bits at each 
slice position. These outputs are then logically 
ORed to generate a sticky bit. The extra hardware 
required is less than two SSI packages. The entire 
PAL barrel shifter requires 38 devices to implement 
64-bit rotation, left shifting, right shifting, and 
sticky bit accumulation. An MSl-based left/right 
shifter, without sticky bit computation, requires a 
minimum of 96 parts. In addition, the logic re- 
quired for implementing the prescalc and normal- 
ize operations is significantly reduced through the 
use of PAL devices. 
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Introduction 

The introduction of Programmable Array Logic (PAL®) devices 
by Monolithic Memories in the late 1970s marl<ed the beginning 
of rapid growth for the programmable logic device market. By 
offering fast turn development of user-defined circuit functions 
and significant package count reduction, programmable logic 
devices (PLDs) solved such system design problems as time to 
market, system size and cost, and the cost of making design 
changes. 

First generation PLDs, such as the PAL16L8, PAL16R4, PAL16R6, 
and PAL16R8, offer relatively simple, single-function architec- 
tures. Designers choose either devices with combinatorial out- 
puts, registered outputs, or a combination of both, to solve their 
specific design tasks. 

While simple architectures still offer the fastest operating 
speeds, more universal architectures have emerged among 
second generation PAL devices, which allow more user defini- 
tion of the device configuration. The universal "macrocell" 
approach allows users to reduce the number of different parts in 
a system and enjoy considerable flexibility in defining the final 
PLD architecture. 

All of these devices enable logic designers to combine the tasks 
of several SSI devices into a single PLD. System functions in the 
form of small state machines or simple data flow machines are 
possible in a single PLD. But the ability of designers to build 
significant "systems-on-a-chip" has been somewhat hampered 
either by a lack of device flexibility or the inability to make full 
use of the logic power of the PLD. This has kept the benefits of 
PLDs inaccessible to some designers. The purpose of this paper 
is to describe some of the key features of a new PAL device 
which opens up a whole range of possibilities by removing many 
of these limitations. 

The PAL32VX1 

The new PAL32VX10 takes the synchronous logic approach 
used by the PAL22V10. Judicious improvements to the macro- 
cell provide logic power far greater than anything currently 
available in 24 pins. This is accomplished while maintaining a 25- 
ns propagation delay (tpp) with only 180-mA current drain. 

The PAL32\/X10 shares some of the features of the PAL22V10, 
such as varied product term distribution, and can emulate any 
PAL22V10 function. It is therefore appropriate to describe the 
individual features of the PAL32VX10 by contrasting them with 
the corresponding more limited options in the PAL22V10. The 
key features to be discussed are: 

1) Dual Feedback 

2) Programmable Flip-Flops 

3) Register Bypass Scheme 

After discussing these features we can look at the overall power 
provided by the integration of these features into a single PLD. 
Adapted from a paper presented at Northcon/86 



Dual Feedback 

The feedback scheme used in the PAL22\/10 (Figure la) pro- 
vides two sources for the feedback signal, only one of which is 
used at any given time. When the flip-flop is used, the feedback 
signal is taken from the output of the flip-flop itself; when the 
flip-flop is bypassed, the feedback signal comesfrom the output 
pin, allowing for bidirectional signal flow. 

There are two basic limitations in this approach. The first is the 
fact that registered outputs cannot be made bidirectional. One 
could add a separate feature allowing such an option (Figure 
lb), but such an improvement by itself provides an incremental 
benefit. The second limitation, which is substantial from the 
standpoint of device utilization, is the fact that if an output is 
needed as an input, the output macrocell for that pin is com- 
pletely wasted. 
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(b) A Possible, But Marginal Improvement 

Figure 1. PAL22V10 Feedback Selection Scheme 

The approach taken in the PAL32VX10 is to provide two com- 
pletely independent feedback lines: one from the flip-flop, and 
one from the output pin (Figure 2). That this solves the first 
limitation is obvious; the output can always be disabled and used 
as an input without interfering with the signal being fed back 
from the flip-flop. 
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Figure 2. PAL32VX10 Dual Feedback Scheme 

But the benefits go far beyond that. Using the output pin per- 
manently as an input does not require that the macrocell be 
wasted: the flip-flop can be "buried", allowing internal state 
machines to be built. This removes the second limitation of the 
PAL22V10 scheme. Alternatively, the macrocell can be used as 
an input register by feeding the input into the flip-flop, and then 
using the flip-flop feedback to drive other logic In the device. 
This same approach can also be used when the pin is used as an 
output, allowing the flip-flop to be loaded directly from its output 
pin. 

Thus the thirty-two inputs to the array consist of twelve dedi- 
cated inputs, ten output pin feedback inputs, and ten flip-flop 
feedback inputs. 

Programmable Flip-Flops 

PAL devices have traditionally been equipped with D-type flip- 
flops for data synchronization and state machines. D-type flip- 
flops work quite well for many state machines, but in many other 
cases, J-K or T-type f I ip-f lops would allow for implementation of 
more complex state machines. Yet offering alternative products 
with other dedicated types of flip-flops would unnecessarily 
overpopulate the product family, complicating the designer's 
device selection and inventory tasks. 

The ideal solution is to have a single device which can have each 
flip-flop Individually programmed to define its type. And, as will 
be seen, it is sometimes beneficial to be able to change types "on 
the fly". These capabilities have been provided in the PAL32VX1 0. 

T-type flip-flops 

As an example, the classic application involving T-type flip-flops 
is in the design of binary counters. When a D-type flip-flop is 
used, the Boolean equations have to account for each time the 
device output is to be a logic 1 . This means that the nth bit of the 
counter requires n product terms: 

Qn+:=/Qn*Qn-r...*Q1 
+ QnVQI 
+ Qn7Q2 



+ QnVQn-1 

By using a T-type flip-flop, one only has to determine when the 
flip-flop output should toggle. The nth bit toggles only when all 
lower order bits are "1". This can be expressed as 

"Qn should HOLD UNLESS all lower order bits are HI." 

The UNLESS operatorcan be implemented by an Exclusive-OR 
(XOR) gate: 

Qn+ := Qn :+: (Qn-1*Qn-2*...*Q1) 



This requires two product terms (one on each input of the the 
XOR gate), regardless of which bit of the counter is being 
defined. 

So by providing an XOR gate with Q fed back to one input, the 
combination XOR gate/D-type flip-flop acts as a T-type flip-flop. 
A sum of products on the other XOR input provides the Toggle 
signal. 

J-K flip-flops 

The classic formula for a J-K flip-flop is 

Q+ := JVQ + /K*Q. 

This expression can be directly implemented in a D-type flip- 
flop, providing /K fits within the available product terms. If this is 
not the case, or if finding /K is inconvenient, another formulation 
is needed. We know that by the definition of a J-K flip-flop, 

Q should HOLD UNLESS: 

the output is and we SET to 1; 
the output is 1 and we RESET to 0; 
we are toggling (JK= 11). 

The above conditions for "not holding" can be plugged into a 
Karnaugh map and minimized to give the following expression: 

Q+ := Q :+: (JVQ + K*Q). 

Here, with the help of the XOR gate, a J-K flip-flop is imple- 
mented directly in terms of J and K, instead of /K. Again, this is 
done by feeding to one input of the XOR gate, and by ORing 
several product terms into the other XOR input. These terms are 
divided between J terms and K terms; J terms are ANDed with 
/Q, while K terms are ANDed with Q. 

So we can implement J-K, T, and D-type flip-flops by using the 
XOR gate in combination with a D-type flip-flop. By setting one 
XOR input to 0, /0, 1 , or 0, we can implement J-K, /J-/K, J-/K, or 
/J-K flip-flops. The other XOR input provides a sum of products 
for J and K. Of course, since S-R flip-flops are a subset of J-K 
flip-flops, they can be implemented in the same fashion. 



Reconfigurable flip-flops 

This is a step in the right direction, but one can do more. 
Consider the design of a loadable counter This is a situation 
where a T-type flip-flop is preferred when counting, but a D-type 
is better when loading. A T-type configuration can be converted 
to D-type, however, if the LOAD control input is used to disable 
the XOR input containing Q. 

Thus by making the XOR input a product term, instead of a 
dedicated single input, the flip-flop type can be changed on the 
fly to allow each particular function to be implemented as effi- 
ciently as possible. Note that PLDs which provide programma- 
ble flip-flops without the XOR control product term cannot have 
their flip-flops altered once configured. So, for instance, adding 
a LOAD feature to a counter then requires twice as many extra 
product terms as required with a reconfigurable flip-flop. 

All of these considerations have been accommodated in the 
PAL32\/X10 (see Figure 3). The single XOR product term pro- 
vides control of the flip-flop type, for either permanent or recon- 
figurable flip-flops. From eight to sixteen product terms are 
available as D or T inputs, or to be divided between J and K in- 
puts. Some of the possible configurations are shown in Figure 4. 
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Figure 3. The PAL32VX10 Programmable Flip-Flop 
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Figure 4. A Few Selected Flip-Flop Configurations 
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Register Bypass 

The two primary second-generation PLDs, the PAL20RA10 and 
the PAL22V10, have different ways of bypassing their flip-flops. 
The PAL22V10 has a single fuse which configures the output 
permanently (see Figure 5a). Once the configuration is chosen, it 
cannot be changed. On the other hand, the PAL20RA1 bypasses 
its flip-flop by activating both SET and RESET simultaneously 
(see Figure 5b). This can be done dynamically, but since the SET 
and RESET affect the contents of the flip-flop, one cannot, for 
instance, clock data into the flip-flop, change to combinatorial, 
and then change back to look at the flip-flop; the data in the 
flip-flop will have been lost. 

The PAL32VX10 eliminates this restriction by using a product 
term to control the bypass multiplexer (see Figure 5c). While the 
output is combinatorial, the flip-flop is unaffected. This makes it 
possible to use the flip-flop in macrocells that are sometimes 
combinatorial. 



Programmable registered or combinatorial outputs 

Dual independent feedbacic paths allow I/O with combinatorial 
or feedback register outputs 

Programmable flip-flops allow J-K, S-R, T or D types 

Programmable output polarity 

Register set and register reset can be asynchronous or 
synchronous 

Automatic register preset on power up 

Varied product term distribution 

Up to 16 product terms per output 



Figure 6. Features of the PAL32VX10 
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(b) The PAL20RA10 Scheme 




Figure 7. The Complete PAL32VX10 Macrocell 
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(c) The PAL32VX10 Scheme 
Figure 5. Register Bypass Schemes 

The Complete Macrocell 

This has not been an exhaustive description of all of the features 
of the PAL32VX10 macrocell, a summary of which is shown in 
Figure 6. This paper has .merely highlighted the more revolu- 
tionary capabilities. The complete macrocell is shown in Figure 
7. All of the elements work together to offer unparalleled flexibil- 
ity, without compromising the performance of the device. 

An application will serve to illustrate the powerafforded by all of 
these features operating in concert. It is beyond the scope of this 
paper to describe the details; a brief description will suffice here. 



A PDP-1 1 Floating-Point 
Coprocessor Controller 

This device monitors a PDP-1 1 bus for instructions that should 
be sent to a coprocessor. As shown in Figure 9, it provides the 
PDP instruction decode, an instruction tracking state machine 
which provides various wait states for operand fetching, and 
coprocessor instruction encoding. The 3-bit state machine is 
not needed externally; thus it is buried, and the output pins 
provide additional inputs. Reset capability and a test feature for 
examining the buried flip-flops are included. 

This application takes advantage of the varied product term 
distribution, the buried flip-flops, and J-K, T, and D-type flip- 
flops. To implement the same application without a PAL32VX10 
would require: 

1 PAL22V10 
1 PAL16R4 
1 PAL14H4 

Of course, since we are talking about replacing several PAL 
devices with another PAL device, the number of discrete TTL 
chips that are being replaced is very large indeed. 
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Summary 

The PAL32VX10 provides significantly increased flexibility as a 
result of a few carefully placed improvements. The addition of 
another feedback line, an XOR gate, and a few other enhance- 
ments allows full utilization of the device. The utilization can be 
further optimized through the ability to represent the desired 
functions in the most compact form possible. These capabilities 
combine to form the most versatile 24-pin PLD available to date. 



PAL32VX10 



Figure 8. Coprocessor Controller 
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Introduction 

With increasing density and functionality, programmable logic 
devices are being used for such specialized applications as 
state machines. The PROSE (PROgrammable SEquencer) 
PMS14R21 device is the first device developed for state ma- 
chine design which integrates the hardware and software as- 
pects into a new design methodology. On a single device, the 
PROSE architecture combines the advantages of both PROM 
and PAL elements for implementing internal state sequencing 
functions, and provides a powerful device for designing state 
machines. These architectural features also lend themselves 
easily to the requirements of software design tools (such as 
PALASM®2 software) for providing an easy-to-use, high-level 
state machine design syntax. 

The PROSE design methodology offers high functionality and 
ease of use along with the traditional advantages of program- 
mable logic devices, such as instant customizability, fast turn- 
around time, low development cost and reduced real estate 
requirements. 

State Machine 

A state machine is defined as a digital logic system which tra- 
verses through a sequence of states in an orderly fashion. The 
state is a set of values measured at various points in the digital 
system. This state value is typically stored in a set of storage 
registers.Asimplemodelofa state machine is shown in Figure 1. 
It consists of three basic elements: storage registers, next 
state decoder, and output decoder. 
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Figure 1. General Model of a State Machine 

The storage registers perform the functions of storing present 
state of the state machine as well as the present output infor- 
mation (in which case it is also referred to as the pipeline regis- 
ter). The next state decoder decodes the present state of the 
machine along with the present input conditions and generates 
the next state of the machine. Typically it is combinatorial logic 
in PALVPI^-based state machines and is a PROM in microcoda- 
ble sequential state machines. 

The output decoder determines the outputs of the machine from 
the present state of the machine and the input conditions. It is 
not mandatory to have an output decoder as the outputs can 
also be generated directly by the registers. 



Sequential state machines are usually classified into two 
types depending upon their output generation. In Mealy type 
state machines, outputs are a function of the present state 
and input conditions. The output decoder takes the present 
state and input conditions, decodes them and generates the 
outputs. In Moore type state machines, outputs are strictly a 
function of the state of the machine. Thus, the Moore machine 
usually requires a simpler output decoder to decode the 
present state only, or none at all in a case where the storage 
registers can be used to provide these outputs directly. 

As transitions to the next state are dependent upon the 
present state and input conditions, the operation of a Mealy 
machine can be emulated by a Moore machine, provided that 
one cycle delay is allowed between input conditions and output 
generation. 



State Machine Representation 

The first step in the design methodology requires converting 
the design problem into its state diagram representation. The 
state diagram shown in Figure 2 consists of state identifiers, 
input conditions and the outputs generated. The states are 
represented by bubbles, with arrows indicating transitions to 
other states. The input conditions as well as outputs gene- 
rated are associated with these transitions. 




Figure 2. A Typical State Diagram 

There are two major operational requirements for state ma- 
chines as shown in Figure 3. 

• Control sequencing 

• Output generation 

Control sequencing is changing from one state to another in a 
state machine. If the transition from the present state to the 
next state is dependent upon the present state, it is direct 
control sequencing. If the transition from the present state to 
the next state is dependent upon the present state and the 
input conditions, it is called conditional control sequencing. 



Adapted from a paper presented at Electro/87. 
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As mentioned earlier, the output generation can be based upon 
the present state for a Moore type of state machine, or the 
present state and inputs for a Mealy type of state machine. 
The state diagram representation includes both the control 
sequencing and output generation representations. 
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Figure 3. Instruction Execution in State Machines 

The design process involves converting the hardware problem 
to this state machine representation. The system design is 
analyzed for its signal logic and timing requirements. The out- 
puts generated are based upon the system signal logic re- 
quirements, and the state transitions depend upon the system 
timing requirements of these signals. 



Traditional Microprogrammable 
Sequencers 

Traditional high-end microprogrammable sequencers fit this 
general model of state machines. They use simple clocked 
D-type registers as pipeline registers as shown in Figure 4. 
Typically they do not have an output decoder and allow only 
Moore machine implementations. 
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For the next state decoder, microprogrammable sequencers 
use a PROM array. Each location of the PROM functions as a 
state. PROMs have been traditionally used for this purpose 
because of their familiarity and relative ease of use. This 
PROM is addressed by the pipeline register field storing the 
present state. The PROM data bits are inputs to the pipeline 
register which are loaded at every clock cycle. The PROM data 
consists of the next state (branch), output logic values, and 
an instruction op-code. 

Output generation in microprogrammable sequencers is the di- 
rect transfer of PROM data to the output field of the pipeline 
register on every clock cycle. 

For direct control sequencing, next state values from the PROM 
are loaded into the pipeline registers at every clock cycle, rep- 
resenting a state transition. In most microprogrammable se- 
quencers, a binary counting mechanism is also added to the 
state registers. This allows the PROM to be addressed with a 
state counter. This is effectively control sequencing with serial 
binary state numbers. The PROM next state value is used only 
when a non-serial branch is required. A branch select multiplex- 
er is used to select between this serial count and the branch for 
the next state of the machine. 

For conditional control sequencing an input condition testing 
multiplexer is used. Usually it tests for one of the inputs at logic 
high or low. Based on this input condition, the branch select 
multiplexer is controlled, providing conditional branching. 

The instruction op-code stored in the PROW for every state 
controls the sequencer operation for that state. It decides be- 
tween direct and conditional control sequencing. It also decides 
the input condition to be tested. These instruction op-codes are 
stored in the data form and are decoded just before execution. 
Extra hardware is required to decode these instruction op- 
codes and generate internal multiplexer control signals. This 
slows down the device operation by the time required to decode 
the instructions. 

The combination of instruction op-codes, output, and next state 
values is called the instruction microcode. The design process 
requires writing instruction microcode based upon the state 
diagram. This approach to a microprogrammable sequencer, 
though popular, imposes several restrictions on the state ma- 
chine operation as the instruction op-codes are predefined and 
fixed. These fixea op-codes can only perform certain opera- 
tions and the state machine design is usually modified to fit 
those restrictions. This means that the sequencer operation is 
usually not optimal for the system design problem. 

For example, condition testing, which is restricted to one input 
testing only, may not fit a design requiring simultaneous testing 
of two signals. The branching is also restricted to a two-way 
branch. For multi-way branching, a microprogrammable 
sequencer requires multiple cycles which could disrupt the 
system signal timing requirements. In some devices further 
dedicated hardware is added to improve the functionality of the 
sequencer. Such hardware is usually under the control of in- 
struction op-code and provides features such as testing more 
than one input or branching to more than one next state. 



Figure 4. Traditional Microprogrammable Sequencer 
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Figure 5. A Block Diagram for the PROSE Device 



PROSE Device Architecture 

The PROSE device architecture is very similar to this model of a 
state machine. Its architecture (Figure 5) is optimized to per- 
form the functions of a sequential state machine as represent- 
ed in the model. For the storage registers, the PROSE device 
provides a set of twenty-one pipeline registers. These registers 
are used to store the present state and the outputs generated 
by the machine. This is similar to the microprogrammable 
sequencers. Beyond this however, the similarity ends. The 
architecture of the PROSE device has been defined specially to 
optimize its functionality. 

The next state decoder in the PROSE device consists of a 
PROM. This PROM stores the next state, outputs and branch 
control information. The output generation is similar to micro- 
programmable sequencers, and is a direct transfer of PROM 
data to the output field of the pipeline register. There are eight 
outputs in the PROSE PMS14R21 device. 

Control Sequencing 

Direct control sequencing in the PROSE device is implemented 
by the PROM. The depth of the PROM is 128, allowing up to 128 
states. Seven bits (A0-A6) of the pipeline register's next state 
field are used to address the PROM to provide the state transi- 
tion capability. For direct control sequencing the next state val- 
ues from the PROM are loaded into the pipeline register's next 
state field, for a state transition. 

Conditional Control Sequencing 

For conditional control sequencing, a combinatorial PAL array is 
provided. All of the eight conditional inputs 10-17 are direct in- 
puts to this PAL array. This PAL array provides two major ad- 
vantages over microprogrammable sequencers: 

• It allows selection of a combination of input signals for condi- 
tion testing. This is very useful for many system designs which 
require testing of a combination of two or more signals at partic- 
ular logic levels. This capability is not available in most 
microprogrammable sequencers. 

• It allows multiple condition testing for multiple branching. 
Microprogrammable sequencers require more than one cycle 
to implement multiple branching on multiple conditions. A four- 



way branch can be implemented in the PROSE device in a sin- 
gle cycle without disrupting the system timing requirements. 

Conditional control sequencing also requires present state 
information for allowing state-dependent conditional 
branching. The state bits A0-A6 are not used as inputs to the 
PAL array to provide the state information. An easier way to 
provide state information to this PAL array is to use extra bits 
from the pipeline registers and PROM as its inputs. These six 
bits, CS0-CS5, form the condition select field, which allows 
user-definable bit combinations to be used for any state. This 
optimizes the use of the PAL array product terms, since the 
same bit combinations can be used for testing the same input 
conditions for different states. 

Branching 

To implement branching, the two outputs of this PAL array, XO 
and XI, are XORed with the two most significant bits of the 
state address (XF1 and XFO) from the pipeline register. This 
gives a capability of inverting state bits A5 and A6 depending 
upon the condition testing in the PAL array. This conditional 
change of next state address provides a four-way branch cap- 
ability. 

For direct control sequencing, no product term in the PAL array 
is selected. The programmable array outputs XI and XO are at 
logic low level, allowing next state bits A6-A0 to decide the next 
state directly. The conditional select bits, CS5-CS0, should not 
select any product terms from the programmable array for 
states with direct control sequencing. 

For a two-way branch, either XI or XO is asserted HIGH on con- 
ditional inputs. This causes a branch to a location where the bit 
value of A6 or A5 respectively is inverted. 

For three-way and four-way branches both XI and XO are 
asserted high depending on multiple condition inputs. The se- 
quencer branches to locations where the bit values of A6 and 
A5 are 00, 01, 10, and 11. The PROSE device allows up to a 
four-way branch, which is sufficient for most designs. 

In the PROSE device, the next state decoder has been effec- 
tively partitioned into a PROM and a PAL device, to utilize the 
efficiencies of the two for control sequencing and conditional 
testing respectively. 
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PROSE Device Programming 

Figure 7a shows the state diagram representation of direct con- 
trol sequencing; Figure 7b shows conditional control sequenc- 
ing. Similarly, output generation can be represented as shown 
in Figures 7c and 7d for both Mealy and Moore machines. The 
next step in the PROSE device design process is to convert 
these state diagram representations into a textual entry form. 



A. Direct Control 

Sequencing 

STATEA := VCC -> STATE B 



B. Conditional Control 
Sequencing 
STATEC := II -> STATED 
+/11->STATEE 





C. Moore Machine 
STATEA.OUTF := 01 



D. Meaiy Machine 
STATEA.OUTF := 11 ->01 





Figure 7. State Machine Operations and PALASM 2 Software 



The instruction/operation mechanism of the PROSE device, 
however, is slightly different from the microprogrammable se- 
quencers. The PROSE device uses PALASM 2 software for pro- 
gramming, which allows easy textual constructs that have a 
direct relationship to the hardware operation. The instruction is 
divided into two distinct segments: one for control sequencing 
and one for output generation. 



Control Sequencing Instruction 

The format for control sequencing instruction is as follows: 

(present_state) 

:= (condition_1 ) -> next_state_name_1 

-H (condition_2) -> next_state_name_2 
+ (condilion_3) -> next_state_name_3 
+-> default state name 4 



Based upon the state diagram, for each present state, transi- 
tions/branches to other states can be entered. These transi- 
tions take place in the PROM of the device. The conditional- 
inputs can also be described in the condition field of the instruc- 



tion syntax. This is the condition testing done in the PAL array 
of the PROSE device. For each present state a maximum of 
four branch states can be provided, which is the device archi- 
tecture limit. The logic values of next states differ from each 
other by the most significant two bits only. PALASM 2 software 
automatically assigns these logic values to the state names 
while minimizing state locations in the PROM. PALASM 2 soft- 
ware also assigns the values to the conditional select field while 
ensuring the optimization of PAL array product terms. 



Output Generation Instruction 

The output generation instruction syntax is as follows: 

(present_state).OUTF 

:= (condition_1) -> (output_value_1) 

+•> (default_output_value) 

The present state is the state where the outputs need to be 
generated. The default output value is the logic level of the 
outputs at the present state which is also the syntax for a 
Moore machine implementation. PALASM 2 software also al- 
lows a Mealy machine implementation using the constructs for 
generation of conditional outputs. As the architecture of the 
PROSE device allows only Moore machine implementation, 
PAUVSM 2 software automatically converts the Mealy machine 
description to the Moore machine equivalent. 

After the conversion of state diagram to textual input is com- 
pleted, PALASM 2 software combines information about the 
control sequencing and output generation for each present 
state. These constructs have a direct relationship with the 
state diagram representations as well as the PROSE device 
architecture. This allows an integrated design where system 
design requirements represented by the state diagrams tiave a 
direct relationship with the device hardware operation. 

In the PROSE device, this combination of control sequencing 
and output generation is equivalent to the instruction op-code 
of a microprogrammable sequencer. In microprogrammable se- 
quencers, the instruction op-codes are stored as data, and are 
decoded before execution. PROSE device instructions do not 
require decoding as they are stored in the device in logic form. 
This speeds up the operation of the PROSE device. The cycle 
time of the PROSE device is 25 MHz, which is better than most 
other single-pipelined sequencers. 

Figures 7a, b, c and d also show the direct PAl^SM 2 software 
text representations of the state diagrams. These text files are 
compiled to generate the JEDEC fuse maps for programming 
the PROSE device. The procedure is identical to that used to 
program any other PLDs. PALASM 2 software also supports 
software simulation of device operation, which helps in design 
debugging before actual programming of the parts. 

The PROSE device incorporates the Diagnostics-On-Chip™ 
(DOC™) testing capability with a serial shadow register. This 
serial shadow register can be used to conduct diagnostics 
such as walking-1 or walking-0 tests. It can also be used for 
testing the PROM array by shifting in a test vector and loading 
it to the pipeline register. The pipeline register data for the next 
clock cycle is then checked for verification. Part of the DOC 
circuitry is also used for programming, but this is transparent 
to the users. 
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Summary 

With continuing advances in technology, new devices are 
being introduced which are tailored to specific system design 
requirements. The PROSE device is one such device, with an 
architecture optimized for state machine design. It combines a 
PROM array and a PAL array on a single device, and utilizes 
the functional advantages of both for control sequencing and 
conditional testing respectively. This combination offers higher 
functionality over the traditional approach of microprogramma- 
ble sequencers for state machines. It also offers a newer 
design methodology where software instructions are split into 
control sequencing and output generation instructions. These 
instructions are directly related to hardware requirements, 
unlike the instruction op-codes of traditional microprogramma- 
ble sequencers, which sometimes require design compromis- 
es. The PROSE device also offers extensive simulation, thus 
offering a complete solution. 



Rgure 8. DIP Pin Configuration 
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The PALI 6R8-1 Series from Monolithic Memories is the fastest 
TTL programmable logic family, and the ECL PAL devices from 
Monolithic Memories are the fastest programmable logic devices 
of any technology. 

Do these devices simply drop into the same applications as 
previous versions? No! These high-speed PAL devices offer 
several advantages, including speed, that open up whole new 
applications for programmable logic devices. 



The World's Fastest PAL Devices 
PALI 6R8-10 Series 

Let's look at the products under discussion. The PAL16R8-10 
Series is comprised of the four most popular PLDs in the 
industry today; the PALI 6L8, PALI 6R8, PALI 6R6, and PALI 6R4. 
The architectures (Figure 1 ) consist of sixteen inputs and eight 
outputs, with zero to eight flip-flops on the outputs. Although 
very simple compared to some of the latest high-density devices 
from Monolithic Memories, these architectures are very power- 
ful, and handle the majority of today's PLD applications. 



PAL16L8-10 



PAL16R8-10 



PAL16R6-10 



PAL16R4-10 







zB^— ID 



Figure 1. The Four Devices Comprising the PAL16R8-10 Series. Each Has Eight Product Terms per Output. 



The speed of the PAL16R8-10 Series is 10 nanoseconds (ns) 
maximum from input to output on any combinatorial path. The 
registered paths offer a 10-ns setup time and only 8 ns from 
clock input to data output. This translates into a maximum clock 
frequency (fMAX) of 1/(10 ns + 8 ns), or55.5 Megahertz (MHz), if 
external feedback is used from the flip-flop. 




It is important to differentiate this number from another common 
fMAX calculation, the maximum flip-flop toggle rate. The toggle 
rate is limited by the greater of the data setup time or the clock 
widths high and low. The toggle rate specification for the 
PAL16R8-10 Series is 62.5 MHz. However, this specification is 
unrealistic since It does provide for feedback. It is a more 
representative path that provides the 55.5-MHz specification 
(Figure 2). clk 
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fMAX no feedback; 1/(tsu + th) or 1/(twh ■*■ twi) 



flVlAX external; 1/(tCLK + tsu) 



Figure 2. Alternative Methods of Measuring fMAX. A More Realistic Path Is the Second One, Including External Feedback. 

Adapted from a paper presented at Electro/87. 
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iCL PAL Devices 

he PAL10H20P8 from Monolithic Memories was the first ECL 
AL device. This device is combinatorial, w^ith twenty inputs and 
ight outputs. A sequential device followed, the PAL10H20G8, 



with the same number of inputs and outputs, but having the 
added feature of output latches (Figure 3). 
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Figure 3. The ECL PAL Devices. On the Left Is the PAL10H20P8 Combinatorial Device, 
and on the Right Is the PAL10H20G8 Latched Device. 



The PAL10H20P8 and PAL10H20G8 have combinatorial propa- 
gation delays of only 6 ns, faster than any other PLDs. The 
latched version has a setup time of 4.5 ns and a latch-to-output 
delay of only 2.5 ns. This provides a maximum frequency of over 
142 MHz. 

History of Speed Developments 

A short history of speed improvements in bipolar PAL devices 
shows how these devices compare to earlier versions (Figure 4). 




In this graph of speed vs. power, performance characteristics 
closer to the lower left are desired. The first PAL devices, 
introduced by Monolithic Memories in the late 70s, were in the 
35-ns range, and consumed 180 mA. Technology improvements 
allowed the introduction of the first 25-ns devices in 1983. In 
early 1 985 the first 1 5-ns devices were released, with half-power 
versions at 25 ns and quarter-power versions at 35 ns. Last year 
Monolithic Memories introduced its industry-leading 10-ns 
PAL16R8-10 Series, along with the ECL devices. 



PALI6R8A-4 SERIES 
55/50 

PAL16R8B-4 SERIES 
35/55 

PAL16R8A-2 SERIES 
35/90 

PAL16R8B-2 SERIES 
25/90 

PAL16R8H-15 SERIES 
15/90 
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Figure 4. Speed vs. Power for Monolithic Memories Bipolar PAL Families. The Year of Introduction Is Noted. 
The PAL16R8-10 Series and the ECL PAL Devices Offer the Highest Performance in the Industry. 
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New High-Performance Applications 

The raw speed of these new devices allows their use in new 
high-performance applications. As microprocessors reach 20 
MHz and beyond, 50 MHz logic is required for peripheral and 
memory interface. Additionally, high-speed static RAMs 
require high speeds in the select logic. 

One application area for these high-speed PAL devices is high- 
resolution graphics, including counters for screen refresh, 
generation of line frame and video information, and micropro- 
cessor interface. High-speed digital processing requires 10-ns 
PAL devices for data pipeline applications and on-the-fly 
instruction decoding. High-speed minicomputers require very- 
high-speed state machines, and 55.5-MHz PAL devices are an 
efficient solution. 

Speeds Up Fastest Discrete Designs 

PAL devices have always been used to replace discrete logic, 
thereby reducing chip count, inventory, and board area; 



simplifying board design and layout; and adding extensi\ 
design flexibility. Now PAL device speed has broken a 
important barrier; the PAL16R8-10 Series allows PAL device 
to speed up a discrete logic design, even one built with th 
fastest available devices. 

Comparison to FAST Logic 

The following example compares the PAL16R8-10 Series t( 
FAST logic in a combinatorial discrete logic design. Thi 
design is an interface between a MC6800 microprocessor anc 
memory elements. Debouncing for the reset switch is aisc 
required. 

The discrete logic implementation is shown in Figure 5. Eight 
discrete chips are required. However, since FAST logic does 
not include 8-input AND gates, three additional chips will be 
required, making a total of eleven chips using FAST logic. The 
worst-case path delay is 27.6 ns. 




Figure 5. Memory Interface Built with Discrete Logic. Eleven 
FAST Logic Chips Are Required. The Worst-Case 
Path Delay Is 27.6 ns. 
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Figure 6 shows the same design implemented in a PAL16L8-10. 
One device replaces eleven chips, and the new worst-case path 
delay is now only 20 ns. 
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Figure 6. Same Memory Interface Implemented in a PAL 

Device. Only One Chip Is Required, and the Delay 
Is Only 20 ns Maximum. 

An example of a sequential circuit (Figure 7) is a simple dice design requires seven chips. The worst-case path for FAST logic 

game, providing two die readouts on LEDs. The discrete logic has a maximum frequency of 27.5 MHz. 




Figure 7. Simple Dice Game Constructed from Discrete 
Gates. Seven Chips Are Required, with a 
Maximum Frequency of 27.5 MHz. 
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The entire design can be implemented in one PAL16R8-10 
(Figure 8). The worst-case path has a maximum frequency of 
55.5 MHz, over twice as fast as the discrete logic version. In 
addition, power requirements remain almost the same: almost 
170 milMAmps (mA) for the FAST version, and 1 80 mA maximum 



for the PAL device. A half-power version of the PAL device will 
require 47% less power than FAST logic while still providing 
higher speed. Of course, the greatest benefit is a 7:1 chip count 
reduction, reducing size and cost while increasing reliability. 
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Figure 8. The Same Dice Game Implemented in a 

PAL16R8-10. Only One Chip Is Required, and the 
Speed Is Doubled, with Little Increase in Power 
Requirements. 



The PAL 16R8-10 Series devices can also be compared to FAST 
MSI functions on a one-to-one basis. For example, the clock-to- 
output time of a standard 74F374 octal register is 10 ns, vs. only 
8nsforthePAL16R8-10. 

A straightf onward combinatorial example is a simple 3:8 decoder 
with three-state outputs, the 74F538. This device has a maximum 
propagation delay of 1 7 ns, while the PALI 6L8-1 can implement 
the same function at only 10 ns, 41% faster. 

Of course, PAL devices add the additional capability of doing 
custom variations of these functions, if needed. For example, 
with the substitution of a registered PAL16R8-10 in the 74F538 
example, control storage functions such as load and clear can 
be added. 



Even faster discrete logic families have been announced that 
improve upon the speeds of FAST logic. Many of these families 
even use CMOS technology. But the fastest devices announced 
are at 100 MHz, equivalent to the combinatorial speed of the 
PAL16R8-10 Series. And the PAL16R8-10 Series is available 
now, providing thousands of functions through its programma- 
bility. We saw in the memory interface example that the high- 
performance logic families are not as complete as earlier 
families. PAL devices can replace this logic, or complement it 
with additional functions. 

We have shown that the PAL16R8-10 Series creates a new 
application area for programmable logic: the design of higher 
performance solutions than any discrete logic implementations. 
Speed-critical discrete logic designs are now open to replace- 
ment by programmable logic. 
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Elimination of PC Board Noise 

This capability may be arriving just in the nick of time. Discrete 
logic is today achieving speeds that challenge printed circuit 
board layout. The fast switching speeds can create noise and 
crosstalk that can bring the high-performance board crashing 
down. Note the new high-speed CMOS logic families that 
require nev/ pinouts and extra grounds because of switching 
noise. 

Integrating this logic into programmable logic eliminates not 
just discrete chips but all those noisy and troublesome connec- 
tions between them. Putting the PC traces into silicon eliminates 
crosstalk and signal distorlions caused by transmission line 
reflections, and improves the reliability of the board. In high- 
speed systems, the length of the interconnect can approach the 
signal wavelength, causing ringing in the connection line. This 
ringing can cause a false signal value to appear on an input tied 
to the ringing line, or can even be fed back into the source device 
if feedback is used from the output. 



Improved Speed 

The reduction in interconnect afforded by PAL devices, with the 
accompanying reduction in inductance and time delays, even 
increases system speed. In high-speed logic systems, each 
1 2-inch length of interconnect wiring introduces a time delay of 
approximately 2 ns, about an entire gate delay. These delays 
slow down the system and can cause glitches and signal errors. 
Thus, integration of high-speed logic into PAL devices improves 
not just reliability, but also speed. 



Technology Advantages 

Two other advantages of the PALI 6R8-1 Series, lower cost and 
reduced signal noise, are a result of the technology used to 
achieve its high speed. Speed improvements in semiconductor 
devices come about primarily through size reductions and 
reduced capacitance. Capacitance, since it stores charge, slows 
down devices as they have to charge and discharge capacitive 
elements. 

The 25-ns devices were achieved through the reduction of 
lateral geometries in a junction-isolated process. This produced 
lowrercollector-base capacitance and therefore provided higherspeed. 
The 1 5-ns devices used the same junction-isolated process, but 
a reduced epitaxial layer produced the higher speeds. 



Oxide-Isolated Process 

For the 10-ns devices, Monolithic Memories used a completely 
new oxide-isolated process to increase the speed to record 
levels. Oxide-isolation allows the transistors to be placed closer 
together (Figure 9). This greatly reduces lateral geometries 
while reducing the parasitic capacitance between circuit ele- 
ments, providing much higher speed. 




p+ 
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Figure 9. Basic Transistor Isolation Structure in Oxide- 
Isolated Process. A Fully Recessed Oxide Region 
Separates Transistors, Allowing Smaller 
Geometries and Higher Speed. 

The process utilizes a walled-base technology. It features 1.5- 
micron design rules and dual-layer metallization. The isolation 
technology uses a combination of high-pressure oxidation to 
form a fully-recessed oxide through the epitaxial layer, and a 
diffused guard band. This technology avoids any contact 
between the channel stop and the buried layer. 

Unique Needs of Programmable Logic 

Programmable logic devices have unique process needs because 
of the high voltages incurred during programming. The oxide- 
isolation technique is especially useful because of its excellent 
insensitivity to substrate injection, which occurs when small 
transistors are driven into hard saturation during programming. 

Lower Manufacturing Costs 

Another requirement for programmable products is a good 
high-current beta so that emitter size can be minimized. The 
NPN transistors in this process have been measured to have a 
beta of higher than 20 at an emitter current of 30 mA. In addition, 
a reduction in the required fusing current allowed smaller transis- 
tors in the programming array. Separation of the current sources 
for programming and normal operation improves the speed- 
power product. These features add reliability to the product 
while reducing die size. 

High Yield 

The walled diffusions improve yield by reducing manufacturing 
tolerance requirements. In addition, the process Is Insensitive to 
threshold problems. The result of these techniques is a high- 
yielding, reliable process that produces both the 10-ns TTL PAL 
devices and the 6-ns ECL PAL devices. 

Smaller die size and high yields result in a much lower cost of 
manufacturing. Thus, once the process is mature, it can produce 
faster products at the same or less cost than earlier technologies. 
This can be seen with the 15-ns devices, which now are cost- 
competitive with any other technology, including CMOS. 

Many users of ICs claim that it is impossible tofind products that 
combine three elusive qualities: good, fast, and cheap. The 
advanced technology used in the high-speed PAL products 
allows them to fit'all three requirements well. 
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Reduced Noise 

The process also provides other important advantages, lower 
capacitance and reduced noise. An enhanced base doping 
allows lower resistance without higher capacitance. Oxide- 
isolated transistors provide much lower input and output 
capacitance levels than that provided by earlidr technologies. 
The input capacitance is typically only 2 picoFarads (pF), while 
the output capacitance is only 4 pF. 

Elimination of False Logic Switching 

Lower capacitance allows your system to operate with cleaner 
switching signals between devices, providing a much more 
reliable system. This is critical in high-speed systems where fast 
rise and fall times can cause signal fluctuations. 

Guard rings limit the susceptibility of the inputs and outputs to 
voltage undershoots and overshoots. Bipolar Schottky circuitry 



is designed with nonlinear input and output impedances. I 
high-speed systems, undershoots in particular can play havo 
with false logic states, if the guard rings do not prevent them. 

New Synchronization Applications 

Another key advantage of the PAL1 6R8-10 Series, only indirecti 
related to speed, is its excellent resistance to metastability 
Metastability is a failure that occurs when required minimun 
setup times are violated. The PALI 6R8-10 Series is less suscep 
tible to metastability, and recovers faster, than any other PLC 
and most other forms of logic. This subject is discussed in detai 
in Session 3 of Electro/87 on Metastability. 

Metastability characteristics are explicitly specified in the 
PAL16R8-10 Series datasheet (Figure 10). For example, at 21- 
MHz clock frequency and 7-MHz data input frequency, a 
metastability failure will occur only once in ten years. 



Metastability Characteristics over operating conditions 
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Figure 10. Guaranteed Metastability Characteristics for the 
PAL16R8-10 Series. The High Resistance to 
Metastability Allows New Use in Synchronization 
Applications. 



Thus, a PAL16R8-10 Series device can be used as a synchronizer 
circuit, a new application that doesn't even require its program- 
mability. Almost every digital system must at some point handle 
incoming signals that are not synchronized to the system clock 
rate, and any such system will encounter metastability. Using 
the PAL16R8-10 Series devices at the interface provides the 
lowest possible failure rate. 

Lower Metastability Means Higher 
Speed 

For applications using the PALI 6R8-1 Series as a synchronizer, 
the lower metastability rate means higher clock speed. For a 
given desired MTBF, the maximum clock rate is limited by the 
susceptibility to metastability failure. The reduced metastability 
rate for the PALI 6R8-1 Series allows a higher clock speed, with 
no reduction in MTBF. 

ECL Provides Highest Speed 

The two ECL PAL devices, the PAL10H20P8 and PAL10H20G8, 
provide the fastest PLDs. 10KH ECL technology also eliminates 



many of the noise problems encountered with high-speed TTL 
boards since the internal switching voltage range is less than 
one volt, and it is often a good alternative to high-speed TTL. 
ECL design techniques achieve high speed by not allowing the 
bipolar transistors to saturate when they are on. Thus, there is no 
storage time delay caused by switching in and out of saturation. 

The elimination of saturated transistors means that current is 
always flowing in ECL devices. This is the cause of high power 
consumption in ECL circuits. However, the ECL PAL devices 
consume only 210 and 225 mA for the PAL10H20P8 and 
PAL10H20G8, respectively. This is only minimally higher than 
the standard power consumption of TTL programmable logic 
circuits. 



New PLD Applications in ECL Systems 

However, these devices do not just speed up programmable 
logic. These devices are fully 10KH compatible and meant to be 
used in 10KH systems. They open up new applications for 
programmable logic in the highest performance ECL systems. 



2-736 



£1 



Monollthlo 



m 



Memories 



^ 



Blazing Fast PAL Devices Enable New Application Areas 



:CL MSI/LSI Functions 

Jince the ECL market is smaller than the TTL market, fewer 
unctions are available in ECL than TTL. This is true on all levels, 
rom SSI and MSI through VLSI. ECL PAL devices provide a 
lexible solution that can implement functions not found in fixed 
ECL logic. Future ECL PAL devices will provide higher density 
ind becapableof implementing the most complex LSI functions. 

Systems are built with ECL technology primarily for high 
performance, and ECL PAL devices must also be high perfor- 
nance. Future ECL PAL devices will improve speeds down to 
;he 3 to 4-ns range and be able to compete effectively with the 
speed of fixed ECL logic. 



The Speed Challenge of CMOS 
Technology 

CMOS technology is quickly encroaching on bipolar speeds. In 
fact. Monolithic Memories has CMOS mask-programmable 
devices (ZHAL'" devices) that are even faster than their bipolar 
equivalents. However, CMOS has not yet reached the best 
bipolar speeds, and is not likely to in the near future. Even if we 
exclude ECL from the comparisons, bipolar has a two-generation 
speed advantage over CMOS technology. 

The fastest CMOS PLDs available today are at 25 ns, while 25-ns 
bipolar devices have been available for years and are currently 
being shipped in huge volumes. 15-ns CMOS is in development, 
but bipolar is already at 1 ns and moving further down the scale. 

Although recent CMOS developments have been rapid, CMOS 
is not expected to pass bipolar speeds, only to get closer. As 
geometries are reduced, CMOS has to contend with short- 
channel effects and hot-electron trapping as scaling increases 
the electric field. Bipolar has fewer problems during reduced 
scaling. 



Speed vs. Density 

Many vendors of high-density CMOS PLDs claim that they have 
surpassed bipolar speeds if examined at a per-gate basis. That 
claim is relevant only if the high densities are required. Even gate 
array applications are relatively low in density, with over half 
below the 2000-gate level. Additionally, the largest application 
area for semicustom arrays is random logic, efficiently met by 
the PAL architecture. And even in those cases where greater 
density than a single PAL device is required, two PAL16R8-10 
Series devices back-to-back have a combined delay of only 
20 ns, still faster than any CMOS PLD of any density. 

Some industry observers have raised an issue of speed vs. 
density, asking which one will win out in the long run. But these 
two attributes do not compete; they are usually required at 
different times in different applications, and thus both areas will 
develop in their own market areas. 



Gate Array Performance 

Gate arrays and other forms of semicustom logic are demon- 
strating very high performance, even in CMOS. The sub- 
nanosecond gate delays of these devices, even when input and 
output delays are taken into account, cannot be matched by 
programmable logic. For the ultimate in speed, a more custom- 
ized design is better, with full custom offering the highest speed. 

Programmable logic competes with other high-performance 
semicustom solutions in applications where the additional 
advantages of zero NRE, instant customization, ease of design 
changes, inexpensive and easy-to-use software, standardization, 
low cost, second sourcing, and other issues weigh in favor of 
programmable logic. 

Studies of gate arrays show that clock frequencies for systems 
using gate arrays are about equal to microprocessor frequen- 
cies; around 8-12 MHz, moving to 16 MHz. But further studies 
show that 76% of users require at least one very-high-speed 
(twice the clock frequency) combinatorial path through the chip. 

Since programmable logic offers only one speed through the 
chip, the device speed must match the speed of the fastest 
required path. This is one reason why high-speed PAL devices 
are required even if system clock rates are low. 

CMOS PLDs can be erasable, a feature bipolar cannot match. 
Although not erased very often in customer applications, 
erasability adds to the testability of the device during manufac- 
turing. Also, CMOS programmable cells are smaller than bipolar 
fuses, allowing a smaller die size. These advantages of CMOS 
will allow it to dominate, in the long term, the PLD market areas 
that do not require high speed. 

Usable Speed 

Just as semicustom arrays offer a greater percentage of "usable 
gates" than PLDs, or gates used vs. gates provided, PLDs can 
offer more "usable speed" than semicustom arrays. Although 
typical gate speeds in custom devices can be below 1 ns, that 
speed is not relevant if the design and simulation tools do not 
provide the accuracy required to guarantee proper timing. 

Since most of a PAL device is fixed logic, guaranteed maximum 
delays through the device can be specified for any custom 
design. In addition, minimum delays for the PALI 6R8-10 Series 
are specified in the datasheet for the first time, allowing accurate 
system timing. Tools such as PLDMaster from Daisy Systems 
provide PAL device simulation in a board-level environment. 
PLDMaster can even help partition designs to take full advan- 
tage of device speeds. 

The PAL device has another advantage over other custom logic 
that provides the ultimate in timing simulation— a device can be 
programmed and immediately dropped into a circuit, to check 
timing. All of these advantages mean that designers can use 
more of the PAL device's speed than that of other custom logic, 
giving a PAL device an edge even at equivalent specs. 
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CMOS Advantages 

CMOS has several advantages over bipolar, depending on the 
technology used. In fact, CMOS technology has opened up 
more new applications for programmable logic than any other 
development, because of several unique characteristics. It can 
be lower in power, can be erasable, and can be srrtaller. Some of 
these advantages are important in some cases but not important 
in others. 

Low power is an important advantage, but Monolithic Memories 
has been producing half- and quarter-power versions of its 
products in bipolar, simply by increasing the resistor values. Our 
PALI 6R8B-4 Series devices at 35 ns and only 55 mA are lower in 
power than many competing CMOS devices at similar speeds. 
The upcoming PAL16R8H-15 Series at 15 ns and 90 mA will 
achieve even greater efficiencies. As top bipolar speeds improve, 
the half-power and quarter-power versions will continue at the 
highest-performance CMOS speed levels. 

CMOS power specifications are often only stated at standby, 
with the device not operating. Operating power for CMOS rises 
dramatically with increasing input frequency and can surpass 
even medium-power bipolar devices (Figure 1 1 ). For high-speed 
applications. CMOS has not shown a power advantage over 
bipolar. 



POWER 
CONSUMPTION 




HALF 
POWER 
BIPOLAR 

QUARTER 
POWER 
BIPOLAR 



OPERATING FREQUENCY 

Figure 11. Power Consumption vs. Operating Frequency for 
CMOS and Low-Power Bipolar PAL Devices. 
CMOS Devices Can Consume as Much Power as 
Bipolar Circuits at High Speeds. Thus, for High- 
Speed Applications, CMOS Loses One of Its Key 
Advantages Over Bipolar. 



Bipolar's Place in Market 

BipolarTTL and ECL PAL devices will continue to dominate th 
large part of the PLD market that requires high speed. Bipola 
will even be dominant in the low-cost part of the market for year 
to come. 

Record Speeds and New Applications 

The 10-ns PAL16R8-10 Series and the 6-ns ECL PAL device; 
provide record speeds for programmable logic, using advancec 
process technology. Future plans call for 7.5-ns TTL and 3 tc 
4-ns ECL PAL devices. The speed and technology open nev 
applications for PAL devices, including: 

• Highest-performance computers and digital processing 

• High-speed microprocessor and memory support 

• Speeding up the fastest discrete logic designs 

• Component integration, eliminating noise and delays from 
device interconnect lines 

• Process technology advantages of lower cost and reduced 
noise 

• Wide range of LSI functions to complement high-speed TTL 
and ECL discrete logic families 

• Synchronization circuits for low metastability rates 

All of these new applications show that these new PAL devices 
are not just faster devices for the same sockets, but rather open 
up whole new applications for programmable logic devices. 
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TLX RS55650 MMI RS 

SWEDEN. 

Stockholm TEL (08) 733 03 50 

FAX (08) 733 22 85 

TLX 11602 

TAIWAN TLX 886-2-7122066 

FAX 886-2-7122017 

UNITED KINGDOM, 

Manchester area TEL (0925) 828008 

FAX (0925) 827693 

TLX 628524 

London area TEL (04862) 22121 

FAX (0483) 756196 

TLX 859103 

North American Representatives 

CANADA 
Burnaby, B.C. 

DAVETEK MARKETING (604) 430-3680 

Calgary. Alberta 

VITEL ELECTRONICS (403) 278-5833 

Kanata, Ontario 

VITEL ELECTRONICS (613) 592-0090 

Mississauga. Ontario. 

VITEL ELECTRONICS (416) 676-9720 

Quebec 

VITEL ELECTRONICS (514) 636-5951 

IDAHO 

INTERMOUNTAIN TECH MKGT (208) 888-6071 

INDIANA 

ELECTRONIC MARKETING 

CONSULTANTS. INC (317) 253-1668 

IOWA 

LORENZ SALES (319) 377-4666 

KANSAS 
Merriam 

LORENZ SALES (913) 384-6556 

Wichita 

LORENZ SALES (316) 721-0500 

KENTUCKY 

ELECTRONIC MARKETING 

CONSULTANTS, INC (317) 253-1668 

MICHIGAN 

MIKE RAICK ASSOCIATES (313) 644-5040 

MISSOURI 

LORENZ SALES (314) 997-4558 

NEBRASKA 

LORENZ SALES (402) 475-4660 

NEW MEXICO 

THORSON DESERT STATES (505) 293-8555 

NEW YORK 

NYCOM. INC (315) 437-8343 

OHIO 
Centerville 

DOLFUSS ROOT & CO (513) 

Columbus 

DOLFUSS ROOT & CO (614) 

Strongsville 

DOLFUSS ROOT & CO (216) 

PENNSYLVANIA 

DOLFUSS ROOT 4 CO (412) 
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433-6776 
885-4844 
238-0300 
221-4420 
R=* MARKETING (801) 595-0631 



Advanced Micro Devices reserves the right to make changes in its product without notice in order to improve design or performance characteristics. The performance 
characteristics listed in this document are guaranteed t>y specific tests, guard banding, design and other practices common to the industry. For specifk: testing details, 
contact your tocal AMD sales representative. The company assumes no responsit>illty for the use of any circuits described herein. 
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O-pin Combinatorial 7TLJCWIOS PAL Devices 








PRODUCT 


PINS 


TECHNOLOGY 


tpD{ns) 


lcc(mA) 


DESCRIPTION 


PAGE 


'AmPALISPB 


20 


TTL 


15,25 


55, 90, 180 


20-pin superset 


5-202 


PAL16P8 


20 


TTL 


25 


180 


Programmable polarity 


5-17 


Am/PAL16L8 


20 


TTL, E CMOS 


10, 15,25,35,55 


0.1,45,90, 180 


Standard 


5-26,5-184 


■PAL10H8 


20 


TTL 


35 


90 


Simple combinatorial 


5-56 


!pAL10L8 


20 


TTL 


35 


90 


Simple combinatorial 


5-56 


|PAL12H6 


20 


TTL 


35 


90 


Simple combinatorial 


5-56 


PAL12L6 


20 


TTL 


35 


90 


Simple combinatorial 


5-56 


IPAL14H4 


20 


TTL 


35 


90 


Simple combinatorial 


5-56 


:PAL14L4 


20 


TTL 


35 


90 


Simple combinatorial 


5-56 


iPAL16H2 


20 


TTL 


35 


90 


Simple combinatorial 


5-56 


PAL16L2 


20 


TTL 


35 


90 


Simple combinatorial 


5-56 


PAL16C1 


20 


TTL 


40 





Simple combinatorial 


5-56 



20-pin Registered TTL/CWIOS PAL Devices 



1 PRODUCT 


PINS 


TECHNOLOGY 


fMAxC^H^) 


lcc(mA) 


DESCRIPTION 


PAGE 


AmPAL23S8 


20 


TTL 


33, 28.5 


200 


Buried registers 


5-169 


PAL16RAB 


20 


TTL 


20 


170 


Asynchronous 


5-11 


PAL16RP8 


20 


TTL 


25 


180 


Programmable polarity 


5-17 


PAL16RP6 


20 


TTL 


25 


180 


Programmable polarity 


5-17 


PAL16RP4 


20 


TTL 


25 


180 


Programmable polarity 


5-17 


Am/PAL16R8 


20 


TTL. E CMOS 


58,40,28.5,18, 11 


0.1,45,90,180 


Standard 


5-26,5-184 


Am/PAL16R6 


20 


TTL, E CMOS 


58.40,28.5,18, 11 


0.1,45,90,180 


Standard 


5-26,5-184 


Am/PAL16R4 


20 


TTL. E CMOS 


58.40,28.5,18, 11 


0.1,45,90,180 


Standard 


5-26,5-184 


PALI 6X4 


20 


TTL 


14 


225 


Arithmetic 


5-51 



Sequencers 



PRODUCT 


PINS 

24 


TECHNOLOGY 


^.lAxC^IH^) 


lcc(mA) 


DESCRIPTION 


PAGE 


PMS14R21 


TTL 


30.25 


210 


PROSE Sequencer 


5-315 


PLS167 


24 


TTL 


33 


200 


Programmable Logic Seq. 


5-331 


PLS168 


24 


TTL 


33 


200 


Programmable Logic Seq. 


5-331 


PLS105 


24 


TTL 


37 


200 


Programmable Logic Seq. 


5-331 


Am29PL141 


28 


TTL 


20 


450 


Fuse Prog. Controller 


5-339 


Am2971 


24 


TTL 


85 


310 


Prog. Event Generalor 


5-365 



ECL PAL Devices 



PRODUCT 


PINS 


TECHNOLOGY 


'max''*pd 


Icc{mA) 


DESCRIPTION 


PAGE 


PAL10H20EV/EG8 
PAL10020EV/EG8 
PAL10H20G8 
PAL10H20P8 


24 
24 
24 
24 


ECL10KH 
ECL100K 
ECL10KH 
ECL10KH 


125 MHz 
125 MHz 

6 ns 

6 ns 


220 
220 
225 
210 


Registered/latched 
Registered/latched 
Latched 
Combinatorial 


5-381 
5-381 
5-382 
5-386 



Logic Cell Array 



PRODUCT 


PINS 


TECHNOLOGY 


f,,Ax{MHz) 


lcc(mA) 


DESCRIPTION 


PAGE 


M2064 
M2018 


48,68 
68,84 


RAM CMOS 
RAM CMOS 


70, 50. 33 
50,33 


5 
5 


Logic Cell Array 
Logic Cell Array 


5-483 
5-483 
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